AI RESEARCH PAPERS & ACADEMIC SOURCES
- Botany-Bot: Digital Twin Monitoring of Occluded and Underleaf Plant Structures with Gaussian Splats
- Dress Well via Fashion Cognitive Learning
- Privacy-Preserving Visual Localization with Event Cameras
- Limitations of Data-Driven Spectral Reconstruction -- An Optics-Aware Analysis
- FireANTs: Adaptive Riemannian Optimization for Multi-Scale Diffeomorphic Matching
- Text-controlled Motion Mamba: Text-Instructed Temporal Grounding of Human Motion
- Enhancing Test Time Adaptation with Few-shot Guidance
- Large Language Model-Guided Semantic Alignment for Human Activity Recognition
- Improvement of Spiking Neural Network with Bit Planes and Color Models
- VisualLens: Personalization through Task-Agnostic Visual History
- SoPo: Text-to-Motion Generation Using Semi-Online Preference Optimization
- FairGen: Enhancing Fairness in Text-to-Image Diffusion Models via Self-Discovering Latent Directions
- NanoHTNet: Nano Human Topology Network for Efficient 3D Human Pose Estimation
- DynVFX: Augmenting Real Videos with Dynamic Content
- Dual Caption Preference Optimization for Diffusion Models
- Indoor Heat Estimation from a Single Visible-Light Panorama
- ELIP: Enhanced Visual-Language Foundation Models for Image Retrieval
- Rethinking Multimodal Learning from the Perspective of Mitigating Classification Ability Disproportion
- Cutting-edge 3D reconstruction solutions for underwater coral reef images: A review and comparison
- Leveraging Vision-Language Models for Open-Vocabulary Instance Segmentation and Tracking
- Morpheus: Benchmarking Physical Reasoning of Video Generative Models with Real Physical Experiments
- Hierarchical Feature Learning for Medical Point Clouds via State Space Model
- Generate, but Verify: Reducing Hallucination in Vision-Language Models with Retrospective Resampling
- SSL4Eco: A Global Seasonal Dataset for Geospatial Foundation Models in Ecology
- Is Artificial Intelligence Generated Image Detection a Solved Problem?
- UniCTokens: Boosting Personalized Understanding and Generation via Unified Concept Tokens
- GMatch: A Lightweight, Geometry-Constrained Keypoint Matcher for Zero-Shot 6DoF Pose Estimation in Robotic Grasp Tasks
- Styl3R: Instant 3D Stylized Reconstruction for Arbitrary Scenes and Styles
- Hierarchical Material Recognition from Local Appearance
- Grounded Reinforcement Learning for Visual Reasoning
- CReFT-CAD: Boosting Orthographic Projection Reasoning for CAD via Reinforcement Fine-Tuning
- Reasoning-Aligned Perception Decoupling for Scalable Multi-modal Reasoning
- Consistent Story Generation: Unlocking the Potential of Zigzag Sampling
- GeoCAD: Local Geometry-Controllable CAD Generation with Large Language Models
- G$^{2}$D: Boosting Multimodal Learning with Gradient-Guided Distillation
- HOI-Dyn: Learning Interaction Dynamics for Human-Object Motion Diffusion
- Advancing Complex Wide-Area Scene Understanding with Hierarchical Coresets Selection
- Attention (as Discrete-Time Markov) Chains
- Adaptive Convolutional Neural Network for Image Super-resolution
- Principled Feature Disentanglement for High-Fidelity Unified Brain MRI Synthesis
- A Synthetic Data-Driven Radiology Foundation Model for Pan-tumor Clinical Diagnosis
- Geodesic Diffusion Models for Efficient Medical Image Enhancement
- Nexus: An Omni-Perceptive And -Interactive Model for Language, Audio, And Vision
- Real-time Spatial-temporal Traversability Assessment via Feature-based Sparse Gaussian Process
- EvidMTL: Evidential Multi-Task Learning for Uncertainty-Aware Semantic Surface Mapping from Monocular RGB Images
- Exploring the Limits of Vision-Language-Action Manipulations in Cross-task Generalization
- SpectraLift: Physics-Guided Spectral-Inversion Network for Self-Supervised Hyperspectral Image Super-Resolution
- Real-Time World Crafting: Generating Structured Game Behaviors from Natural Language with Large Language Models
- $\mathcal{V}isi\mathcal{P}runer$: Decoding Discontinuous Cross-Modal Dynamics for Efficient Multimodal LLMs
- DELULU: Discriminative Embedding Learning Using Latent Units for Speaker-Aware Self-Supervised Speech Foundational Model
- UltraCUA: A Foundation Model for Computer Use Agents with Hybrid Action
- Consistency is Key: Disentangling Label Variation in Natural Language Processing with Intra-Annotator Agreement
- Synthetic Dataset for Evaluating Complex Compositional Knowledge for Natural Language Inference
- LEME: Open Large Language Models for Ophthalmology with Advanced Reasoning and Clinical Validation
- A Knapsack by Any Other Name: Presentation impacts LLM performance on NP-hard problems
- Automated Evaluation of Meter and Rhyme in Russian Generative and Human-Authored Poetry
- Leveraging Robust Optimization for LLM Alignment under Distribution Shifts
- Thinking Out Loud: Do Reasoning Models Know When They're Right?
- Understanding LLMs' Cross-Lingual Context Retrieval: How Good It Is And Where It Comes From
- HCR-Reasoner: Synergizing Large Language Models and Theory for Human-like Causal Reasoning
- MedScore: Generalizable Factuality Evaluation of Free-Form Medical Answers by Domain-adapted Claim Decomposition and Verification
- Unifying Attention Heads and Task Vectors via Hidden State Geometry in In-Context Learning
- Grounding Language with Vision: A Conditional Mutual Information Calibrated Decoding Strategy for Reducing Hallucinations in LVLMs
- A*-Thought: Efficient Reasoning via Bidirectional Compression for Low-Resource Settings
- A Controllable Examination for Long-Context Language Models
- KG-Infused RAG: Augmenting Corpus-Based RAG with External Knowledge Graphs
- AnTKV: Anchor Token-Aware Sub-Bit Vector Quantization for KV Cache in Large Language Models
- Compressed and Smooth Latent Space for Text Diffusion Modeling
- Value-Based Large Language Model Agent Simulation for Mutual Evaluation of Trust and Interpersonal Closeness
- A social context-aware graph-based multimodal attentive learning framework for disaster content classification during emergencies: a benchmark dataset and method
- Adaptive Data-Resilient Multi-Modal Hierarchical Multi-Label Book Genre Identification
- Video-SafetyBench: A Benchmark for Safety Evaluation of Video LVLMs
- MiLQ: Benchmarking IR Models for Bilingual Web Search with Mixed Language Queries
- Humanity's Last Code Exam: Can Advanced LLMs Conquer Human's Hardest Code Competition?
- ASCD: Attention-Steerable Contrastive Decoding for Reducing Hallucination in MLLM
- MotionGPT3: Human Motion as a Second Modality
- CrossRay3D: Geometry and Distribution Guidance for Efficient Multimodal 3D Detection
- IAD-GPT: Advancing Visual Knowledge in Multimodal Large Language Model for Industrial Anomaly Detection
- StripRFNet: A Strip Receptive Field and Shape-Aware Network for Road Damage Detection
- ObjectTransforms for Uncertainty Quantification and Reduction in Vision-Based Perception for Autonomous Vehicles
- C-arm Guidance: A Self-supervised Approach To Automated Positioning During Stroke Thrombectomy
- DuetMatch: Harmonizing Semi-Supervised Brain MRI Segmentation via Decoupled Branch Optimization
- Automated C-Arm Positioning via Conformal Landmark Localization
- Cost Savings from Automatic Quality Assessment of Generated Images
- Data-Centric AI for Tropical Agricultural Mapping: Challenges, Strategies and Scalable Solutions
- StretchySnake: Flexible SSM Training Unlocks Action Recognition Across Spatio-Temporal Scales
- VM-BeautyNet: A Synergistic Ensemble of Vision Transformer and Mamba for Facial Beauty Prediction
- Designing a Convolutional Neural Network for High-Accuracy Oral Cavity Squamous Cell Carcinoma (OCSCC) Detection
- Embody 3D: A Large-scale Multimodal Motion and Behavior Dataset
- Proactive Scene Decomposition and Reconstruction
- Stroke2Sketch: Harnessing Stroke Attributes for Training-Free Sketch Generation
- Scaling Laws for Deepfake Detection
- Scale-DiT: Ultra-High-Resolution Image Generation with Hierarchical Local Attention
- TokenAR: Multiple Subject Generation via Autoregressive Token-level enhancement
- On the Provable Importance of Gradients for Language-Assisted Image Clustering
- MIRAD - A comprehensive real-world robust anomaly detection dataset for Mass Individualization
- Demeter: A Parametric Model of Crop Plant Morphology from the Real World
- REALM: An MLLM-Agent Framework for Open World 3D Reasoning Segmentation and Editing on Gaussian Splatting
- LightGlueStick: a Fast and Robust Glue for Joint Point-Line Matching
- RefAtomNet++: Advancing Referring Atomic Video Action Recognition using Semantic Retrieval based Multi-Trajectory Mamba
- Enhancing Rotated Object Detection via Anisotropic Gaussian Bounding Box and Bhattacharyya Distance
- Instance-Aware Pseudo-Labeling and Class-Focused Contrastive Learning for Weakly Supervised Domain Adaptive Segmentation of Electron Microscopy
- NavQ: Learning a Q-Model for Foresighted Vision-and-Language Navigation
- HGC-Avatar: Hierarchical Gaussian Compression for Streamable Dynamic 3D Avatars
- PRISMM-Bench: A Benchmark of Peer-Review Grounded Multimodal Inconsistencies
- OOS-DSD: Improving Out-of-stock Detection in Retail Images using Auxiliary Tasks
- Fit for Purpose? Deepfake Detection in the Real World
- VisionSelector: End-to-End Learnable Visual Token Compression for Efficient Multimodal LLMs
- Self-Supervised Learning to Fly using Efficient Semantic Segmentation and Metric Depth Estimation for Low-Cost Autonomous UAVs
- MultiVerse: A Multi-Turn Conversation Benchmark for Evaluating Large Vision and Language Models
- HYDRA: HYbrid knowledge Distillation and spectral Reconstruction Algorithm for high channel hyperspectral camera applications
- SDPA++: A General Framework for Self-Supervised Denoising with Patch Aggregation
- Vision-Centric 4D Occupancy Forecasting and Planning via Implicit Residual World Models
- UKANFormer: Noise-Robust Semantic Segmentation for Coral Reef Mapping via a Kolmogorov-Arnold Network-Transformer Hybrid
- A Comprehensive Survey on World Models for Embodied AI
- Visual Autoregressive Models Beat Diffusion Models on Inference Time Scaling
- WaMaIR: Image Restoration via Multiscale Wavelet Convolutions and Mamba-based Channel Modeling with Texture Enhancement
- GS2POSE: Marry Gaussian Splatting to 6D Object Pose Estimation
- Segmentation as A Plug-and-Play Capability for Frozen Multimodal LLMs
- Unsupervised Monocular Road Segmentation for Autonomous Driving via Scene Geometry
- Personalized Image Filter: Mastering Your Photographic Style
- An RGB-D Image Dataset for Lychee Detection and Maturity Classification for Robotic Harvesting
- Robust Cross-Domain Adaptation in Texture Features Transferring for Wood Chip Moisture Content Prediction
- From Mannequin to Human: A Pose-Aware and Identity-Preserving Video Generation Framework for Lifelike Clothing Display
- 2DGS-R: Revisiting the Normal Consistency Regularization in 2D Gaussian Splatting
- BARL: Bilateral Alignment in Representation and Label Spaces for Semi-Supervised Volumetric Medical Image Segmentation
- Registration is a Powerful Rotation-Invariance Learner for 3D Anomaly Detection
- Uncovering Brain-Like Hierarchical Patterns in Vision-Language Models through fMRI-Based Neural Encoding
- Class-N-Diff: Classification-Induced Diffusion Model Can Make Fair Skin Cancer Diagnosis
- Uniworld-V2: Reinforce Image Editing with Diffusion Negative-aware Finetuning and MLLM Implicit Feedback
- Contrail-to-Flight Attribution Using Ground Visible Cameras and Flight Surveillance Data
- Beyond RGB: Leveraging Vision Transformers for Thermal Weapon Segmentation
- Training-free Online Video Step Grounding
- An empirical study of the effect of video encoders on Temporal Video Grounding
- Do Satellite Tasks Need Special Pretraining?
- Enrich and Detect: Video Temporal Grounding with Multimodal LLMs
- Where, Not What: Compelling Video LLMs to Learn Geometric Causality for 3D-Grounding
- Conditional Synthetic Live and Spoof Fingerprint Generation
- Click, Predict, Trust: Clinician-in-the-Loop AI Segmentation for Lung Cancer CT-Based Prognosis within the Knowledge-to-Action Framework
- Person Re-Identification via Generalized Class Prototypes
- How Universal Are SAM2 Features?
- ProDAT: Progressive Density-Aware Tail-Drop for Point Cloud Coding
- Towards a Generalizable Fusion Architecture for Multimodal Object Detection
- GSPlane: Concise and Accurate Planar Reconstruction via Structured Representation
- Boosting Fidelity for Pre-Trained-Diffusion-Based Low-Light Image Enhancement via Condition Refinement
- Towards Imperceptible Watermarking Via Environment Illumination for Consumer Cameras
- KineDiff3D: Kinematic-Aware Diffusion for Category-Level Articulated Object Shape Reconstruction and Generation
- Investigating Adversarial Robustness against Preprocessing used in Blackbox Face Recognition
- Generation then Reconstruction: Accelerating Masked Autoregressive Models via Two-Stage Sampling
- Capturing Head Avatar with Hand Contacts from a Monocular Video
- HIDISC: A Hyperbolic Framework for Domain Generalization with Generalized Category Discovery
- EndoCIL: A Class-Incremental Learning Framework for Endoscopic Image Classification
- Optimizing DINOv2 with Registers for Face Anti-Spoofing
- Enhanced Motion Forecasting with Plug-and-Play Multimodal Large Language Models
- SG-CLDFF: A Novel Framework for Automated White Blood Cell Classification and Segmentation
- Machine Vision-Based Surgical Lighting System:Design and Implementation
- Exploring Structural Degradation in Dense Representations for Self-supervised Learning
- LongInsightBench: A Comprehensive Benchmark for Evaluating Omni-Modal Models on Human-Centric Long-Video Understanding
- CausalMamba: Scalable Conditional State Space Models for Neural Causal Inference
- A Single Set of Adversarial Clothes Breaks Multiple Defense Methods in the Physical World
- iDETEX: Empowering MLLMs for Intelligent DETailed EXplainable IQA
- Nearest-Class Mean and Logits Agreement for Wildlife Open-Set Recognition
- Exploring The Missing Semantics In Event Modality
- Beyond Real Faces: Synthetic Datasets Can Achieve Reliable Recognition Performance without Privacy Compromise
- Facial Expression-based Parkinson's Disease Severity Diagnosis via Feature Fusion and Adaptive Class Balancing
- Closed-Loop Transfer for Weakly-supervised Affordance Grounding
- Monitoring Horses in Stalls: From Object to Event Detection
- DeepDetect: Learning All-in-One Dense Keypoints
- Leveraging AV1 motion vectors for Fast and Dense Feature Matching
- Rethinking Nighttime Image Deraining via Learnable Color Space Transformation
- Initialize to Generalize: A Stronger Initialization Pipeline for Sparse-View 3DGS
- Split-Fuse-Transport: Annotation-Free Saliency via Dual Clustering and Optimal Transport Alignment
- WP-CrackNet: A Collaborative Adversarial Learning Framework for End-to-End Weakly-Supervised Road Crack Detection
- PAGE-4D: Disentangled Pose and Geometry Estimation for 4D Perception
- Expose Camouflage in the Water: Underwater Camouflaged Instance Segmentation and Dataset
- ShapeCraft: LLM Agents for Structured, Textured and Interactive 3D Modeling
- Integrating BIM and UAV-based photogrammetry for Automated 3D Structure Model Segmentation
- One Dinomaly2 Detect Them All: A Unified Framework for Full-Spectrum Unsupervised Anomaly Detection
- Self-supervised Pre-training for Mapping of Archaeological Stone Wall in Historic Landscapes Using High-Resolution DEM Derivatives
- 4DSegStreamer: Streaming 4D Panoptic Segmentation via Dual Threads
- Towards 3D Objectness Learning in an Open World
- Elastic ViTs from Pretrained Models without Retraining
- Automatic Classification of Circulating Blood Cell Clusters based on Multi-channel Flow Cytometry Imaging
- Raindrop GS: A Benchmark for 3D Gaussian Splatting under Raindrop Conditions
- Can Image-To-Video Models Simulate Pedestrian Dynamics?
- Joint Multi-Condition Representation Modelling via Matrix Factorisation for Visual Place Recognition
- SparseVILA: Decoupling Visual Sparsity for Efficient VLM Inference
- ConsistEdit: Highly Consistent and Precise Training-free Visual Editing
- Patronus: Safeguarding Text-to-Image Models against White-Box Adversaries
- Filtering of Small Components for Isosurface Generation
- Unlocking Off-the-Grid Sparse Recovery with Unlimited Sensing: Simultaneous Super-Resolution in Time and Amplitude
- Shape-aware Inertial Poser: Motion Tracking for Humans with Diverse Shapes Using Sparse Inertial Sensors
- DiffVLA++: Bridging Cognitive Reasoning and End-to-End Driving through Metric-Guided Alignment
- Detecting streaks in smart telescopes images with Deep Learning
- Conveying Meaning through Gestures: An Investigation into Semantic Co-Speech Gesture Generation
- ImaGGen: Zero-Shot Generation of Co-Speech Semantic Gestures Grounded in Language and Image Input
- Rao-Blackwell Gradient Estimators for Equivariant Denoising Diffusion
- Bayesian Computation in Deep Learning
- Weak-to-Strong Generalization Even in Random Feature Networks, Provably
- LLM as GNN: Graph Vocabulary Learning for Text-Attributed Graph Foundation Models
- From Equations to Insights: Unraveling Symbolic Structures in PDEs with LLMs
- Physics-Informed Deep B-Spline Networks
- LANGTRAJ: Diffusion Model and Dataset for Language-Conditioned Trajectory Simulation
- Score-based deterministic density sampling
- Challenges and proposed solutions in modeling multimodal data: A systematic review
- A Generic Framework for Conformal Fairness
- UFT: Unifying Supervised and Reinforcement Fine-Tuning
- PICT -- A Differentiable, GPU-Accelerated Multi-Block PISO Solver for Simulation-Coupled Learning Tasks in Fluid Dynamics
- HERO: Heterogeneous Continual Graph Learning via Meta-Knowledge Distillation
- Enhancing Efficiency and Exploration in Reinforcement Learning for LLMs
- Temperature is All You Need for Generalization in Langevin Dynamics and other Markov Processes
- Navigating the Latent Space Dynamics of Neural Models
- Improved Best-of-Both-Worlds Regret for Bandits with Delayed Feedback
- Neural Network Reprogrammability: A Unified Theme on Model Reprogramming, Prompt Tuning, and Prompt Instruction
- Progressive Tempering Sampler with Diffusion
- BLUR: A Bi-Level Optimization Approach for LLM Unlearning
- FlexQuant: A Flexible and Efficient Dynamic Precision Switching Framework for LLM Quantization
- GeoRecon: Graph-Level Representation Learning for 3D Molecules via Reconstruction-Based Pretraining
- Improving Rectified Flow with Boundary Conditions
- Online Learning of Whittle Indices for Restless Bandits with Non-Stationary Transition Kernels
- ESSA: Evolutionary Strategies for Scalable Alignment
- Greedy Low-Rank Gradient Compression for Distributed Learning with Convergence Guarantees
- MatPROV: A Provenance Graph Dataset of Material Synthesis Extracted from Scientific Literature
- Robust Anomaly Detection through Multi-Modal Autoencoder Fusion for Small Vehicle Damage Detection
- Federated Conditional Conformal Prediction via Generative Models
- Going with the Flow: Approximating Banzhaf Values via Graph Neural Networks
- SWIR-LightFusion: Multi-spectral Semantic Fusion of Synthetic SWIR with Thermal IR (LWIR/MWIR) and RGB
- Deep learning based numerical approximation algorithms for stochastic partial differential equations
- The Moral Foundations Reddit Corpus
- FIRE: A Failure-Adaptive Reinforcement Learning Framework for Edge Computing Migrations
- Predicting Patient Recovery or Mortality Using Deep Neural Decision Tree and Forest
- Conformal online model aggregation
- Neural Dynamic Data Valuation: A Stochastic Optimal Control Approach
- Approximately-symmetric neural networks for quantum spin liquids
- GIST: Greedy Independent Set Thresholding for Max-Min Diversification with Submodular Utility
- Estimating Treatment Effects under Recommender Interference: A Structured Neural Networks Approach
- GeoReasoner: Geo-localization with Reasoning in Street Views using a Large Vision-Language Model
- Accelerating MRI with Longitudinally-informed Latent Posterior Sampling
- Adv-SSL: Adversarial Self-Supervised Representation Learning with Theoretical Guarantees
- Invertible ResNets for Inverse Imaging Problems: Competitive Performance with Provable Regularization Properties
- Flow Matching for Accelerated Simulation of Atomic Transport in Crystalline Materials
- Learning Counterfactual Distributions via Kernel Nearest Neighbors
- Emergent field theories from neural networks
- Delta-Influence: Unlearning Poisons via Influence Functions
- What should a neuron aim for? Designing local objective functions based on information theory
- Auto-Prompt Generation is Not Robust: Prompt Optimization Driven by Pseudo Gradient
- Improved Approximation Algorithms for Low-Rank Problems Using Semidefinite Optimization
- Efficient and Responsible Adaptation of Large Language Models for Robust and Equitable Top-k Recommendations
- Time-Varying Bayesian Optimization Without a Metronome
- Large Language Diffusion Models
- Nonlinear energy-preserving model reduction with lifting transformations that quadratize the energy
- DitHub: A Modular Framework for Incremental Open-Vocabulary Object Detection
- Reassessing Active Learning Adoption in Contemporary NLP: A Community Survey
- Subgradient Method for System Identification with Non-Smooth Objectives
- Fine-Grained Classification: Connecting Metadata via Cross-Contrastive Pre-Training
- Traceback of Poisoning Attacks to Retrieval-Augmented Generation
- Statistical Decision Theory with Counterfactual Loss
- Learning Cocoercive Conservative Denoisers via Helmholtz Decomposition for Poisson Inverse Problems
- Path Gradients after Flow Matching
- Asymptotic Performance of Time-Varying Bayesian Optimization
- Hyperspectral Anomaly Detection Fused Unified Nonconvex Tensor Ring Factors Regularization
- A deep solver for backward stochastic Volterra integral equations
- A Pure Hypothesis Test for Inhomogeneous Random Graph Models Based on a Kernelised Stein Discrepancy
- ToMAP: Training Opponent-Aware LLM Persuaders with Theory of Mind
- Rao-Blackwellised Reparameterisation Gradients
- A Principled Path to Fitted Distributional Evaluation
- Quantum Reinforcement Learning Trading Agent for Sector Rotation in the Taiwan Stock Market
- Critically-Damped Higher-Order Langevin Dynamics for Generative Modeling
- A fast algorithm for solving the lasso problem exactly without homotopy using differential inclusions
- Observation-guided Interpolation Using Graph Neural Networks for High-Resolution Nowcasting in Switzerland
- Who Taught the Lie? Responsibility Attribution for Poisoned Knowledge in Retrieval-Augmented Generation
- Spacing Test for Fused Lasso
- In Generative AI We (Dis)Trust? Computational Analysis of Trust and Distrust in Reddit Discussions
- EgMM-Corpus: A Multimodal Vision-Language Dataset for Egyptian Culture
- Towards Low-Resource Alignment to Diverse Perspectives with Sparse Feedback
- Instant Personalized Large Language Model Adaptation via Hypernetwork
- Utilising Large Language Models for Generating Effective Counter Arguments to Anti-Vaccine Tweets
- FrugalPrompt: Reducing Contextual Overhead in Large Language Models via Token Attribution
- TrajSelector: Harnessing Latent Representations for Efficient and Effective Best-of-N in Large Reasoning Model
- RAVEN: Robust Advertisement Video Violation Temporal Grounding via Reinforcement Reasoning
- Agree, Disagree, Explain: Decomposing Human Label Variation in NLI through the Lens of Explanations
- Check Yourself Before You Wreck Yourself: Selectively Quitting Improves LLM Agent Safety
- ReviewGuard: Enhancing Deficient Peer Review Detection via LLM-Driven Data Augmentation
- Hallucination Benchmark for Speech Foundation Models
- Fine-tuning of Large Language Models for Constituency Parsing Using a Sequence to Sequence Approach
- Temporal Understanding under Deictic Frame of Reference
- Investigating the Impact of Rationales for LLMs on Natural Language Understanding
- so much depends / upon / a whitespace: Why Whitespace Matters for Poets and LLMs
- Enhancing Language Agent Strategic Reasoning through Self-Play in Adversarial Games
- Cross-Genre Authorship Attribution via LLM-Based Retrieve-and-Rerank
- Does Visual Grounding Enhance the Understanding of Embodied Knowledge in Large Language Models?
- ChiKhaPo: A Large-Scale Multilingual Benchmark for Evaluating Lexical Comprehension and Generation in Large Language Models
- Prompt-MII: Meta-Learning Instruction Induction for LLMs
- Back to Bytes: Revisiting Tokenization Through UTF-8
- Vocab Diet: Reshaping the Vocabulary of LLMs with Vector Arithmetic
- Online Learning Defense against Iterative Jailbreak Attacks via Prompt Optimization
- DiscoTrack: A Multilingual LLM Benchmark for Discourse Tracking
- SafeSearch: Do Not Trade Safety for Utility in LLM Search Agents
- Rethinking On-policy Optimization for Query Augmentation
- When AI companions become witty: Can human brain recognize AI-generated irony?
- Wisdom is Knowing What not to Say: Hallucination-Free LLMs Unlearning via Attention Shifting
- StreamingThinker: Large Language Models Can Think While Reading
- From Preferences to Prejudice: The Role of Alignment Tuning in Shaping Social Bias in Video Diffusion Models
- Explainability of Large Language Models: Opportunities and Challenges toward Generating Trustworthy Explanations
- TaxoAlign: Scholarly Taxonomy Generation Using Language Models
- Addressing Antisocial Behavior in Multi-Party Dialogs Through Multimodal Representation Learning
- The Atomic Instruction Gap: Instruction-Tuned LLMs Struggle with Simple, Self-Contained Directives
- Agentic Reinforcement Learning for Search is Unsafe
- Multilingual Clinical NER for Diseases and Medications Recognition in Cardiology Texts using BERT Embeddings
- Evaluating Large Language Models on Urdu Idiom Translation
- Disparities in Multilingual LLM-Based Healthcare Q&A
- ReXMoE: Reusing Experts with Minimal Overhead in Mixture-of-Experts
- Empowering Real-World: A Survey on the Technology, Practice, and Evaluation of LLM-driven Industry Agents
- Deep Self-Evolving Reasoning
- Lingua Custodi's participation at the WMT 2025 Terminology shared task
- Annotation-Efficient Universal Honesty Alignment
- When Annotators Disagree, Topology Explains: Mapper, a Topological Tool for Exploring Text Embedding Geometry and Ambiguity
- Language Confusion Gate: Language-Aware Decoding Through Model Self-Distillation
- LawChain: Modeling Legal Reasoning Chains for Chinese Tort Case Analysis
- Forget to Know, Remember to Use: Context-Aware Unlearning for Large Language Models
- Qomhra: A Bilingual Irish-English Large Language Model
- Towards Mining Effective Pedagogical Strategies from Learner-LLM Educational Dialogues
- QueST: Incentivizing LLMs to Generate Difficult Problems
- Evaluating Medical LLMs by Levels of Autonomy: A Survey Moving from Benchmarks to Applications
- HealthDial: A No-Code LLM-Assisted Dialogue Authoring Tool for Healthcare Virtual Agents
- PrivacyPAD: A Reinforcement Learning Framework for Dynamic Privacy-Aware Delegation
- SIADAFIX: issue description response for adaptive program repair
- Cerberus: Real-Time Video Anomaly Detection via Cascaded Vision-Language Models
- Investigating the Association Between Text-Based Indications of Foodborne Illness from Yelp Reviews and New York City Health Inspection Outcomes (2023)
- What Questions Should Robots Be Able to Answer? A Dataset of User Questions for Explainable Robotics
- Verifiable Fine-Tuning for LLMs: Zero-Knowledge Training Proofs Bound to Data Provenance and Policy
- Res-Bench: Benchmarking the Robustness of Multimodal Large Language Models to Dynamic Resolution Input
- A Prototypical Network with an Attention-based Encoder for Drivers Identification Application
- Adaptive Discretization for Consistency Models
- Uncertainty-aware data assimilation through variational inference
- Breaking and Fixing Defenses Against Control-Flow Hijacking in Multi-Agent Systems
- Symmetries in PAC-Bayesian Learning
- Disentanglement Beyond Static vs. Dynamic: A Benchmark and Evaluation Framework for Multi-Factor Sequential Representations
- Model Metamers Reveal Invariances in Graph Neural Networks
- Beyond Binary Out-of-Distribution Detection: Characterizing Distributional Shifts with Multi-Statistic Diffusion Trajectories
- Latent Spaces Beyond Synthesis: From GANs to Diffusion Models
- Exploration via Feature Perturbation in Contextual Bandits
- Finite-Time Bounds for Average-Reward Fitted Q-Iteration
- MILES: Modality-Informed Learning Rate Scheduler for Balancing Multimodal Learning
- RINS-T: Robust Implicit Neural Solvers for Time Series Linear Inverse Problems
- S4ECG: Exploring the impact of long-range interactions for arrhythmia prediction
- A Conditional Diffusion Model for Probabilistic Prediction of Battery Capacity Degradation
- Diffusion Models as Dataset Distillation Priors
- Deeper with Riemannian Geometry: Overcoming Oversmoothing and Oversquashing for Graph Foundation Models
- Explainable AI for microseismic event detection
- CrossStateECG: Multi-Scale Deep Convolutional Network with Attention for Rest-Exercise ECG Biometrics
- Towards geological inference with process-based and deep generative modeling, part 2: inversion of fluvial deposits and latent-space disentanglement
- Unified Privacy Guarantees for Decentralized Learning via Matrix Factorization
- Local properties of neural networks through the lens of layer-wise Hessians
- Stochastic Difference-of-Convex Optimization with Momentum
- Convergence Rates for Gradient Descent on the Edge of Stability in Overparametrised Least Squares
- SAFE-D: A Spatiotemporal Detection Framework for Abnormal Driving Among Parkinson's Disease-like Drivers
- Curiosity Meets Cooperation: A Game-Theoretic Approach to Long-Tail Multi-Label Learning
- Mitigating Clever Hans Strategies in Image Classifiers through Generating Counterexamples
- How Does Label Noise Gradient Descent Improve Generalization in the Low SNR Regime?
- Reliable Inference in Edge-Cloud Model Cascades via Conformal Alignment
- TrajMamba: An Efficient and Semantic-rich Vehicle Trajectory Pre-training Model
- The Free Transformer
- Formally Exploring Time-Series Anomaly Detection Evaluation Metrics
- Semi-supervised Latent Bayesian Optimization for Designing Antimicrobial Peptides
- ZACH-ViT: A Zero-Token Vision Transformer with ShuffleStrides Data Augmentation for Robust Lung Ultrasound Classification
- Handling Extreme Class Imbalance: Using GANs in Data Augmentation for Suicide Prediction
- Efficient Algorithms for Mitigating Uncertainty and Risk in Reinforcement Learning
- Enabling Fine-Grained Operating Points for Black-Box LLMs
- Atlas-based Manifold Representations for Interpretable Riemannian Machine Learning
- Inference-Time Compute Scaling For Flow Matching
- Functional Distribution Networks (FDN)
- Time Series Analysis in Frequency Domain: A Survey of Open Challenges, Opportunities and Benchmarks
- Geometric Dynamics of Consumer Credit Cycles: A Multivector-based Linear-Attention Framework for Explanatory Economic Analysis
- LLM-VeriPPA: Power, Performance, and Area Optimization aware Verilog Code Generation with Large Language Models
- Bitcoin Price Forecasting Based on Hybrid Variational Mode Decomposition and Long Short Term Memory Network
- Quantum and Classical Machine Learning in Decentralized Finance: Comparative Evidence from Multi-Asset Backtesting of Automated Market Makers
- TeLLMe v2: An Efficient End-to-End Ternary LLM Prefill and Decode Accelerator with Table-Lookup Matmul on Edge FPGAs
- Dynamic Factor Analysis of Price Movements in the Philippine Stock Exchange
- Attention to Non-Adopters
- Aligning Language Models with Investor and Market Behavior for Financial Recommendations
- The Invisible Handshake: Tacit Collusion between Adaptive Market Agents
- Convolutional Attention in Betting Exchange Markets
- Data for Inclusion: The Redistributive Power of Data Economics
- AGNES: Adaptive Graph Neural Network and Dynamic Programming Hybrid Framework for Real-Time Nanopore Seed Chaining
- A Storm-Centric 250 m NEXRAD Level-II Dataset for High-Resolution ML Nowcasting
- A Novel GPT-Based Framework for Anomaly Detection in System Logs
- Differentiable, Bit-shifting, and Scalable Quantization without training neural network from scratch
- Identifying multi-omics interactions for lung cancer drug targets discovery using Kernel Machine Regression
- Facts in Stats: Impacts of Pretraining Diversity on Language Model Generalization
- The Hidden Cost of Modeling P(X): Vulnerability to Membership Inference Attacks in Generative Text Classifiers
- Learning density ratios in causal inference using Bregman-Riesz regression
- The Cultural Mapping and Pattern Analysis (CMAP) Visualization Toolkit: Open Source Text Analysis for Qualitative and Computational Social Science
- Extending Prediction-Powered Inference through Conformal Prediction
- Personalized Collaborative Learning with Affinity-Based Variance Reduction
- DiffusionX: Efficient Edge-Cloud Collaborative Image Generation with Multi-Round Prompt Evolution
- RL makes MLLMs see better than SFT
- MLCPD: A Unified Multi-Language Code Parsing Dataset with Universal AST Schema
- iWatchRoadv2: Pothole Detection, Geospatial Mapping, and Intelligent Road Governance
- Blending Learning to Rank and Dense Representations for Efficient and Effective Cascades
- AoI-Aware Task Offloading and Transmission Optimization for Industrial IoT Networks: A Branching Deep Reinforcement Learning Approach
- A Relative Error-Based Evaluation Framework of Heterogeneous Treatment Effect Estimators
- VIPAMIN: Visual Prompt Initialization via Embedding Selection and Subspace Expansion
- Edge-Based Speech Transcription and Synthesis for Kinyarwanda and Swahili Languages
- From Reviews to Actionable Insights: An LLM-Based Approach for Attribute and Feature Extraction
- Multi-Marginal Schr\"odinger Bridge Matching
- Accelerated Learning on Large Scale Screens using Generative Library Models
- A three-step machine learning approach to predict market bubbles with financial news
- A Versatile Framework for Designing Group-Sparse Adversarial Attacks
- ARCO-BO: Adaptive Resource-aware COllaborative Bayesian Optimization for Heterogeneous Multi-Agent Design
- Escaping Model Collapse via Synthetic Data Verification: Near-term Improvements and Long-term Convergence
- Universal and Transferable Attacks on Pathology Foundation Models
- Robust Dynamic Staffing with Predictions
- Infinite Neural Operators: Gaussian processes on functions
- Connecting Domains and Contrasting Samples: A Ladder for Domain Generalization
- DistilLock: Safeguarding LLMs from Unauthorized Knowledge Distillation on the Edge
- U-Codec: Ultra Low Frame-rate Neural Speech Codec for Fast High-fidelity Speech Generation
- Local regression on path spaces with signature metrics
- A Control-Theoretic Approach to Dynamic Payment Routing for Success Rate Optimization
- Kernel-Based Nonparametric Tests For Shape Constraints
- Prominence-Aware Artifact Detection and Dataset for Image Super-Resolution
- Near-Optimal Quantum Algorithms for Computing (Coarse) Correlated Equilibria of General-Sum Games
- Black-box Optimization of LLM Outputs by Asking for Directions
- Prediction-Augmented Trees for Reliable Statistical Inference
- A Topological Approach to Parameterizing Deep Hedging Networks
- Adaptive Sample Sharing for Linear Regression
- Bits Leaked per Query: Information-Theoretic Bounds on Adversarial Attacks against LLMs
- Extended LSTM: Adaptive Feature Gating for Toxic Comment Classification
- Mapping from Meaning: Addressing the Miscalibration of Prompt-Sensitive Language Models
- Mode Collapse of Mean-Field Variational Inference
- Convergence of Regret Matching in Potential Games and Constrained Optimization
- DFNN: A Deep Fr\'echet Neural Network Framework for Learning Metric-Space-Valued Responses
- HyperSearch: Prediction of New Hyperedges through Unconstrained yet Efficient Search
- QR\"iS: A Preemptive Novel Method for Quishing Detection Through Structural Features of QR
- High-Level Multi-Robot Trajectory Planning And Spurious Behavior Detection
- Fair and Interpretable Deepfake Detection in Videos
- Optimal Best Arm Identification under Differential Privacy
- M2H: Multi-Task Learning with Efficient Window-Based Cross-Task Attention for Monocular Spatial Perception
- Recurrent Attention-based Token Selection for Efficient Streaming Video-LLMs
- Quantifying Climate Policy Action and Its Links to Development Outcomes: A Cross-National Data-Driven Analysis
- Estimating Orbital Parameters of Direct Imaging Exoplanet Using Neural Network
- Certified Self-Consistency: Statistical Guarantees and Test-Time Training for Reliable Reasoning in LLMs
- DETree: DEtecting Human-AI Collaborative Texts via Tree-Structured Hierarchical Representation Learning
- AWARE: Audio Watermarking with Adversarial Resistance to Edits
- Plasma Shape Control via Zero-shot Generative Reinforcement Learning
- OncoReason: Structuring Clinical Reasoning in LLMs for Robust and Interpretable Survival Prediction
- Non-asymptotic error bounds for probability flow ODEs under weak log-concavity
- Just-In-Time Piecewise-Linear Semantics for ReLU-type Networks
- Quantum Federated Learning: Architectural Elements and Future Directions
- Quantum Synthetic Data Generation for Industrial Bioprocess Monitoring
- GAS: Improving Discretization of Diffusion ODEs via Generalized Adversarial Solver
- The Marked Edge Walk: A Novel MCMC Algorithm for Sampling of Graph Partitions
- Train for Truth, Keep the Skills: Binary Retrieval-Augmented Reward Mitigates Hallucinations
- Efficient Tensor Completion Algorithms for Highly Oscillatory Operators
- VERA-V: Variational Inference Framework for Jailbreaking Vision-Language Models
- Glyph: Scaling Context Windows via Visual-Text Compression
- HUMAP: Hierarchical Uniform Manifold Approximation and Projection
- Identification and Adaptive Control of Markov Jump Systems: Sample Complexity and Regret Bounds
- Transfer Q-learning
- UniCrossFi: A Unified Framework For Cross-Domain Wi-Fi-based Gesture Recognition
- Neural Green's Operators for Parametric Partial Differential Equations
- Absolute abstraction: a renormalisation group approach
- Identifiable Latent Bandits: Leveraging observational data for personalized decision-making
- Navigating Uncertainties in Machine Learning for Structural Dynamics: A Comprehensive Survey of Probabilistic and Non-Probabilistic Approaches in Forward and Inverse Problems
- Solving Oscillator Ordinary Differential Equations in the Time Domain with High Performance via Soft-constrained Physics-informed Neural Network with Small Data
- Channel Matters: Estimating Channel Influence for Multivariate Time Series
- Riemannian Federated Learning via Averaging Gradient Streams
- Intrinsic Dimensionality of Fermi-Pasta-Ulam-Tsingou High-Dimensional Trajectories Through Manifold Learning: A Linear Approach
- OneProt: Towards Multi-Modal Protein Foundation Models
- SAFES: Sequential Privacy and Fairness Enhancing Data Synthesis for Responsible AI
- Understanding Generalization of Federated Learning: the Trade-off between Model Stability and Optimization
- A Survey and Benchmarking of Spatial-Temporal Traffic Data Imputation Models
- CEReBrO: Compact Encoder for Representations of Brain Oscillations Using Efficient Alternating Attention
- KL-Regularized RLHF with Multiple Reference Models: Exact Solutions and Sample Complexity
- Boosting Graph Robustness Against Backdoor Attacks: An Over-Similarity Perspective
- Membership Inference Attack Should Move On to Distributional Statistics for Distilled Generative Models
- Fire-EnSF: Wildfire Spread Data Assimilation using Ensemble Score Filter
- Hydrogen production from blended waste biomass: pyrolysis, thermodynamic-kinetic analysis and AI-based modelling
- User Profiles of Sleep Disorder Sufferers: Towards Explainable Clustering and Differential Variable Analysis
- STAR: Boosting Time Series Foundation Models for Anomaly Detection through State-aware Adapter
- Decision-focused Sensing and Forecasting for Adaptive and Rapid Flood Response: An Implicit Learning Approach
- Transfer learning strategies for accelerating reinforcement-learning-based flow control
- Airfoil optimization using Design-by-Morphing with minimized design-space dimensionality
- Feature-driven reinforcement learning for photovoltaic in continuous intraday trading
- Breaking Memorization Barriers in LLM Code Fine-Tuning via Information Bottleneck for Improved Generalization
- Unifying Polymer Modeling and Design via a Conformation-Centric Generative Foundation Model
- A tutorial on discovering and quantifying the effect of latent causal sources of multimodal EHR data
- Near-Equilibrium Propagation training in nonlinear wave systems
- FSRF: Factorization-guided Semantic Recovery for Incomplete Multimodal Sentiment Analysis
- Zero-shot World Models via Search in Memory
- A Minimal-Assumption Analysis of Q-Learning with Time-Varying Policies
- Expert Merging in Sparse Mixture of Experts with Nash Bargaining
- Zeroth-Order Sharpness-Aware Learning with Exponential Tilting
- Still Competitive: Revisiting Recurrent Models for Irregular Time Series Prediction
- AtomBench: A Benchmark for Generative Atomic Structure Models using GPT, Diffusion, and Flow Architectures
- Alignment is Localized: A Causal Probe into Preference Layers
- Human-Allied Relational Reinforcement Learning
- Explore-then-Commit for Nonstationary Linear Bandits with Latent Dynamics
- Benchmarking noisy label detection methods
- One-Bit Quantization for Random Features Models
- WEBSERV: A Browser-Server Environment for Efficient Training of Reinforcement Learning-based Web Agents at Scale
- QSVD: Efficient Low-rank Approximation for Unified Query-Key-Value Weight Compression in Low-Precision Vision-Language Models
- Toward General Digraph Contrastive Learning: A Dual Spatial Perspective
- Memorizing Long-tail Data Can Help Generalization Through Composition
- MGTS-Net: Exploring Graph-Enhanced Multimodal Fusion for Augmented Time Series Forecasting
- Sparse Transformer Architectures via Regularized Wasserstein Proximal Operator with $L_1$ Prior
- Colliding with Adversaries at ECML-PKDD 2025 Adversarial Attack Competition 1st Prize Solution
- Colliding with Adversaries at ECML-PKDD 2025 Model Robustness Competition 1st Prize Solution
- Buzz, Choose, Forget: A Meta-Bandit Framework for Bee-Like Decision Making
- SCALAR: Self-Calibrating Adaptive Latent Attention Representation Learning
- eDCF: Estimating Intrinsic Dimension using Local Connectivity
- Realizing LLMs' Causal Potential Requires Science-Grounded, Novel Benchmarks
- NeurIPT: Foundation Model for Neural Interfaces
- Copy-Augmented Representation for Structure Invariant Template-Free Retrosynthesis
- On the Impossibility of Retrain Equivalence in Machine Unlearning
- Simulation-free Structure Learning for Stochastic Dynamics
- Evaluating protein binding interfaces with PUMBA
- Active Target Discovery under Uninformative Prior: The Power of Permanent and Transient Memory
- High-Dimensional Privacy-Utility Dynamics of Noisy Stochastic Gradient Descent on Least Squares
- CLIP: Client-Side Invariant Pruning for Mitigating Stragglers in Secure Federated Learning
- Resolution-Aware Retrieval Augmented Zero-Shot Forecasting
- LSTM-Based Forecasting and Analysis of EV Charging Demand in a Dense Urban Campus
- Zero-Shot Performance Prediction for Probabilistic Scaling Laws
- An Efficient Semantic Segmentation Decoder for In-Car or Distributed Applications
- 3D-GSRD: 3D Molecular Graph Auto-Encoder with Selective Re-mask Decoding
- Computational Budget Should Be Considered in Data Selection
- Graph Learning is Suboptimal in Causal Bandits
- Trace Regularity PINNs: Enforcing $\mathrm{H}^{\frac{1}{2}}(\partial \Omega)$ for Boundary Data
- Finding Manifolds With Bilinear Autoencoders
- ProtoMol: Enhancing Molecular Property Prediction via Prototype-Guided Multimodal Learning
- UniGTE: Unified Graph-Text Encoding for Zero-Shot Generalization across Graph Tasks and Domains
- DeepChem Equivariant: SE(3)-Equivariant Support in an Open-Source Molecular Machine Learning Library
- SolverLLM: Leveraging Test-Time Scaling for Optimization Problem via LLM-Guided Search
- Closing the Curvature Gap: Full Transformer Hessians and Their Implications for Scaling Laws
- Differentially Private Linear Regression and Synthetic Data Generation with Statistical Guarantees
- Towards Interpretable and Trustworthy Time Series Reasoning: A BlueSky Vision
- MuonBP: Faster Muon via Block-Periodic Orthogonalization
- Graph4MM: Weaving Multimodal Learning with Structural Information
- EEschematic: Multimodal-LLM Based AI Agent for Schematic Generation of Analog Circuit
- Forgetting to Forget: Attention Sink as A Gateway for Backdooring LLM Unlearning
- Hephaestus: Mixture Generative Modeling with Energy Guidance for Large-scale QoS Degradation
- Diverse Influence Component Analysis: A Geometric Approach to Nonlinear Mixture Identifiability
- Consistent Zero-Shot Imitation with Contrastive Goal Inference
- Data Reliability Scoring
- On the Universal Near Optimality of Hedge in Combinatorial Settings
- Adapting to Stochastic and Adversarial Losses in Episodic MDPs with Aggregate Bandit Feedback
- Fighter: Unveiling the Graph Convolutional Nature of Transformers in Time Series Modeling
- Matricial Free Energy as a Gaussianizing Regularizer: Enhancing Autoencoders for Gaussian Code Generation
- Continuous Q-Score Matching: Diffusion Guided Reinforcement Learning for Continuous-Time Control
- In-situ Autoguidance: Eliciting Self-Correction in Diffusion Models
- Learning After Model Deployment
- ALPINE: A Lightweight and Adaptive Privacy-Decision Agent Framework for Dynamic Edge Crowdsensing
- Robustness in Text-Attributed Graph Learning: Insights, Trade-offs, and New Defenses
- A Standardized Benchmark for Machine-Learned Molecular Dynamics using Weighted Ensemble Sampling
- SOLE: Hardware-Software Co-design of Softmax and LayerNorm for Efficient Transformer Inference
- CooT: Learning to Coordinate In-Context with Coordination Transformers
- Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning
- A Markovian Framing of WaveFunctionCollapse for Procedurally Generating Aesthetically Complex Environments
- From Next Token Prediction to (STRIPS) World Models -- Preliminary Results
- Graph Neural Networks for the Offline Nanosatellite Task Scheduling Problem
- Membership Privacy Risks of Sharpness Aware Minimization
- Diffusion Models as Constrained Samplers for Optimization with Unknown Constraints
- LinkedIn Post Embeddings: Industrial Scale Embedding Generation and Usage across LinkedIn
- Predicting High-precision Depth on Low-Precision Devices Using 2D Hilbert Curves
- Target Networks and Over-parameterization Stabilize Off-policy Bootstrapping with Function Approximation
- Eye-for-an-eye: Appearance Transfer with Semantic Correspondence in Diffusion Models
- Exploration of Marker-Based Approaches in Argument Mining through Augmented Natural Language
- MoFO: Momentum-Filtered Optimizer for Mitigating Forgetting in LLM Fine-Tuning
- EasyRec: Simple yet Effective Language Models for Recommendation
- Familiarity-Aware Evidence Compression for Retrieval-Augmented Generation
- Packet Inspection Transformer: A Self-Supervised Journey to Unseen Malware Detection with Few Samples
- A Prospect-Theoretic Policy Gradient Framework for Behaviorally Nuanced Reinforcement Learning
- Beyond Uncertainty Quantification: Learning Uncertainty for Trust-Informed Neural Network Decisions - A Case Study in COVID-19 Classification
- Simplicity Prevails: Rethinking Negative Preference Optimization for LLM Unlearning
- Parameter Efficient Fine-tuning via Explained Variance Adaptation
- HardNet: Hard-Constrained Neural Networks with Universal Approximation Guarantees
- Enhancing Osteoporosis Detection: An Explainable Multi-Modal Learning Framework with Feature Fusion and Variable Clustering
- An Empirical Study on LLM-based Agents for Automated Bug Fixing
- Diffusion Transformers as Open-World Spatiotemporal Foundation Models
- Improving training time and GPU utilization in geo-distributed language model training
- Free$^2$Guide: Training-Free Text-to-Video Alignment using Image LVLM
- StarWhisper Telescope: An AI framework for automating end-to-end astronomical observations
- Tracing Partisan Bias to Its Emotional Fingerprints: A Computational Approach to Mitigation
- Consistency of Responses and Continuations Generated by Large Language Models on Social Media
- GFM-RAG: Graph Foundation Model for Retrieval Augmented Generation
- VolleyBots: A Testbed for Multi-Drone Volleyball Game Combining Motion Control and Strategic Play
- Harmony in Divergence: Towards Fast, Accurate, and Memory-efficient Zeroth-order LLM Fine-tuning
- Seeing in the Dark: A Teacher-Student Framework for Dark Video Action Recognition via Knowledge Distillation and Contrastive Learning
- Towards Principled Unsupervised Multi-Agent Reinforcement Learning
- Manual2Skill: Learning to Read Manuals and Acquire Robotic Skills for Furniture Assembly Using Vision-Language Models
- GRIFFIN: Effective Token Alignment for Faster Speculative Decoding
- Repo2Run: Automated Building Executable Environment for Code Repository at Scale
- Cross-Domain Graph Anomaly Detection via Test-Time Training with Homophily-Guided Self-Supervision
- FetalCLIP: A Visual-Language Foundation Model for Fetal Ultrasound Image Analysis
- Large Language Models are Powerful Electronic Health Record Encoders
- Hallucination Detection in LLMs Using Spectral Features of Attention Maps
- $Q\sharp$: Provably Optimal Distributional RL for LLM Post-Training
- Robust Deterministic Policy Gradient for Disturbance Attenuation and Its Application to Quadrotor Control
- Robust Optimization with Diffusion Models for Green Security
- GEM: Empowering MLLM for Grounded ECG Understanding with Time Series and Images
- Late Fusion and Multi-Level Fission Amplify Cross-Modal Transfer in Text-Speech LMs
- The Shape of Attraction in UMAP: Exploring the Embedding Forces in Dimensionality Reduction
- DeepSeek-Inspired Exploration of RL-based LLMs and Synergy with Wireless Networks: A Survey
- Provably Efficient Reward Transfer in Reinforcement Learning with Discrete Markov Decision Processes
- Unseen from Seen: Rewriting Observation-Instruction Using Foundation Models for Augmenting Vision-Language Navigation
- When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoning
- Exploiting Meta-Learning-based Poisoning Attacks for Graph Link Prediction
- Error Broadcast and Decorrelation as a Potential Artificial and Natural Learning Mechanism
- LLMTaxo: Leveraging Large Language Models for Constructing Taxonomy of Factual Claims from Social Media
- CodeVisionary: An Agent-based Framework for Evaluating Large Language Models in Code Generation
- LLM-Enhanced Black-Litterman Portfolio Optimization
- Improving Coverage in Combined Prediction Sets with Weighted p-values
- Intrinsic Self-Correction in LLMs: Towards Explainable Prompting via Mechanistic Interpretability
- PsyMem: Fine-grained psychological alignment and Explicit Memory Control for Advanced Role-Playing LLMs
- When majority rules, minority loses: bias amplification of gradient descent
- Incentivizing Truthful Language Models via Peer Elicitation Games
- Hard Negatives, Hard Lessons: Revisiting Training Data Quality for Robust Information Retrieval with LLMs
- Understanding Prompt Tuning and In-Context Learning via Meta-Learning
- CLIMB: Class-imbalanced Learning Benchmark on Tabular Data
- Towards Evaluating Proactive Risk Awareness of Multimodal Language Models
- CrossRF: A Domain-Invariant Deep Learning Approach for RF Fingerprinting
- DOGe: Defensive Output Generation for LLM Protection Against Knowledge Distillation
- DISCOVER: Automated Curricula for Sparse-Reward Reinforcement Learning
- Efficient Large Language Model Inference with Neural Block Linearization
- RocqStar: Leveraging Similarity-driven Retrieval and Agentic Systems for Rocq generation
- VERINA: Benchmarking Verifiable Code Generation
- REASONING GYM: Reasoning Environments for Reinforcement Learning with Verifiable Rewards
- SATA-BENCH: Select All That Apply Benchmark for Multiple Choice Questions
- KG-TRACES: Enhancing Large Language Models with Knowledge Graph-constrained Trajectory Reasoning and Attribution Supervision
- CoVoMix2: Advancing Zero-Shot Dialogue Generation with Fully Non-Autoregressive Flow Matching
- VisuRiddles: Fine-grained Perception is a Primary Bottleneck for Multimodal Large Language Models in Abstract Visual Reasoning
- RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics
- HauntAttack: When Attack Follows Reasoning as a Shadow
- Denoising the Future: Top-p Distributions for Moving Through Time
- Code Execution as Grounded Supervision for LLM Reasoning
- Every Rollout Counts: Optimal Resource Allocation for Efficient Test-Time Scaling
- From Multimodal Perception to Strategic Reasoning: A Survey on AI-Generated Game Commentary
- GeNIE: A Generalizable Navigation System for In-the-Wild Environments
- Client Clustering Meets Knowledge Sharing: Enhancing Privacy and Robustness in Personalized Peer-to-Peer Learning
- From Cradle to Cane: A Two-Pass Framework for High-Fidelity Lifespan Face Aging
- AI-Generated Video Detection via Perceptual Straightening
- DP-Fusion: Token-Level Differentially Private Inference for Large Language Models
- Controlling What You Share: Assessing Language Model Adherence to Privacy Preferences
- Multimodal Fusion at Three Tiers: Physics-Driven Data Generation and Vision-Language Guidance for Brain Tumor Segmentation
- From Sequence to Structure: Uncovering Substructure Reasoning in Transformers
- Adaptive Policy Synchronization for Scalable Reinforcement Learning
- ReDi: Rectified Discrete Flow
- Why and How Auxiliary Tasks Improve JEPA Representations
- Pursuing Minimal Sufficiency in Spatial Reasoning
- On the Granularity of Causal Effect Identifiability
- Natural Language Processing Applications in Cardiology: A Narrative Review
- HumanCM: One Step Human Motion Prediction
- The Chameleon Nature of LLMs: Quantifying Multi-Turn Stance Instability in Search-Enabled Language Models
- Eliciting Grounded Chain-of-Thought Reasoning in 3D Scenes
- Beacon: Single-Turn Diagnosis and Mitigation of Latent Sycophancy in Large Language Models
- SAMOSA: Sharpness Aware Minimization for Open Set Active learning
- Region in Context: Text-condition Image editing with Human-like semantic reasoning
- Learning to play: A Multimodal Agent for 3D Game-Play
- EMRRG: Efficient Fine-Tuning Pre-trained X-ray Mamba Networks for Radiology Report Generation
- Xiaoice: Training-Free Video Understanding via Self-Supervised Spatio-Temporal Clustering of Semantic Features
- LC-Eval: A Bilingual Multi-Task Evaluation Benchmark for Long-Context Understanding
- More with Less: An Empirical Study of Turn-Control Strategies for Efficient Coding Agents
- MOSAIC: Masked Objective with Selective Adaptation for In-domain Contrastive Learning
- Mixed-Precision Quantization for Language Models: Techniques and Prospects
- Improving Model Representation and Reducing KV Cache via Skip Connections with First Value Heads
- When Many-Shot Prompting Fails: An Empirical Study of LLM Code Translation
- Needles in the Landscape: Semi-Supervised Pseudolabeling for Archaeological Site Discovery under Label Scarcity
- Knowing the Facts but Choosing the Shortcut: Understanding How Large Language Models Compare Entities
- Efficient High-Accuracy PDEs Solver with the Linear Attention Neural Operator
- ReefNet: A Large scale, Taxonomically Enriched Dataset and Benchmark for Hard Coral Classification
- Who's Asking? Simulating Role-Based Questions for Conversational AI Evaluation
- Schr\"odinger Bridge Mamba for One-Step Speech Enhancement
- FinSight: Towards Real-World Financial Deep Research
- Neuronal Group Communication for Efficient Neural representation
- Agentic Inequality
- ArmFormer: Lightweight Transformer Architecture for Real-Time Multi-Class Weapon Segmentation and Classification
- DrivAerStar: An Industrial-Grade CFD Dataset for Vehicle Aerodynamic Optimization
- Fly-CL: A Fly-Inspired Framework for Enhancing Efficient Decorrelation and Reduced Training Time in Pre-trained Model-based Continual Representation Learning
- Utility-Diversity Aware Online Batch Selection for LLM Supervised Fine-tuning
- Investigating Safety Vulnerabilities of Large Audio-Language Models Under Speaker Emotional Variations
- Adaptive Online Learning with LSTM Networks for Energy Price Prediction
- SNOMED CT-powered Knowledge Graphs for Structured Clinical Data and Diagnostic Reasoning
- A Lightweight DL Model for Smart Grid Power Forecasting with Feature and Resolution Mismatch
- Domain Generalizable Continual Learning
- SAKE: Towards Editing Auditory Attribute Knowledge of Large Audio-Language Models
- UNDREAM: Bridging Differentiable Rendering and Photorealistic Simulation for End-to-end Adversarial Attacks
- Tutoring LLM into a Better CUDA Optimizer
- A Primer on Kolmogorov-Arnold Networks (KANs) for Probabilistic Time Series Forecasting
- Peering Inside the Black Box: Uncovering LLM Errors in Optimization Modelling through Component-Level Evaluation
- Quantile Regression, Variational Autoencoders, and Diffusion Models for Uncertainty Quantification: A Spatial Analysis of Sub-seasonal Wind Speed Prediction
- Leave It to the Experts: Detecting Knowledge Distillation via MoE Expert Signatures
- Foundation Models in Medical Image Analysis: A Systematic Review and Meta-Analysis
- One-step Diffusion Models with Bregman Density Ratio Matching
- Parameter-Efficient Fine-Tuning for Low-Resource Languages: A Comparative Study of LLMs for Bengali Hate Speech Detection
- CARE: Contrastive Alignment for ADL Recognition from Event-Triggered Sensor Streams
- ReclAIm: A multi-agent framework for degradation-aware performance tuning of medical imaging AI
- Justitia: Fair and Efficient Scheduling for LLM Applications
- Curiosity-driven RL for symbolic equation solving
- DINO-CVA: A Multimodal Goal-Conditioned Vision-to-Action Model for Autonomous Catheter Navigation
- Video Reasoning without Training
- The Ends Justify the Thoughts: RL-Induced Motivated Reasoning in LLMs
- Bitwidth-Specific Logarithmic Arithmetic for Future Hardware-Accelerated Training
- Investigating Thinking Behaviours of Reasoning-Based Language Models for Social Bias Mitigation
- Explainable Heterogeneous Anomaly Detection in Financial Networks via Adaptive Expert Routing
- Can Transformer Memory Be Corrupted? Investigating Cache-Side Vulnerabilities in Large Language Models
- Verification-Aware Planning for Multi-Agent Systems
- Efficient Vision-Language-Action Models for Embodied Manipulation: A Systematic Survey
- DVAGen: Dynamic Vocabulary Augmented Generation
- GOOD: Training-Free Guided Diffusion Sampling for Out-of-Distribution Detection
- Do LLMs Recognize Your Latent Preferences? A Benchmark for Latent Information Discovery in Personalized Interaction
- GACO-CAD: Geometry-Augmented and Conciseness-Optimized CAD Model Generation from Single Image
- TREAT: A Code LLMs Trustworthiness / Reliability Evaluation and Testing Framework
- Benchmarking Out-of-Distribution Detection for Plankton Recognition: A Systematic Evaluation of Advanced Methods in Marine Ecological Monitoring
- SimpleVSF: VLM-Scoring Fusion for Trajectory Prediction of End-to-End Autonomous Driving
- Understanding and Improving Length Generalization in Hierarchical Sparse Attention Models
- ZSPAPrune: Zero-Shot Prompt-Aware Token Pruning for Vision-Language Models
- From Pixels to People: Satellite-Based Mapping and Quantification of Riverbank Erosion and Lost Villages in Bangladesh
- Round Outcome Prediction in VALORANT Using Tactical Features from Video Analysis
- Soft-Masked Diffusion Language Models
- D2C-HRHR: Discrete Actions with Double Distributional Critics for High-Risk-High-Return Tasks
- Diagnosis of Fuel Cell Health Status with Deep Sparse Auto-Encoder Neural Network
- When One Moment Isn't Enough: Multi-Moment Retrieval with Cross-Moment Interactions
- Taming Modality Entanglement in Continual Audio-Visual Segmentation
- Visibility Allocation Systems: How Algorithmic Design Shapes Online Visibility and Societal Outcomes
- How News Feels: Understanding Affective Bias in Multilingual Headlines for Human-Centered Media Design
- Augmented Web Usage Mining and User Experience Optimization with CAWAL's Enriched Analytics Data
- FineVision: Open Data Is All You Need
- MemoryBench: A Benchmark for Memory and Continual Learning in LLM Systems
- Comprehending Spatio-temporal Data via Cinematic Storytelling using Large Language Models
- Auto-Rubric: Learning to Extract Generalizable Criteria for Reward Modeling
- CharDiff: A Diffusion Model with Character-Level Guidance for License Plate Image Restoration
- DDSC: Dynamic Dual-Signal Curriculum for Data-Efficient Acoustic Scene Classification under Domain Shift
- TopSeg: A Multi-Scale Topological Framework for Data-Efficient Heart Sound Segmentation
- Towards Mixed-Modal Retrieval for Universal Retrieval-Augmented Generation
- Localist LLMs with Recruitment Learning
- Bridging Embodiment Gaps: Deploying Vision-Language-Action Models on Soft Robots
- Optimizing Energy Management of Smart Grid using Reinforcement Learning aided by Surrogate models built using Physics-informed Neural Networks
- TabR1: Taming GRPO for tabular reasoning LLMs
- Inference of Deterministic Finite Automata via Q-Learning
- EduAdapt: A Question Answer Benchmark Dataset for Evaluating Grade-Level Adaptability in LLMs
- Leveraging Group Relative Policy Optimization to Advance Large Language Models in Traditional Chinese Medicine
- AFRICAPTION: Establishing a New Paradigm for Image Captioning in African Languages
- BenCao: An Instruction-Tuned Large Language Model for Traditional Chinese Medicine
- Navigating the Alignment-Calibration Trade-off: A Pareto-Superior Frontier via Model Merging
- From Spatial to Actions: Grounding Vision-Language-Action Model in Spatial Foundation Priors
- The Parameterized Complexity of Computing the VC-Dimension
- Layer Specialization Underlying Compositional Reasoning in Transformers
- DAMSDAN: Distribution-Aware Multi-Source Domain Adaptation Network for Cross-Domain EEG-based Emotion Recognition
- SparseWorld: A Flexible, Adaptive, and Efficient 4D Occupancy World Model Powered by Sparse and Dynamic Queries
- I-RAVEN-X: Benchmarking Generalization and Robustness of Analogical and Mathematical Reasoning in Large Language and Reasoning Models
- Context-Aware Pseudo-Label Scoring for Zero-Shot Video Summarization
- The Graphon Limit Hypothesis: Understanding Neural Network Pruning via Infinite Width Analysis
- SimBench: Benchmarking the Ability of Large Language Models to Simulate Human Behaviors
- MUG-V 10B: High-efficiency Training Pipeline for Large Video Generation Models
- MambaX-Net: Dual-Input Mamba-Enhanced Cross-Attention Network for Longitudinal MRI Segmentation
- An Empirical Study of Lagrangian Methods in Safe Reinforcement Learning
- Intent-Driven LLM Ensemble Planning for Flexible Multi-Robot Disassembly: Demonstration on EV Batteries
- CEPerFed: Communication-Efficient Personalized Federated Learning for Multi-Pulse MRI Classification
- HGAdapter: Hypergraph-based Adapters in Language Models for Code Summarization and Clone Detection
- GUIDE: Enhancing Gradient Inversion Attacks in Federated Learning with Denoising Models
- CaMiT: A Time-Aware Car Model Dataset for Classification and Generation
- RESample: A Robust Data Augmentation Framework via Exploratory Sampling for Robotic Manipulation
- Frugal Federated Learning for Violence Detection: A Comparison of LoRA-Tuned VLMs and Personalized CNNs
- On-the-Fly OVD Adaptation with FLAME: Few-shot Localization via Active Marginal-Samples Exploration
- LILO: Bayesian Optimization with Interactive Natural Language Feedback
- PICABench: How Far Are We from Physically Realistic Image Editing?
- Intelligent Communication Mixture-of-Experts Boosted-Medical Image Segmentation Foundation Model
- Multilingual Text-to-Image Person Retrieval via Bidirectional Relation Reasoning and Aligning
- CrossGuard: Safeguarding MLLMs against Joint-Modal Implicit Malicious Attacks
- Improving Cross-Patient Generalization in Parkinson's Disease Detection through Chunk-Based Analysis of Hand-Drawn Patterns
- Closing the Sim2Real Performance Gap in RL
- PANER: A Paraphrase-Augmented Framework for Low-Resource Named Entity Recognition
- MT-Video-Bench: A Holistic Video Understanding Benchmark for Evaluating Multimodal LLMs in Multi-Turn Dialogues
- Signature Forgery Detection: Improving Cross-Dataset Generalization
- AcademicEval: Live Long-Context LLM Benchmark
- A Multi-Threading Kernel for Enabling Neuromorphic Edge Applications
- Human-AI Interactions: Cognitive, Behavioral, and Emotional Impacts
- Prediction of Sea Ice Velocity and Concentration in the Arctic Ocean using Physics-informed Neural Network
- Towards Explainable Skin Cancer Classification: A Dual-Network Attention Model with Lesion Segmentation and Clinical Metadata Fusion
- Mapping Post-Training Forgetting in Language Models at Scale
- SoftMimic: Learning Compliant Whole-body Control from Examples
- Foundational Automatic Evaluators: Scaling Multi-Task Generative Evaluator Training for Reasoning-Centric Domains
- Executable Knowledge Graphs for Replicating AI Research
- Enterprise Deep Research: Steerable Multi-Agent Deep Research for Enterprise Analytics
- Unbiased Gradient Low-Rank Projection
- A Survey on Self-play Methods in Reinforcement Learning
- Fully Autonomous AI Agents Should Not be Developed
- Robust Search with Uncertainty-Aware Value Models for Language Model Reasoning
- Automated Knowledge Component Generation for Interpretable Knowledge Tracing in Coding Problems
- Online Feedback Efficient Active Target Discovery in Partially Observable Environments
- RealMath: A Continuous Benchmark for Evaluating Language Models on Research-Level Mathematics
- Ineq-Comp: Benchmarking Human-Intuitive Compositional Reasoning in Automated Theorem Proving on Inequalities
- Visual Instruction Bottleneck Tuning
- Smart Traffic Signals: Comparing MARL and Fixed-Time Strategies
- Enumerate-Conjecture-Prove: Formally Solving Answer-Construction Problems in Math Competitions
- AgentAuditor: Human-Level Safety and Security Evaluation for LLM Agents
- macOSWorld: A Multilingual Interactive Benchmark for GUI Agents
- CTR-LoRA: Curvature-Aware and Trust-Region Guided Low-Rank Adaptation for Large Language Models
- ESCA: Contextualizing Embodied Agents via Scene-Graph Generation
- Long Exposure: Accelerating Parameter-Efficient Fine-Tuning for LLMs under Shadowy Sparsity
- One Token Embedding Is Enough to Deadlock Your Large Reasoning Model
- Gains: Fine-grained Federated Domain Adaptation in Open Set
- Self-Attention to Operator Learning-based 3D-IC Thermal Simulation
- LinearizeLLM: An Agent-Based Framework for LLM-Driven Exact Linear Reformulation of Nonlinear Optimization Problems
- Predict Training Data Quality via Its Geometry in Metric Space
- A Graph-Attentive LSTM Model for Malicious URL Detection
- Quantum NLP models on Natural Language Inference
- Safeguarding Efficacy in Large Language Models: Evaluating Resistance to Human-Written and Algorithmic Adversarial Prompts
- Learning to Watermark: A Selective Watermarking Framework for Large Language Models via Multi-Objective Optimization
- Bolster Hallucination Detection via Prompt-Guided Data Augmentation
- DAWP: A framework for global observation forecasting via Data Assimilation and Weather Prediction in satellite observation space
- Cog-Rethinker: Hierarchical Metacognitive Reinforcement Learning for LLM Reasoning
- AMiD: Knowledge Distillation for LLMs with $\alpha$-mixture Assistant Distribution
- MEET-Sepsis: Multi-Endogenous-View Enhanced Time-Series Representation Learning for Early Sepsis Prediction Representation Learning for Early Sepsis Prediction
- Algorithmic Primitives and Compositional Geometry of Reasoning in Language Models
- Can GRPO Help LLMs Transcend Their Pretraining Origin?
- Stratos: An End-to-End Distillation Pipeline for Customized LLMs under Distributed Cloud Environments
- MCP Security Bench (MSB): Benchmarking Attacks Against Model Context Protocol in LLM Agents
- Using Kolmogorov-Smirnov Distance for Measuring Distribution Shift in Machine Learning
- AMStraMGRAM: Adaptive Multi-cutoff Strategy Modification for ANaGRAM
- Breaking Guardrails, Facing Walls: Insights on Adversarial AI for Defenders & Researchers
- Layer-Aware Influence for Online Data Valuation Estimation
- InfraGPT Smart Infrastructure: An End-to-End VLM-Based Framework for Detecting and Managing Urban Defects
- On-Chain Decentralized Learning and Cost-Effective Inference for DeFi Attack Mitigation
- Nondeterminism-Aware Optimistic Verification for Floating-Point Neural Networks
- Disaster Management in the Era of Agentic AI Systems: A Vision for Collective Human-Machine Intelligence for Augmented Resilience
- RoBCtrl: Attacking GNN-Based Social Bot Detectors via Reinforced Manipulation of Bots Control Interaction
- Membership Inference over Diffusion-models-based Synthetic Tabular Data
- Vector Quantization in the Brain: Grid-like Codes in World Models
- Kelle: Co-design KV Caching and eDRAM for Efficient LLM Serving in Edge Computing
- Does Capital Dream of Artificial Labour?
- AMS-QUANT: Adaptive Mantissa Sharing for Floating-point Quantization
- Open Shouldn't Mean Exempt: Open-Source Exceptionalism and Generative AI
- In the Mood to Exclude: Revitalizing Trespass to Chattels in the Era of GenAI Scraping
- GUIrilla: A Scalable Framework for Automated Desktop UI Exploration
- FUSE-Traffic: Fusion of Unstructured and Structured Data for Event-aware Traffic Forecasting
- Algorithmic Fairness in AI Surrogates for End-of-Life Decision-Making
- Fusion-Augmented Large Language Models: Boosting Diagnostic Trustworthiness via Model Consensus
- Beyond Accuracy: Are Time Series Foundation Models Well-Calibrated?
- Can LLMs Correct Themselves? A Benchmark of Self-Correction in LLMs
- Learning a Generalized Model for Substation Level Voltage Estimation in Distribution Networks
- Residual Correction Models for AC Optimal Power Flow Using DC Optimal Power Flow Solutions
- FedPURIN: Programmed Update and Reduced INformation for Sparse Personalized Federated Learning
- Cash Flow Underwriting with Bank Transaction Data: Advancing MSME Financial Inclusion in Malaysia
- Co-Designing Interdisciplinary Design Projects with AI
- Human or AI? Comparing Design Thinking Assessments by Teaching Assistants and Bots
- Effect of Reporting Mode and Clinical Experience on Radiologists' Gaze and Image Analysis Behavior in Chest Radiography
- MNO: Multiscale Neural Operator for Computational Fluid Dynamics with 3D Point Cloud Data
- Data-Driven Analysis of Intersectional Bias in Image Classification: A Framework with Bias-Weighted Augmentation
- Early-stopping for Transformer model training
- Optimization of the quantization of dense neural networks from an exact QUBO formulation
- BPL: Bias-adaptive Preference Distillation Learning for Recommender System
- Continual Knowledge Consolidation LORA for Domain Incremental Learning
- ISO/IEC-Compliant Match-on-Card Face Verification with Short Binary Templates
- EvolveR: Self-Evolving LLM Agents through an Experience-Driven Lifecycle
- TriAgent: Automated Biomarker Discovery with Deep Research Grounding for Triage in Acute Care by LLM-Based Multi-Agent Collaboration
- SARHAchat: An LLM-Based Chatbot for Sexual and Reproductive Health Counseling
- Interpretable RNA-Seq Clustering with an LLM-Based Agentic Evidence-Grounded Framework
- PassREfinder-FL: Privacy-Preserving Credential Stuffing Risk Prediction via Graph-Based Federated Learning for Representing Password Reuse between Websites
- MoPHES:Leveraging on-device LLMs as Agent for Mobile Psychological Health Evaluation and Support
- STABLE: Gated Continual Learning for Large Language Models
- Evaluating Prompting Strategies and Large Language Models in Systematic Literature Review Screening: Relevance and Task-Stage Classification
- Compressing Many-Shots in In-Context Learning
- Narrowing Action Choices with AI Improves Human Sequential Decisions
- Aria Gen 2 Pilot Dataset
- GuideFlow3D: Optimization-Guided Rectified Flow For Appearance Transfer
- Agentic AI for Ultra-Modern Networks: Multi-Agent Framework for RAN Autonomy and Assurance
- Publication Trend Analysis and Synthesis via Large Language Model: A Case Study of Engineering in PNAS
- AsyncVoice Agent: Real-Time Explanation for LLM Planning and Reasoning
- Bridging Symmetry and Robustness: On the Role of Equivariance in Enhancing Adversarial Robustness
- The Formalism-Implementation Gap in Reinforcement Learning Research
- Expressive Reward Synthesis with the Runtime Monitoring Language
- Zero-Shot Coordination in Ad Hoc Teams with Generalized Policy Improvement and Difference Rewards
- Seeing Through the Brain: New Insights from Decoding Visual Stimuli with fMRI
- Revealing Low-Dimensional Structure in 2D Richtmyer-Meshkov Instabilities via Parametric Reduced-Order Modeling
- SentinelNet: Safeguarding Multi-Agent Collaboration Through Credit-Based Dynamic Threat Detection
- What Can String Probability Tell Us About Grammaticality?
- Machine Learning for Climate Policy: Understanding Policy Progression in the European Green Deal
- Protein Folding with Neural Ordinary Differential Equations
- Detecting Adversarial Fine-tuning with Auditing Agents
- NEBULA: Do We Evaluate Vision-Language-Action Agents Correctly?
- MuseTok: Symbolic Music Tokenization for Generation and Semantic Understanding
- Do What You Say: Steering Vision-Language-Action Models via Runtime Reasoning-Action Alignment Verification
- Disentangling Hyperedges through the Lens of Category Theory
- Synergizing chemical and AI communities for advancing laboratories of the future
- OpenLVLM-MIA: A Controlled Benchmark Revealing the Limits of Membership Inference Attacks on Large Vision-Language Models
- Scaffold-Aware Generative Augmentation and Reranking for Enhanced Virtual Screening
- Lung Cancer Classification from CT Images Using ResNet
- Time-Embedded Algorithm Unrolling for Computational MRI
- Thinking About Thinking: Evaluating Reasoning in Post-Trained Language Models
- Manual2Skill++: Connector-Aware General Robotic Assembly from Instruction Manuals via Vision-Language Models
- End-to-End Argument Mining through Autoregressive Argumentative Structure Prediction
- Cataract-LMM: Large-Scale, Multi-Source, Multi-Task Benchmark for Deep Learning in Surgical Video Analysis
- Navigating through the hidden embedding space: steering LLMs to improve mental health assessment
- Conformal Prediction in The Loop: A Feedback-Based Uncertainty Model for Trajectory Optimization
- MoReBench: Evaluating Procedural and Pluralistic Moral Reasoning in Language Models, More than Outcomes
- ATA: A Neuro-Symbolic Approach to Implement Autonomous and Trustworthy Agents
- Probing the Hidden Talent of ASR Foundation Models for L2 English Oral Assessment
- SPLite Hand: Sparsity-Aware Lightweight 3D Hand Pose Estimation
- Modeling Expert Interactions in Sparse Mixture of Experts via Graph Structures
- SSL4RL: Revisiting Self-supervised Learning as Intrinsic Reward for Visual-Language Reasoning
- EDVD-LLaMA: Explainable Deepfake Video Detection via Multimodal Large Language Model Reasoning
- Input Domain Aware MoE: Decoupling Routing Decisions from Task Optimization in Mixture of Experts
- Declarative Techniques for NL Queries over Heterogeneous Data
- Automated Composition of Agents: A Knapsack Approach for Agentic Component Selection
- Structured Temporal Causality for Interpretable Multivariate Time Series Anomaly Detection
- Image Categorization and Search via a GAT Autoencoder and Representative Models
- DIV-Nav: Open-Vocabulary Spatial Relationships for Multi-Object Navigation
- Few-Label Multimodal Modeling of SNP Variants and ECG Phenotypes Using Large Language Models for Cardiovascular Risk Stratification
- Enhancing Compositional Reasoning in CLIP via Reconstruction and Alignment of Text Descriptions
- Watch Where You Move: Region-aware Dynamic Aggregation and Excitation for Gait Recognition
- Predicting life satisfaction using machine learning and explainable AI
- LANPO: Bootstrapping Language and Numerical Feedback for Reinforcement Learning in LLMs
- Toward Understanding Security Issues in the Model Context Protocol Ecosystem
- Language over Content: Tracing Cultural Understanding in Multilingual Large Language Models
- AI-Generated Text Detection in Low-Resource Languages: A Case Study on Urdu
- Atom-anchored LLMs speak Chemistry: A Retrosynthesis Demonstration
- Symmetry and Generalisation in Neural Approximations of Renormalisation Transformations
- SHIELD: Suppressing Hallucinations In LVLM Encoders via Bias and Vulnerability Defense
- Asymptotically Stable Quaternion-valued Hopfield-structured Neural Network with Periodic Projection-based Supervised Learning Rules
- Prior Makes It Possible: From Sublinear Graph Algorithms to LLM Test-Time Methods
- A Deep Learning Framework for Real-Time Image Processing in Medical Diagnostics: Enhancing Accuracy and Speed in Clinical Applications
- Prompt Optimization via Retrieved Reasoning Assets and Multi-Agent Analysis
- Structured Interfaces for Automated Reasoning with 3D Scene Graphs
- Unleashing Diverse Thinking Modes in LLMs through Multi-Agent Collaboration
- Safire: Similarity Framework for Visualization Retrieval
- All You Need is One: Capsule Prompt Tuning with a Single Vector
- Renaissance of RNNs in Streaming Clinical Time Series: Compact Recurrence Remains Competitive with Transformers
- VisuoAlign: Safety Alignment of LVLMs with Multimodal Tree Search
- Executable Epistemology: The Structured Cognitive Loop as an Architecture of Intentional Understanding
- Exploring the Potential of Citiverses for Regulatory Learning
- PISA: A Pragmatic Psych-Inspired Unified Memory System for Enhanced AI Agency
- Limits of Emergent Reasoning of Large Language Models in Agentic Frameworks for Deterministic Games
- Cognitive Load Traces as Symbolic and Visual Accounts of Deep Model Cognition
- ProofFlow: A Dependency Graph Approach to Faithful Proof Autoformalization
- Ontologies in Motion: A BFO-Based Approach to Knowledge Graph Construction for Motor Performance Research Data in Sports Science
- A Non-overlap-based Conflict Measure for Random Permutation Sets
- PAINT: Parallel-in-time Neural Twins for Dynamical System Reconstruction
- Global-focal Adaptation with Information Separation for Noise-robust Transfer Fault Diagnosis
- Algorithms for dynamic scheduling in manufacturing, towards digital factories Improving Deadline Feasibility and Responsiveness via Temporal Networks
- Reliability of Large Language Model Generated Clinical Reasoning in Assisted Reproductive Technology: Blinded Comparative Evaluation Study
- Operationalising Extended Cognition: Formal Metrics for Corporate Knowledge and Legal Accountability
- Towards Automatic Evaluation and Selection of PHI De-identification Models via Multi-Agent Collaboration
- The Right to Be Remembered: Preserving Maximally Truthful Digital Memory in the Age of AI
- ScholarEval: Research Idea Evaluation Grounded in Literature
- Distractor Injection Attacks on Large Reasoning Models: Characterization and Defense
- What Limits Agentic Systems Efficiency?
- DTKG: Dual-Track Knowledge Graph-Verified Reasoning Framework for Multi-Hop QA
- MedRule-KG: A Knowledge-Graph--Steered Scaffold for Mathematical Reasoning with a Lightweight Verifier
- Beyond Fixed Anchors: Precisely Erasing Concepts with Sibling Exclusive Counterparts
- The Burden of Interactive Alignment with Inconsistent Preferences
- Before you , monitor: Implementing Flavell's metacognitive framework in LLMs
- Humanoid-inspired Causal Representation Learning for Domain Generalization
- RGMem: Renormalization Group-based Memory Evolution for Language Agent User Profile
- ReviewSense: Transforming Customer Review Dynamics into Actionable Business Insights
- NP-Engine: Empowering Optimization Reasoning in Large Language Models with Verifiable Synthetic NP Problems
- Hey Pentti, We Did It Again!: Differentiable vector-symbolic types that prove polynomial termination
- Urban-R1: Reinforced MLLMs Mitigate Geospatial Biases for Urban General Intelligence
- BuildArena: A Physics-Aligned Interactive Benchmark of LLMs for Engineering Construction
- Ripple Effect Protocol: Coordinating Agent Populations
- Can Knowledge-Graph-based Retrieval Augmented Generation Really Retrieve What You Need?
- Uncertain Knowledge Graph Completion via Semi-Supervised Confidence Distribution Learning
- Count Counts: Motivating Exploration in LLM Reasoning with Count-based Intrinsic Rewards
- Foundation and Large-Scale AI Models in Neuroscience: A Comprehensive Review
- An Agentic Framework with LLMs for Solving Complex Vehicle Routing Problems
- Beyond Pipelines: A Survey of the Paradigm Shift toward Model-Native Agentic AI
- A Comprehensive Survey on Reinforcement Learning-based Agentic Search: Foundations, Roles, Optimizations, Evaluations, and Applications
- Surrogate Modeling and Explainable Artificial Intelligence for Complex Systems: A Workflow for Automated Simulation Exploration
- ELMM: Efficient Lightweight Multimodal Large Language Models for Multimodal Knowledge Graph Completion
- End-to-end Listen, Look, Speak and Act
- See or Say Graphs: Agent-Driven Scalable Graph Understanding with Vision-Language Models
- Domain-Contextualized Concept Graphs: A Computable Framework for Knowledge Representation
- DeepAnalyze: Agentic Large Language Models for Autonomous Data Science
- VAGEN: Reinforcing World Model Reasoning for Multi-Turn VLM Agents
- A Comparative User Evaluation of XRL Explanations using Goal Identification
- STARK: Strategic Team of Agents for Refining Kernels
- ToolCritic: Detecting and Correcting Tool-Use Errors in Dialogue Systems
- A Brain Cell Type Resource Created by Large Language Models and a Multi-Agent AI System for Collaborative Community Annotation
- Structured Debate Improves Corporate Credit Reasoning in Financial AI
- Enhanced Fish Freshness Classification with Incremental Handcrafted Feature Fusion
- Physics-Informed Large Language Models for HVAC Anomaly Detection with Autonomous Rule Generation
- Which LLM Multi-Agent Protocol to Choose?
- Combining ECG Foundation Model and XGBoost to Predict In-Hospital Malignant Ventricular Arrhythmias in AMI Patients
- Offline Policy Evaluation of Multi-Turn LLM Health Coaching with Real Users
- Temporally Detailed Hypergraph Neural ODEs for Type 2 Diabetes Progression Modeling
- Coinvisor: An RL-Enhanced Chatbot Agent for Interactive Cryptocurrency Investment Analysis
- RubiSCoT: A Framework for AI-Supported Academic Assessment
- Graph Attention-Guided Search for Dense Multi-Agent Pathfinding
- Diverse Planning with Simulators via Linear Temporal Logic
- Active Inference for an Intelligent Agent in Autonomous Reconnaissance Missions
- Label Indeterminacy in AI & Law
- MIRAGE: Agentic Framework for Multimodal Misinformation Detection with Web-Grounded Reasoning
- Reasoning Distillation and Structural Alignment for Improved Code Generation
- OG-Rank: Learning to Rank Fast and Slow with Uncertainty and Reward-Trend Guided Adaptive Exploration
- LLM-as-a-Prophet: Understanding Predictive Intelligence with Prophet Arena
- A Principle of Targeted Intervention for Multi-Agent Reinforcement Learning
- Contextual Attention Modulation: Towards Efficient Multi-Task Adaptation in Large Language Models
- Seeing but Not Believing: Probing the Disconnect Between Visual Attention and Answer Correctness in VLMs
- A Semantic Generalization of Shannon's Information Theory and Applications
- Multimodal Chip Physical Design Engineer Assistant
- FlexLink: Boosting your NVLink Bandwidth by 27% without accuracy concern
- FinFlowRL: An Imitation-Reinforcement Learning Framework for Adaptive Stochastic Control in Finance
- Mitigating Harmful Erraticism in LLMs Through Dialectical Behavior Therapy Based De-Escalation Strategies
- A Real-Time BCI for Stroke Hand Rehabilitation Using Latent EEG Features from Healthy Subjects
- Detecting and Preventing Harmful Behaviors in AI Companions: Development and Evaluation of the SHIELD Supervisory System
- Accelerating Frontier MoE Training with 3D Integrated Optics
- BREATH: A Bio-Radar Embodied Agent for Tonal and Human-Aware Diffusion Music Generation
- From Coordination to Personalization: A Trust-Aware Simulation Framework for Emergency Department Decision Support
- "She's Like a Person but Better": Characterizing Companion-Assistant Dynamics in Human-AI Relationships
- FVDebug: An LLM-Driven Debugging Assistant for Automated Root Cause Analysis of Formal Verification Failures
- Sleeping Kelly is a Thirder
- VeriGRAG: Enhancing LLM-Based Verilog Code Generation with Structure-Aware Soft Prompts
- Intent-Driven Storage Systems: From Low-Level Tuning to High-Level Understanding
- Comparing LLMs for Sentiment Analysis in Financial Market News
- Impl\'ementation Efficiente de Fonctions de Convolution sur FPGA \`a l'Aide de Blocs Param\'etrables et d'Approximations Polynomiales
- Lean Finder: Semantic Search for Mathlib That Understands User Intents
- Lyapunov-Stable Adaptive Control for Multimodal Concept Drift
- BEACON: Bayesian Optimal Stopping for Efficient LLM Sampling
- Learning from Mistakes: Enhancing Harmful Meme Detection via Misjudgment Risk Patterns
- WaveNet's Precision in EEG Classification
- ATLAS: Adaptive Trading with LLM AgentS Through Dynamic Prompt Optimization and Multi-Agent Coordination
- Cross-dataset Multivariate Time-series Model for Parkinson's Diagnosis via Keyboard Dynamics
- How Good Are LLMs at Processing Tool Outputs?
- Interpretable Graph-Language Modeling for Detecting Youth Illicit Drug Use
Research Sources: 1024 | Generated: 10/21/2025
