AI Research News Feeds for October 21st, 2025

AI RESEARCH PAPERS & ACADEMIC SOURCES

Botany-Bot: Digital Twin Monitoring of Occluded and Underleaf Plant Structures with Gaussian Splats
Dress Well via Fashion Cognitive Learning
Privacy-Preserving Visual Localization with Event Cameras
Limitations of Data-Driven Spectral Reconstruction -- An Optics-Aware Analysis
FireANTs: Adaptive Riemannian Optimization for Multi-Scale Diffeomorphic Matching
Text-controlled Motion Mamba: Text-Instructed Temporal Grounding of Human Motion
Enhancing Test Time Adaptation with Few-shot Guidance
Large Language Model-Guided Semantic Alignment for Human Activity Recognition
Improvement of Spiking Neural Network with Bit Planes and Color Models
VisualLens: Personalization through Task-Agnostic Visual History
SoPo: Text-to-Motion Generation Using Semi-Online Preference Optimization
FairGen: Enhancing Fairness in Text-to-Image Diffusion Models via Self-Discovering Latent Directions
NanoHTNet: Nano Human Topology Network for Efficient 3D Human Pose Estimation
DynVFX: Augmenting Real Videos with Dynamic Content
Dual Caption Preference Optimization for Diffusion Models
Indoor Heat Estimation from a Single Visible-Light Panorama
ELIP: Enhanced Visual-Language Foundation Models for Image Retrieval
Rethinking Multimodal Learning from the Perspective of Mitigating Classification Ability Disproportion
Cutting-edge 3D reconstruction solutions for underwater coral reef images: A review and comparison
Leveraging Vision-Language Models for Open-Vocabulary Instance Segmentation and Tracking
Morpheus: Benchmarking Physical Reasoning of Video Generative Models with Real Physical Experiments
Hierarchical Feature Learning for Medical Point Clouds via State Space Model
Generate, but Verify: Reducing Hallucination in Vision-Language Models with Retrospective Resampling
SSL4Eco: A Global Seasonal Dataset for Geospatial Foundation Models in Ecology
Is Artificial Intelligence Generated Image Detection a Solved Problem?
UniCTokens: Boosting Personalized Understanding and Generation via Unified Concept Tokens
GMatch: A Lightweight, Geometry-Constrained Keypoint Matcher for Zero-Shot 6DoF Pose Estimation in Robotic Grasp Tasks
Styl3R: Instant 3D Stylized Reconstruction for Arbitrary Scenes and Styles
Hierarchical Material Recognition from Local Appearance
Grounded Reinforcement Learning for Visual Reasoning
CReFT-CAD: Boosting Orthographic Projection Reasoning for CAD via Reinforcement Fine-Tuning
Reasoning-Aligned Perception Decoupling for Scalable Multi-modal Reasoning
Consistent Story Generation: Unlocking the Potential of Zigzag Sampling
GeoCAD: Local Geometry-Controllable CAD Generation with Large Language Models
G$^{2}$D: Boosting Multimodal Learning with Gradient-Guided Distillation
HOI-Dyn: Learning Interaction Dynamics for Human-Object Motion Diffusion
Advancing Complex Wide-Area Scene Understanding with Hierarchical Coresets Selection
Attention (as Discrete-Time Markov) Chains
Adaptive Convolutional Neural Network for Image Super-resolution
Principled Feature Disentanglement for High-Fidelity Unified Brain MRI Synthesis
A Synthetic Data-Driven Radiology Foundation Model for Pan-tumor Clinical Diagnosis
Geodesic Diffusion Models for Efficient Medical Image Enhancement
Nexus: An Omni-Perceptive And -Interactive Model for Language, Audio, And Vision
Real-time Spatial-temporal Traversability Assessment via Feature-based Sparse Gaussian Process
EvidMTL: Evidential Multi-Task Learning for Uncertainty-Aware Semantic Surface Mapping from Monocular RGB Images
Exploring the Limits of Vision-Language-Action Manipulations in Cross-task Generalization
SpectraLift: Physics-Guided Spectral-Inversion Network for Self-Supervised Hyperspectral Image Super-Resolution
Real-Time World Crafting: Generating Structured Game Behaviors from Natural Language with Large Language Models
$\mathcal{V}isi\mathcal{P}runer$: Decoding Discontinuous Cross-Modal Dynamics for Efficient Multimodal LLMs
DELULU: Discriminative Embedding Learning Using Latent Units for Speaker-Aware Self-Supervised Speech Foundational Model
UltraCUA: A Foundation Model for Computer Use Agents with Hybrid Action
Consistency is Key: Disentangling Label Variation in Natural Language Processing with Intra-Annotator Agreement
Synthetic Dataset for Evaluating Complex Compositional Knowledge for Natural Language Inference
LEME: Open Large Language Models for Ophthalmology with Advanced Reasoning and Clinical Validation
A Knapsack by Any Other Name: Presentation impacts LLM performance on NP-hard problems
Automated Evaluation of Meter and Rhyme in Russian Generative and Human-Authored Poetry
Leveraging Robust Optimization for LLM Alignment under Distribution Shifts
Thinking Out Loud: Do Reasoning Models Know When They're Right?
Understanding LLMs' Cross-Lingual Context Retrieval: How Good It Is And Where It Comes From
HCR-Reasoner: Synergizing Large Language Models and Theory for Human-like Causal Reasoning
MedScore: Generalizable Factuality Evaluation of Free-Form Medical Answers by Domain-adapted Claim Decomposition and Verification
Unifying Attention Heads and Task Vectors via Hidden State Geometry in In-Context Learning
Grounding Language with Vision: A Conditional Mutual Information Calibrated Decoding Strategy for Reducing Hallucinations in LVLMs
A*-Thought: Efficient Reasoning via Bidirectional Compression for Low-Resource Settings
A Controllable Examination for Long-Context Language Models
KG-Infused RAG: Augmenting Corpus-Based RAG with External Knowledge Graphs
AnTKV: Anchor Token-Aware Sub-Bit Vector Quantization for KV Cache in Large Language Models
Compressed and Smooth Latent Space for Text Diffusion Modeling
Value-Based Large Language Model Agent Simulation for Mutual Evaluation of Trust and Interpersonal Closeness
A social context-aware graph-based multimodal attentive learning framework for disaster content classification during emergencies: a benchmark dataset and method
Adaptive Data-Resilient Multi-Modal Hierarchical Multi-Label Book Genre Identification
Video-SafetyBench: A Benchmark for Safety Evaluation of Video LVLMs
MiLQ: Benchmarking IR Models for Bilingual Web Search with Mixed Language Queries
Humanity's Last Code Exam: Can Advanced LLMs Conquer Human's Hardest Code Competition?
ASCD: Attention-Steerable Contrastive Decoding for Reducing Hallucination in MLLM
MotionGPT3: Human Motion as a Second Modality
CrossRay3D: Geometry and Distribution Guidance for Efficient Multimodal 3D Detection
IAD-GPT: Advancing Visual Knowledge in Multimodal Large Language Model for Industrial Anomaly Detection
StripRFNet: A Strip Receptive Field and Shape-Aware Network for Road Damage Detection
ObjectTransforms for Uncertainty Quantification and Reduction in Vision-Based Perception for Autonomous Vehicles
C-arm Guidance: A Self-supervised Approach To Automated Positioning During Stroke Thrombectomy
DuetMatch: Harmonizing Semi-Supervised Brain MRI Segmentation via Decoupled Branch Optimization
Automated C-Arm Positioning via Conformal Landmark Localization
Cost Savings from Automatic Quality Assessment of Generated Images
Data-Centric AI for Tropical Agricultural Mapping: Challenges, Strategies and Scalable Solutions
StretchySnake: Flexible SSM Training Unlocks Action Recognition Across Spatio-Temporal Scales
VM-BeautyNet: A Synergistic Ensemble of Vision Transformer and Mamba for Facial Beauty Prediction
Designing a Convolutional Neural Network for High-Accuracy Oral Cavity Squamous Cell Carcinoma (OCSCC) Detection
Embody 3D: A Large-scale Multimodal Motion and Behavior Dataset
Proactive Scene Decomposition and Reconstruction
Stroke2Sketch: Harnessing Stroke Attributes for Training-Free Sketch Generation
Scaling Laws for Deepfake Detection
Scale-DiT: Ultra-High-Resolution Image Generation with Hierarchical Local Attention
TokenAR: Multiple Subject Generation via Autoregressive Token-level enhancement
On the Provable Importance of Gradients for Language-Assisted Image Clustering
MIRAD - A comprehensive real-world robust anomaly detection dataset for Mass Individualization
Demeter: A Parametric Model of Crop Plant Morphology from the Real World
REALM: An MLLM-Agent Framework for Open World 3D Reasoning Segmentation and Editing on Gaussian Splatting
LightGlueStick: a Fast and Robust Glue for Joint Point-Line Matching
RefAtomNet++: Advancing Referring Atomic Video Action Recognition using Semantic Retrieval based Multi-Trajectory Mamba
Enhancing Rotated Object Detection via Anisotropic Gaussian Bounding Box and Bhattacharyya Distance
Instance-Aware Pseudo-Labeling and Class-Focused Contrastive Learning for Weakly Supervised Domain Adaptive Segmentation of Electron Microscopy
NavQ: Learning a Q-Model for Foresighted Vision-and-Language Navigation
HGC-Avatar: Hierarchical Gaussian Compression for Streamable Dynamic 3D Avatars
PRISMM-Bench: A Benchmark of Peer-Review Grounded Multimodal Inconsistencies
OOS-DSD: Improving Out-of-stock Detection in Retail Images using Auxiliary Tasks
Fit for Purpose? Deepfake Detection in the Real World
VisionSelector: End-to-End Learnable Visual Token Compression for Efficient Multimodal LLMs
Self-Supervised Learning to Fly using Efficient Semantic Segmentation and Metric Depth Estimation for Low-Cost Autonomous UAVs
MultiVerse: A Multi-Turn Conversation Benchmark for Evaluating Large Vision and Language Models
HYDRA: HYbrid knowledge Distillation and spectral Reconstruction Algorithm for high channel hyperspectral camera applications
SDPA++: A General Framework for Self-Supervised Denoising with Patch Aggregation
Vision-Centric 4D Occupancy Forecasting and Planning via Implicit Residual World Models
UKANFormer: Noise-Robust Semantic Segmentation for Coral Reef Mapping via a Kolmogorov-Arnold Network-Transformer Hybrid
A Comprehensive Survey on World Models for Embodied AI
Visual Autoregressive Models Beat Diffusion Models on Inference Time Scaling
WaMaIR: Image Restoration via Multiscale Wavelet Convolutions and Mamba-based Channel Modeling with Texture Enhancement
GS2POSE: Marry Gaussian Splatting to 6D Object Pose Estimation
Segmentation as A Plug-and-Play Capability for Frozen Multimodal LLMs
Unsupervised Monocular Road Segmentation for Autonomous Driving via Scene Geometry
Personalized Image Filter: Mastering Your Photographic Style
An RGB-D Image Dataset for Lychee Detection and Maturity Classification for Robotic Harvesting
Robust Cross-Domain Adaptation in Texture Features Transferring for Wood Chip Moisture Content Prediction
From Mannequin to Human: A Pose-Aware and Identity-Preserving Video Generation Framework for Lifelike Clothing Display
2DGS-R: Revisiting the Normal Consistency Regularization in 2D Gaussian Splatting
BARL: Bilateral Alignment in Representation and Label Spaces for Semi-Supervised Volumetric Medical Image Segmentation
Registration is a Powerful Rotation-Invariance Learner for 3D Anomaly Detection
Uncovering Brain-Like Hierarchical Patterns in Vision-Language Models through fMRI-Based Neural Encoding
Class-N-Diff: Classification-Induced Diffusion Model Can Make Fair Skin Cancer Diagnosis
Uniworld-V2: Reinforce Image Editing with Diffusion Negative-aware Finetuning and MLLM Implicit Feedback
Contrail-to-Flight Attribution Using Ground Visible Cameras and Flight Surveillance Data
Beyond RGB: Leveraging Vision Transformers for Thermal Weapon Segmentation
Training-free Online Video Step Grounding
An empirical study of the effect of video encoders on Temporal Video Grounding
Do Satellite Tasks Need Special Pretraining?
Enrich and Detect: Video Temporal Grounding with Multimodal LLMs
Where, Not What: Compelling Video LLMs to Learn Geometric Causality for 3D-Grounding
Conditional Synthetic Live and Spoof Fingerprint Generation
Click, Predict, Trust: Clinician-in-the-Loop AI Segmentation for Lung Cancer CT-Based Prognosis within the Knowledge-to-Action Framework
Person Re-Identification via Generalized Class Prototypes
How Universal Are SAM2 Features?
ProDAT: Progressive Density-Aware Tail-Drop for Point Cloud Coding
Towards a Generalizable Fusion Architecture for Multimodal Object Detection
GSPlane: Concise and Accurate Planar Reconstruction via Structured Representation
Boosting Fidelity for Pre-Trained-Diffusion-Based Low-Light Image Enhancement via Condition Refinement
Towards Imperceptible Watermarking Via Environment Illumination for Consumer Cameras
KineDiff3D: Kinematic-Aware Diffusion for Category-Level Articulated Object Shape Reconstruction and Generation
Investigating Adversarial Robustness against Preprocessing used in Blackbox Face Recognition
Generation then Reconstruction: Accelerating Masked Autoregressive Models via Two-Stage Sampling
Capturing Head Avatar with Hand Contacts from a Monocular Video
HIDISC: A Hyperbolic Framework for Domain Generalization with Generalized Category Discovery
EndoCIL: A Class-Incremental Learning Framework for Endoscopic Image Classification
Optimizing DINOv2 with Registers for Face Anti-Spoofing
Enhanced Motion Forecasting with Plug-and-Play Multimodal Large Language Models
SG-CLDFF: A Novel Framework for Automated White Blood Cell Classification and Segmentation
Machine Vision-Based Surgical Lighting System:Design and Implementation
Exploring Structural Degradation in Dense Representations for Self-supervised Learning
LongInsightBench: A Comprehensive Benchmark for Evaluating Omni-Modal Models on Human-Centric Long-Video Understanding
CausalMamba: Scalable Conditional State Space Models for Neural Causal Inference
A Single Set of Adversarial Clothes Breaks Multiple Defense Methods in the Physical World
iDETEX: Empowering MLLMs for Intelligent DETailed EXplainable IQA
Nearest-Class Mean and Logits Agreement for Wildlife Open-Set Recognition
Exploring The Missing Semantics In Event Modality
Beyond Real Faces: Synthetic Datasets Can Achieve Reliable Recognition Performance without Privacy Compromise
Facial Expression-based Parkinson's Disease Severity Diagnosis via Feature Fusion and Adaptive Class Balancing
Closed-Loop Transfer for Weakly-supervised Affordance Grounding
Monitoring Horses in Stalls: From Object to Event Detection
DeepDetect: Learning All-in-One Dense Keypoints
Leveraging AV1 motion vectors for Fast and Dense Feature Matching
Rethinking Nighttime Image Deraining via Learnable Color Space Transformation
Initialize to Generalize: A Stronger Initialization Pipeline for Sparse-View 3DGS
Split-Fuse-Transport: Annotation-Free Saliency via Dual Clustering and Optimal Transport Alignment
WP-CrackNet: A Collaborative Adversarial Learning Framework for End-to-End Weakly-Supervised Road Crack Detection
PAGE-4D: Disentangled Pose and Geometry Estimation for 4D Perception
Expose Camouflage in the Water: Underwater Camouflaged Instance Segmentation and Dataset
ShapeCraft: LLM Agents for Structured, Textured and Interactive 3D Modeling
Integrating BIM and UAV-based photogrammetry for Automated 3D Structure Model Segmentation
One Dinomaly2 Detect Them All: A Unified Framework for Full-Spectrum Unsupervised Anomaly Detection
Self-supervised Pre-training for Mapping of Archaeological Stone Wall in Historic Landscapes Using High-Resolution DEM Derivatives
4DSegStreamer: Streaming 4D Panoptic Segmentation via Dual Threads
Towards 3D Objectness Learning in an Open World
Elastic ViTs from Pretrained Models without Retraining
Automatic Classification of Circulating Blood Cell Clusters based on Multi-channel Flow Cytometry Imaging
Raindrop GS: A Benchmark for 3D Gaussian Splatting under Raindrop Conditions
Can Image-To-Video Models Simulate Pedestrian Dynamics?
Joint Multi-Condition Representation Modelling via Matrix Factorisation for Visual Place Recognition
SparseVILA: Decoupling Visual Sparsity for Efficient VLM Inference
ConsistEdit: Highly Consistent and Precise Training-free Visual Editing
Patronus: Safeguarding Text-to-Image Models against White-Box Adversaries
Filtering of Small Components for Isosurface Generation
Unlocking Off-the-Grid Sparse Recovery with Unlimited Sensing: Simultaneous Super-Resolution in Time and Amplitude
Shape-aware Inertial Poser: Motion Tracking for Humans with Diverse Shapes Using Sparse Inertial Sensors
DiffVLA++: Bridging Cognitive Reasoning and End-to-End Driving through Metric-Guided Alignment
Detecting streaks in smart telescopes images with Deep Learning
Conveying Meaning through Gestures: An Investigation into Semantic Co-Speech Gesture Generation
ImaGGen: Zero-Shot Generation of Co-Speech Semantic Gestures Grounded in Language and Image Input
Rao-Blackwell Gradient Estimators for Equivariant Denoising Diffusion
Bayesian Computation in Deep Learning
Weak-to-Strong Generalization Even in Random Feature Networks, Provably
LLM as GNN: Graph Vocabulary Learning for Text-Attributed Graph Foundation Models
From Equations to Insights: Unraveling Symbolic Structures in PDEs with LLMs
Physics-Informed Deep B-Spline Networks
LANGTRAJ: Diffusion Model and Dataset for Language-Conditioned Trajectory Simulation
Score-based deterministic density sampling
Challenges and proposed solutions in modeling multimodal data: A systematic review
A Generic Framework for Conformal Fairness
UFT: Unifying Supervised and Reinforcement Fine-Tuning
PICT -- A Differentiable, GPU-Accelerated Multi-Block PISO Solver for Simulation-Coupled Learning Tasks in Fluid Dynamics
HERO: Heterogeneous Continual Graph Learning via Meta-Knowledge Distillation
Enhancing Efficiency and Exploration in Reinforcement Learning for LLMs
Temperature is All You Need for Generalization in Langevin Dynamics and other Markov Processes
Navigating the Latent Space Dynamics of Neural Models
Improved Best-of-Both-Worlds Regret for Bandits with Delayed Feedback
Neural Network Reprogrammability: A Unified Theme on Model Reprogramming, Prompt Tuning, and Prompt Instruction
Progressive Tempering Sampler with Diffusion
BLUR: A Bi-Level Optimization Approach for LLM Unlearning
FlexQuant: A Flexible and Efficient Dynamic Precision Switching Framework for LLM Quantization
GeoRecon: Graph-Level Representation Learning for 3D Molecules via Reconstruction-Based Pretraining
Improving Rectified Flow with Boundary Conditions
Online Learning of Whittle Indices for Restless Bandits with Non-Stationary Transition Kernels
ESSA: Evolutionary Strategies for Scalable Alignment
Greedy Low-Rank Gradient Compression for Distributed Learning with Convergence Guarantees
MatPROV: A Provenance Graph Dataset of Material Synthesis Extracted from Scientific Literature
Robust Anomaly Detection through Multi-Modal Autoencoder Fusion for Small Vehicle Damage Detection
Federated Conditional Conformal Prediction via Generative Models
Going with the Flow: Approximating Banzhaf Values via Graph Neural Networks
SWIR-LightFusion: Multi-spectral Semantic Fusion of Synthetic SWIR with Thermal IR (LWIR/MWIR) and RGB
Deep learning based numerical approximation algorithms for stochastic partial differential equations
The Moral Foundations Reddit Corpus
FIRE: A Failure-Adaptive Reinforcement Learning Framework for Edge Computing Migrations
Predicting Patient Recovery or Mortality Using Deep Neural Decision Tree and Forest
Conformal online model aggregation
Neural Dynamic Data Valuation: A Stochastic Optimal Control Approach
Approximately-symmetric neural networks for quantum spin liquids
GIST: Greedy Independent Set Thresholding for Max-Min Diversification with Submodular Utility
Estimating Treatment Effects under Recommender Interference: A Structured Neural Networks Approach
GeoReasoner: Geo-localization with Reasoning in Street Views using a Large Vision-Language Model
Accelerating MRI with Longitudinally-informed Latent Posterior Sampling
Adv-SSL: Adversarial Self-Supervised Representation Learning with Theoretical Guarantees
Invertible ResNets for Inverse Imaging Problems: Competitive Performance with Provable Regularization Properties
Flow Matching for Accelerated Simulation of Atomic Transport in Crystalline Materials
Learning Counterfactual Distributions via Kernel Nearest Neighbors
Emergent field theories from neural networks
Delta-Influence: Unlearning Poisons via Influence Functions
What should a neuron aim for? Designing local objective functions based on information theory
Auto-Prompt Generation is Not Robust: Prompt Optimization Driven by Pseudo Gradient
Improved Approximation Algorithms for Low-Rank Problems Using Semidefinite Optimization
Efficient and Responsible Adaptation of Large Language Models for Robust and Equitable Top-k Recommendations
Time-Varying Bayesian Optimization Without a Metronome
Large Language Diffusion Models
Nonlinear energy-preserving model reduction with lifting transformations that quadratize the energy
DitHub: A Modular Framework for Incremental Open-Vocabulary Object Detection
Reassessing Active Learning Adoption in Contemporary NLP: A Community Survey
Subgradient Method for System Identification with Non-Smooth Objectives
Fine-Grained Classification: Connecting Metadata via Cross-Contrastive Pre-Training
Traceback of Poisoning Attacks to Retrieval-Augmented Generation
Statistical Decision Theory with Counterfactual Loss
Learning Cocoercive Conservative Denoisers via Helmholtz Decomposition for Poisson Inverse Problems
Path Gradients after Flow Matching
Asymptotic Performance of Time-Varying Bayesian Optimization
Hyperspectral Anomaly Detection Fused Unified Nonconvex Tensor Ring Factors Regularization
A deep solver for backward stochastic Volterra integral equations
A Pure Hypothesis Test for Inhomogeneous Random Graph Models Based on a Kernelised Stein Discrepancy
ToMAP: Training Opponent-Aware LLM Persuaders with Theory of Mind
Rao-Blackwellised Reparameterisation Gradients
A Principled Path to Fitted Distributional Evaluation
Quantum Reinforcement Learning Trading Agent for Sector Rotation in the Taiwan Stock Market
Critically-Damped Higher-Order Langevin Dynamics for Generative Modeling
A fast algorithm for solving the lasso problem exactly without homotopy using differential inclusions
Observation-guided Interpolation Using Graph Neural Networks for High-Resolution Nowcasting in Switzerland
Who Taught the Lie? Responsibility Attribution for Poisoned Knowledge in Retrieval-Augmented Generation
Spacing Test for Fused Lasso
In Generative AI We (Dis)Trust? Computational Analysis of Trust and Distrust in Reddit Discussions
EgMM-Corpus: A Multimodal Vision-Language Dataset for Egyptian Culture
Towards Low-Resource Alignment to Diverse Perspectives with Sparse Feedback
Instant Personalized Large Language Model Adaptation via Hypernetwork
Utilising Large Language Models for Generating Effective Counter Arguments to Anti-Vaccine Tweets
FrugalPrompt: Reducing Contextual Overhead in Large Language Models via Token Attribution
TrajSelector: Harnessing Latent Representations for Efficient and Effective Best-of-N in Large Reasoning Model
RAVEN: Robust Advertisement Video Violation Temporal Grounding via Reinforcement Reasoning
Agree, Disagree, Explain: Decomposing Human Label Variation in NLI through the Lens of Explanations
Check Yourself Before You Wreck Yourself: Selectively Quitting Improves LLM Agent Safety
ReviewGuard: Enhancing Deficient Peer Review Detection via LLM-Driven Data Augmentation
Hallucination Benchmark for Speech Foundation Models
Fine-tuning of Large Language Models for Constituency Parsing Using a Sequence to Sequence Approach
Temporal Understanding under Deictic Frame of Reference
Investigating the Impact of Rationales for LLMs on Natural Language Understanding
so much depends / upon / a whitespace: Why Whitespace Matters for Poets and LLMs
Enhancing Language Agent Strategic Reasoning through Self-Play in Adversarial Games
Cross-Genre Authorship Attribution via LLM-Based Retrieve-and-Rerank
Does Visual Grounding Enhance the Understanding of Embodied Knowledge in Large Language Models?
ChiKhaPo: A Large-Scale Multilingual Benchmark for Evaluating Lexical Comprehension and Generation in Large Language Models
Prompt-MII: Meta-Learning Instruction Induction for LLMs
Back to Bytes: Revisiting Tokenization Through UTF-8
Vocab Diet: Reshaping the Vocabulary of LLMs with Vector Arithmetic
Online Learning Defense against Iterative Jailbreak Attacks via Prompt Optimization
DiscoTrack: A Multilingual LLM Benchmark for Discourse Tracking
SafeSearch: Do Not Trade Safety for Utility in LLM Search Agents
Rethinking On-policy Optimization for Query Augmentation
When AI companions become witty: Can human brain recognize AI-generated irony?
Wisdom is Knowing What not to Say: Hallucination-Free LLMs Unlearning via Attention Shifting
StreamingThinker: Large Language Models Can Think While Reading
From Preferences to Prejudice: The Role of Alignment Tuning in Shaping Social Bias in Video Diffusion Models
Explainability of Large Language Models: Opportunities and Challenges toward Generating Trustworthy Explanations
TaxoAlign: Scholarly Taxonomy Generation Using Language Models
Addressing Antisocial Behavior in Multi-Party Dialogs Through Multimodal Representation Learning
The Atomic Instruction Gap: Instruction-Tuned LLMs Struggle with Simple, Self-Contained Directives
Agentic Reinforcement Learning for Search is Unsafe
Multilingual Clinical NER for Diseases and Medications Recognition in Cardiology Texts using BERT Embeddings
Evaluating Large Language Models on Urdu Idiom Translation
Disparities in Multilingual LLM-Based Healthcare Q&A
ReXMoE: Reusing Experts with Minimal Overhead in Mixture-of-Experts
Empowering Real-World: A Survey on the Technology, Practice, and Evaluation of LLM-driven Industry Agents
Deep Self-Evolving Reasoning
Lingua Custodi's participation at the WMT 2025 Terminology shared task
Annotation-Efficient Universal Honesty Alignment
When Annotators Disagree, Topology Explains: Mapper, a Topological Tool for Exploring Text Embedding Geometry and Ambiguity
Language Confusion Gate: Language-Aware Decoding Through Model Self-Distillation
LawChain: Modeling Legal Reasoning Chains for Chinese Tort Case Analysis
Forget to Know, Remember to Use: Context-Aware Unlearning for Large Language Models
Qomhra: A Bilingual Irish-English Large Language Model
Towards Mining Effective Pedagogical Strategies from Learner-LLM Educational Dialogues
QueST: Incentivizing LLMs to Generate Difficult Problems
Evaluating Medical LLMs by Levels of Autonomy: A Survey Moving from Benchmarks to Applications
HealthDial: A No-Code LLM-Assisted Dialogue Authoring Tool for Healthcare Virtual Agents
PrivacyPAD: A Reinforcement Learning Framework for Dynamic Privacy-Aware Delegation
SIADAFIX: issue description response for adaptive program repair
Cerberus: Real-Time Video Anomaly Detection via Cascaded Vision-Language Models
Investigating the Association Between Text-Based Indications of Foodborne Illness from Yelp Reviews and New York City Health Inspection Outcomes (2023)
What Questions Should Robots Be Able to Answer? A Dataset of User Questions for Explainable Robotics
Verifiable Fine-Tuning for LLMs: Zero-Knowledge Training Proofs Bound to Data Provenance and Policy
Res-Bench: Benchmarking the Robustness of Multimodal Large Language Models to Dynamic Resolution Input
A Prototypical Network with an Attention-based Encoder for Drivers Identification Application
Adaptive Discretization for Consistency Models
Uncertainty-aware data assimilation through variational inference
Breaking and Fixing Defenses Against Control-Flow Hijacking in Multi-Agent Systems
Symmetries in PAC-Bayesian Learning
Disentanglement Beyond Static vs. Dynamic: A Benchmark and Evaluation Framework for Multi-Factor Sequential Representations
Model Metamers Reveal Invariances in Graph Neural Networks
Beyond Binary Out-of-Distribution Detection: Characterizing Distributional Shifts with Multi-Statistic Diffusion Trajectories
Latent Spaces Beyond Synthesis: From GANs to Diffusion Models
Exploration via Feature Perturbation in Contextual Bandits
Finite-Time Bounds for Average-Reward Fitted Q-Iteration
MILES: Modality-Informed Learning Rate Scheduler for Balancing Multimodal Learning
RINS-T: Robust Implicit Neural Solvers for Time Series Linear Inverse Problems
S4ECG: Exploring the impact of long-range interactions for arrhythmia prediction
A Conditional Diffusion Model for Probabilistic Prediction of Battery Capacity Degradation
Diffusion Models as Dataset Distillation Priors
Deeper with Riemannian Geometry: Overcoming Oversmoothing and Oversquashing for Graph Foundation Models
Explainable AI for microseismic event detection
CrossStateECG: Multi-Scale Deep Convolutional Network with Attention for Rest-Exercise ECG Biometrics
Towards geological inference with process-based and deep generative modeling, part 2: inversion of fluvial deposits and latent-space disentanglement
Unified Privacy Guarantees for Decentralized Learning via Matrix Factorization
Local properties of neural networks through the lens of layer-wise Hessians
Stochastic Difference-of-Convex Optimization with Momentum
Convergence Rates for Gradient Descent on the Edge of Stability in Overparametrised Least Squares
SAFE-D: A Spatiotemporal Detection Framework for Abnormal Driving Among Parkinson's Disease-like Drivers
Curiosity Meets Cooperation: A Game-Theoretic Approach to Long-Tail Multi-Label Learning
Mitigating Clever Hans Strategies in Image Classifiers through Generating Counterexamples
How Does Label Noise Gradient Descent Improve Generalization in the Low SNR Regime?
Reliable Inference in Edge-Cloud Model Cascades via Conformal Alignment
TrajMamba: An Efficient and Semantic-rich Vehicle Trajectory Pre-training Model
The Free Transformer
Formally Exploring Time-Series Anomaly Detection Evaluation Metrics
Semi-supervised Latent Bayesian Optimization for Designing Antimicrobial Peptides
ZACH-ViT: A Zero-Token Vision Transformer with ShuffleStrides Data Augmentation for Robust Lung Ultrasound Classification
Handling Extreme Class Imbalance: Using GANs in Data Augmentation for Suicide Prediction
Efficient Algorithms for Mitigating Uncertainty and Risk in Reinforcement Learning
Enabling Fine-Grained Operating Points for Black-Box LLMs
Atlas-based Manifold Representations for Interpretable Riemannian Machine Learning
Inference-Time Compute Scaling For Flow Matching
Functional Distribution Networks (FDN)
Time Series Analysis in Frequency Domain: A Survey of Open Challenges, Opportunities and Benchmarks
Geometric Dynamics of Consumer Credit Cycles: A Multivector-based Linear-Attention Framework for Explanatory Economic Analysis
LLM-VeriPPA: Power, Performance, and Area Optimization aware Verilog Code Generation with Large Language Models
Bitcoin Price Forecasting Based on Hybrid Variational Mode Decomposition and Long Short Term Memory Network
Quantum and Classical Machine Learning in Decentralized Finance: Comparative Evidence from Multi-Asset Backtesting of Automated Market Makers
TeLLMe v2: An Efficient End-to-End Ternary LLM Prefill and Decode Accelerator with Table-Lookup Matmul on Edge FPGAs
Dynamic Factor Analysis of Price Movements in the Philippine Stock Exchange
Attention to Non-Adopters
Aligning Language Models with Investor and Market Behavior for Financial Recommendations
The Invisible Handshake: Tacit Collusion between Adaptive Market Agents
Convolutional Attention in Betting Exchange Markets
Data for Inclusion: The Redistributive Power of Data Economics
AGNES: Adaptive Graph Neural Network and Dynamic Programming Hybrid Framework for Real-Time Nanopore Seed Chaining
A Storm-Centric 250 m NEXRAD Level-II Dataset for High-Resolution ML Nowcasting
A Novel GPT-Based Framework for Anomaly Detection in System Logs
Differentiable, Bit-shifting, and Scalable Quantization without training neural network from scratch
Identifying multi-omics interactions for lung cancer drug targets discovery using Kernel Machine Regression
Facts in Stats: Impacts of Pretraining Diversity on Language Model Generalization
The Hidden Cost of Modeling P(X): Vulnerability to Membership Inference Attacks in Generative Text Classifiers
Learning density ratios in causal inference using Bregman-Riesz regression
The Cultural Mapping and Pattern Analysis (CMAP) Visualization Toolkit: Open Source Text Analysis for Qualitative and Computational Social Science
Extending Prediction-Powered Inference through Conformal Prediction
Personalized Collaborative Learning with Affinity-Based Variance Reduction
DiffusionX: Efficient Edge-Cloud Collaborative Image Generation with Multi-Round Prompt Evolution
RL makes MLLMs see better than SFT
MLCPD: A Unified Multi-Language Code Parsing Dataset with Universal AST Schema
iWatchRoadv2: Pothole Detection, Geospatial Mapping, and Intelligent Road Governance
Blending Learning to Rank and Dense Representations for Efficient and Effective Cascades
AoI-Aware Task Offloading and Transmission Optimization for Industrial IoT Networks: A Branching Deep Reinforcement Learning Approach
A Relative Error-Based Evaluation Framework of Heterogeneous Treatment Effect Estimators
VIPAMIN: Visual Prompt Initialization via Embedding Selection and Subspace Expansion
Edge-Based Speech Transcription and Synthesis for Kinyarwanda and Swahili Languages
From Reviews to Actionable Insights: An LLM-Based Approach for Attribute and Feature Extraction
Multi-Marginal Schr\"odinger Bridge Matching
Accelerated Learning on Large Scale Screens using Generative Library Models
A three-step machine learning approach to predict market bubbles with financial news
A Versatile Framework for Designing Group-Sparse Adversarial Attacks
ARCO-BO: Adaptive Resource-aware COllaborative Bayesian Optimization for Heterogeneous Multi-Agent Design
Escaping Model Collapse via Synthetic Data Verification: Near-term Improvements and Long-term Convergence
Universal and Transferable Attacks on Pathology Foundation Models
Robust Dynamic Staffing with Predictions
Infinite Neural Operators: Gaussian processes on functions
Connecting Domains and Contrasting Samples: A Ladder for Domain Generalization
DistilLock: Safeguarding LLMs from Unauthorized Knowledge Distillation on the Edge
U-Codec: Ultra Low Frame-rate Neural Speech Codec for Fast High-fidelity Speech Generation
Local regression on path spaces with signature metrics
A Control-Theoretic Approach to Dynamic Payment Routing for Success Rate Optimization
Kernel-Based Nonparametric Tests For Shape Constraints
Prominence-Aware Artifact Detection and Dataset for Image Super-Resolution
Near-Optimal Quantum Algorithms for Computing (Coarse) Correlated Equilibria of General-Sum Games
Black-box Optimization of LLM Outputs by Asking for Directions
Prediction-Augmented Trees for Reliable Statistical Inference
A Topological Approach to Parameterizing Deep Hedging Networks
Adaptive Sample Sharing for Linear Regression
Bits Leaked per Query: Information-Theoretic Bounds on Adversarial Attacks against LLMs
Extended LSTM: Adaptive Feature Gating for Toxic Comment Classification
Mapping from Meaning: Addressing the Miscalibration of Prompt-Sensitive Language Models
Mode Collapse of Mean-Field Variational Inference
Convergence of Regret Matching in Potential Games and Constrained Optimization
DFNN: A Deep Fr\'echet Neural Network Framework for Learning Metric-Space-Valued Responses
HyperSearch: Prediction of New Hyperedges through Unconstrained yet Efficient Search
QR\"iS: A Preemptive Novel Method for Quishing Detection Through Structural Features of QR
High-Level Multi-Robot Trajectory Planning And Spurious Behavior Detection
Fair and Interpretable Deepfake Detection in Videos
Optimal Best Arm Identification under Differential Privacy
M2H: Multi-Task Learning with Efficient Window-Based Cross-Task Attention for Monocular Spatial Perception
Recurrent Attention-based Token Selection for Efficient Streaming Video-LLMs
Quantifying Climate Policy Action and Its Links to Development Outcomes: A Cross-National Data-Driven Analysis
Estimating Orbital Parameters of Direct Imaging Exoplanet Using Neural Network
Certified Self-Consistency: Statistical Guarantees and Test-Time Training for Reliable Reasoning in LLMs
DETree: DEtecting Human-AI Collaborative Texts via Tree-Structured Hierarchical Representation Learning
AWARE: Audio Watermarking with Adversarial Resistance to Edits
Plasma Shape Control via Zero-shot Generative Reinforcement Learning
OncoReason: Structuring Clinical Reasoning in LLMs for Robust and Interpretable Survival Prediction
Non-asymptotic error bounds for probability flow ODEs under weak log-concavity
Just-In-Time Piecewise-Linear Semantics for ReLU-type Networks
Quantum Federated Learning: Architectural Elements and Future Directions
Quantum Synthetic Data Generation for Industrial Bioprocess Monitoring
GAS: Improving Discretization of Diffusion ODEs via Generalized Adversarial Solver
The Marked Edge Walk: A Novel MCMC Algorithm for Sampling of Graph Partitions
Train for Truth, Keep the Skills: Binary Retrieval-Augmented Reward Mitigates Hallucinations
Efficient Tensor Completion Algorithms for Highly Oscillatory Operators
VERA-V: Variational Inference Framework for Jailbreaking Vision-Language Models
Glyph: Scaling Context Windows via Visual-Text Compression
HUMAP: Hierarchical Uniform Manifold Approximation and Projection
Identification and Adaptive Control of Markov Jump Systems: Sample Complexity and Regret Bounds
Transfer Q-learning
UniCrossFi: A Unified Framework For Cross-Domain Wi-Fi-based Gesture Recognition
Neural Green's Operators for Parametric Partial Differential Equations
Absolute abstraction: a renormalisation group approach
Identifiable Latent Bandits: Leveraging observational data for personalized decision-making
Navigating Uncertainties in Machine Learning for Structural Dynamics: A Comprehensive Survey of Probabilistic and Non-Probabilistic Approaches in Forward and Inverse Problems
Solving Oscillator Ordinary Differential Equations in the Time Domain with High Performance via Soft-constrained Physics-informed Neural Network with Small Data
Channel Matters: Estimating Channel Influence for Multivariate Time Series
Riemannian Federated Learning via Averaging Gradient Streams
Intrinsic Dimensionality of Fermi-Pasta-Ulam-Tsingou High-Dimensional Trajectories Through Manifold Learning: A Linear Approach
OneProt: Towards Multi-Modal Protein Foundation Models
SAFES: Sequential Privacy and Fairness Enhancing Data Synthesis for Responsible AI
Understanding Generalization of Federated Learning: the Trade-off between Model Stability and Optimization
A Survey and Benchmarking of Spatial-Temporal Traffic Data Imputation Models
CEReBrO: Compact Encoder for Representations of Brain Oscillations Using Efficient Alternating Attention
KL-Regularized RLHF with Multiple Reference Models: Exact Solutions and Sample Complexity
Boosting Graph Robustness Against Backdoor Attacks: An Over-Similarity Perspective
Membership Inference Attack Should Move On to Distributional Statistics for Distilled Generative Models
Fire-EnSF: Wildfire Spread Data Assimilation using Ensemble Score Filter
Hydrogen production from blended waste biomass: pyrolysis, thermodynamic-kinetic analysis and AI-based modelling
User Profiles of Sleep Disorder Sufferers: Towards Explainable Clustering and Differential Variable Analysis
STAR: Boosting Time Series Foundation Models for Anomaly Detection through State-aware Adapter
Decision-focused Sensing and Forecasting for Adaptive and Rapid Flood Response: An Implicit Learning Approach
Transfer learning strategies for accelerating reinforcement-learning-based flow control
Airfoil optimization using Design-by-Morphing with minimized design-space dimensionality
Feature-driven reinforcement learning for photovoltaic in continuous intraday trading
Breaking Memorization Barriers in LLM Code Fine-Tuning via Information Bottleneck for Improved Generalization
Unifying Polymer Modeling and Design via a Conformation-Centric Generative Foundation Model
A tutorial on discovering and quantifying the effect of latent causal sources of multimodal EHR data
Near-Equilibrium Propagation training in nonlinear wave systems
FSRF: Factorization-guided Semantic Recovery for Incomplete Multimodal Sentiment Analysis
Zero-shot World Models via Search in Memory
A Minimal-Assumption Analysis of Q-Learning with Time-Varying Policies
Expert Merging in Sparse Mixture of Experts with Nash Bargaining
Zeroth-Order Sharpness-Aware Learning with Exponential Tilting
Still Competitive: Revisiting Recurrent Models for Irregular Time Series Prediction
AtomBench: A Benchmark for Generative Atomic Structure Models using GPT, Diffusion, and Flow Architectures
Alignment is Localized: A Causal Probe into Preference Layers
Human-Allied Relational Reinforcement Learning
Explore-then-Commit for Nonstationary Linear Bandits with Latent Dynamics
Benchmarking noisy label detection methods
One-Bit Quantization for Random Features Models
WEBSERV: A Browser-Server Environment for Efficient Training of Reinforcement Learning-based Web Agents at Scale
QSVD: Efficient Low-rank Approximation for Unified Query-Key-Value Weight Compression in Low-Precision Vision-Language Models
Toward General Digraph Contrastive Learning: A Dual Spatial Perspective
Memorizing Long-tail Data Can Help Generalization Through Composition
MGTS-Net: Exploring Graph-Enhanced Multimodal Fusion for Augmented Time Series Forecasting
Sparse Transformer Architectures via Regularized Wasserstein Proximal Operator with $L_1$ Prior
Colliding with Adversaries at ECML-PKDD 2025 Adversarial Attack Competition 1st Prize Solution
Colliding with Adversaries at ECML-PKDD 2025 Model Robustness Competition 1st Prize Solution
Buzz, Choose, Forget: A Meta-Bandit Framework for Bee-Like Decision Making
SCALAR: Self-Calibrating Adaptive Latent Attention Representation Learning
eDCF: Estimating Intrinsic Dimension using Local Connectivity
Realizing LLMs' Causal Potential Requires Science-Grounded, Novel Benchmarks
NeurIPT: Foundation Model for Neural Interfaces
Copy-Augmented Representation for Structure Invariant Template-Free Retrosynthesis
On the Impossibility of Retrain Equivalence in Machine Unlearning
Simulation-free Structure Learning for Stochastic Dynamics
Evaluating protein binding interfaces with PUMBA
Active Target Discovery under Uninformative Prior: The Power of Permanent and Transient Memory
High-Dimensional Privacy-Utility Dynamics of Noisy Stochastic Gradient Descent on Least Squares
CLIP: Client-Side Invariant Pruning for Mitigating Stragglers in Secure Federated Learning
Resolution-Aware Retrieval Augmented Zero-Shot Forecasting
LSTM-Based Forecasting and Analysis of EV Charging Demand in a Dense Urban Campus
Zero-Shot Performance Prediction for Probabilistic Scaling Laws
An Efficient Semantic Segmentation Decoder for In-Car or Distributed Applications
3D-GSRD: 3D Molecular Graph Auto-Encoder with Selective Re-mask Decoding
Computational Budget Should Be Considered in Data Selection
Graph Learning is Suboptimal in Causal Bandits
Trace Regularity PINNs: Enforcing $\mathrm{H}^{\frac{1}{2}}(\partial \Omega)$ for Boundary Data
Finding Manifolds With Bilinear Autoencoders
ProtoMol: Enhancing Molecular Property Prediction via Prototype-Guided Multimodal Learning
UniGTE: Unified Graph-Text Encoding for Zero-Shot Generalization across Graph Tasks and Domains
DeepChem Equivariant: SE(3)-Equivariant Support in an Open-Source Molecular Machine Learning Library
SolverLLM: Leveraging Test-Time Scaling for Optimization Problem via LLM-Guided Search
Closing the Curvature Gap: Full Transformer Hessians and Their Implications for Scaling Laws
Differentially Private Linear Regression and Synthetic Data Generation with Statistical Guarantees
Towards Interpretable and Trustworthy Time Series Reasoning: A BlueSky Vision
MuonBP: Faster Muon via Block-Periodic Orthogonalization
Graph4MM: Weaving Multimodal Learning with Structural Information
EEschematic: Multimodal-LLM Based AI Agent for Schematic Generation of Analog Circuit
Forgetting to Forget: Attention Sink as A Gateway for Backdooring LLM Unlearning
Hephaestus: Mixture Generative Modeling with Energy Guidance for Large-scale QoS Degradation
Diverse Influence Component Analysis: A Geometric Approach to Nonlinear Mixture Identifiability
Consistent Zero-Shot Imitation with Contrastive Goal Inference
Data Reliability Scoring
On the Universal Near Optimality of Hedge in Combinatorial Settings
Adapting to Stochastic and Adversarial Losses in Episodic MDPs with Aggregate Bandit Feedback
Fighter: Unveiling the Graph Convolutional Nature of Transformers in Time Series Modeling
Matricial Free Energy as a Gaussianizing Regularizer: Enhancing Autoencoders for Gaussian Code Generation
Continuous Q-Score Matching: Diffusion Guided Reinforcement Learning for Continuous-Time Control
In-situ Autoguidance: Eliciting Self-Correction in Diffusion Models
Learning After Model Deployment
ALPINE: A Lightweight and Adaptive Privacy-Decision Agent Framework for Dynamic Edge Crowdsensing
Robustness in Text-Attributed Graph Learning: Insights, Trade-offs, and New Defenses
A Standardized Benchmark for Machine-Learned Molecular Dynamics using Weighted Ensemble Sampling
SOLE: Hardware-Software Co-design of Softmax and LayerNorm for Efficient Transformer Inference
CooT: Learning to Coordinate In-Context with Coordination Transformers
Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning
A Markovian Framing of WaveFunctionCollapse for Procedurally Generating Aesthetically Complex Environments
From Next Token Prediction to (STRIPS) World Models -- Preliminary Results
Graph Neural Networks for the Offline Nanosatellite Task Scheduling Problem
Membership Privacy Risks of Sharpness Aware Minimization
Diffusion Models as Constrained Samplers for Optimization with Unknown Constraints
LinkedIn Post Embeddings: Industrial Scale Embedding Generation and Usage across LinkedIn
Predicting High-precision Depth on Low-Precision Devices Using 2D Hilbert Curves
Target Networks and Over-parameterization Stabilize Off-policy Bootstrapping with Function Approximation
Eye-for-an-eye: Appearance Transfer with Semantic Correspondence in Diffusion Models
Exploration of Marker-Based Approaches in Argument Mining through Augmented Natural Language
MoFO: Momentum-Filtered Optimizer for Mitigating Forgetting in LLM Fine-Tuning
EasyRec: Simple yet Effective Language Models for Recommendation
Familiarity-Aware Evidence Compression for Retrieval-Augmented Generation
Packet Inspection Transformer: A Self-Supervised Journey to Unseen Malware Detection with Few Samples
A Prospect-Theoretic Policy Gradient Framework for Behaviorally Nuanced Reinforcement Learning
Beyond Uncertainty Quantification: Learning Uncertainty for Trust-Informed Neural Network Decisions - A Case Study in COVID-19 Classification
Simplicity Prevails: Rethinking Negative Preference Optimization for LLM Unlearning
Parameter Efficient Fine-tuning via Explained Variance Adaptation
HardNet: Hard-Constrained Neural Networks with Universal Approximation Guarantees
Enhancing Osteoporosis Detection: An Explainable Multi-Modal Learning Framework with Feature Fusion and Variable Clustering
An Empirical Study on LLM-based Agents for Automated Bug Fixing
Diffusion Transformers as Open-World Spatiotemporal Foundation Models
Improving training time and GPU utilization in geo-distributed language model training
Free$^2$Guide: Training-Free Text-to-Video Alignment using Image LVLM
StarWhisper Telescope: An AI framework for automating end-to-end astronomical observations
Tracing Partisan Bias to Its Emotional Fingerprints: A Computational Approach to Mitigation
Consistency of Responses and Continuations Generated by Large Language Models on Social Media
GFM-RAG: Graph Foundation Model for Retrieval Augmented Generation
VolleyBots: A Testbed for Multi-Drone Volleyball Game Combining Motion Control and Strategic Play
Harmony in Divergence: Towards Fast, Accurate, and Memory-efficient Zeroth-order LLM Fine-tuning
Seeing in the Dark: A Teacher-Student Framework for Dark Video Action Recognition via Knowledge Distillation and Contrastive Learning
Towards Principled Unsupervised Multi-Agent Reinforcement Learning
Manual2Skill: Learning to Read Manuals and Acquire Robotic Skills for Furniture Assembly Using Vision-Language Models
GRIFFIN: Effective Token Alignment for Faster Speculative Decoding
Repo2Run: Automated Building Executable Environment for Code Repository at Scale
Cross-Domain Graph Anomaly Detection via Test-Time Training with Homophily-Guided Self-Supervision
FetalCLIP: A Visual-Language Foundation Model for Fetal Ultrasound Image Analysis
Large Language Models are Powerful Electronic Health Record Encoders
Hallucination Detection in LLMs Using Spectral Features of Attention Maps
$Q\sharp$: Provably Optimal Distributional RL for LLM Post-Training
Robust Deterministic Policy Gradient for Disturbance Attenuation and Its Application to Quadrotor Control
Robust Optimization with Diffusion Models for Green Security
GEM: Empowering MLLM for Grounded ECG Understanding with Time Series and Images
Late Fusion and Multi-Level Fission Amplify Cross-Modal Transfer in Text-Speech LMs
The Shape of Attraction in UMAP: Exploring the Embedding Forces in Dimensionality Reduction
DeepSeek-Inspired Exploration of RL-based LLMs and Synergy with Wireless Networks: A Survey
Provably Efficient Reward Transfer in Reinforcement Learning with Discrete Markov Decision Processes
Unseen from Seen: Rewriting Observation-Instruction Using Foundation Models for Augmenting Vision-Language Navigation
When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoning
Exploiting Meta-Learning-based Poisoning Attacks for Graph Link Prediction
Error Broadcast and Decorrelation as a Potential Artificial and Natural Learning Mechanism
LLMTaxo: Leveraging Large Language Models for Constructing Taxonomy of Factual Claims from Social Media
CodeVisionary: An Agent-based Framework for Evaluating Large Language Models in Code Generation
LLM-Enhanced Black-Litterman Portfolio Optimization
Improving Coverage in Combined Prediction Sets with Weighted p-values
Intrinsic Self-Correction in LLMs: Towards Explainable Prompting via Mechanistic Interpretability
PsyMem: Fine-grained psychological alignment and Explicit Memory Control for Advanced Role-Playing LLMs
When majority rules, minority loses: bias amplification of gradient descent
Incentivizing Truthful Language Models via Peer Elicitation Games
Hard Negatives, Hard Lessons: Revisiting Training Data Quality for Robust Information Retrieval with LLMs
Understanding Prompt Tuning and In-Context Learning via Meta-Learning
CLIMB: Class-imbalanced Learning Benchmark on Tabular Data
Towards Evaluating Proactive Risk Awareness of Multimodal Language Models
CrossRF: A Domain-Invariant Deep Learning Approach for RF Fingerprinting
DOGe: Defensive Output Generation for LLM Protection Against Knowledge Distillation
DISCOVER: Automated Curricula for Sparse-Reward Reinforcement Learning
Efficient Large Language Model Inference with Neural Block Linearization
RocqStar: Leveraging Similarity-driven Retrieval and Agentic Systems for Rocq generation
VERINA: Benchmarking Verifiable Code Generation
REASONING GYM: Reasoning Environments for Reinforcement Learning with Verifiable Rewards
SATA-BENCH: Select All That Apply Benchmark for Multiple Choice Questions
KG-TRACES: Enhancing Large Language Models with Knowledge Graph-constrained Trajectory Reasoning and Attribution Supervision
CoVoMix2: Advancing Zero-Shot Dialogue Generation with Fully Non-Autoregressive Flow Matching
VisuRiddles: Fine-grained Perception is a Primary Bottleneck for Multimodal Large Language Models in Abstract Visual Reasoning
RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics
HauntAttack: When Attack Follows Reasoning as a Shadow
Denoising the Future: Top-p Distributions for Moving Through Time
Code Execution as Grounded Supervision for LLM Reasoning
Every Rollout Counts: Optimal Resource Allocation for Efficient Test-Time Scaling
From Multimodal Perception to Strategic Reasoning: A Survey on AI-Generated Game Commentary
GeNIE: A Generalizable Navigation System for In-the-Wild Environments
Client Clustering Meets Knowledge Sharing: Enhancing Privacy and Robustness in Personalized Peer-to-Peer Learning
From Cradle to Cane: A Two-Pass Framework for High-Fidelity Lifespan Face Aging
AI-Generated Video Detection via Perceptual Straightening
DP-Fusion: Token-Level Differentially Private Inference for Large Language Models
Controlling What You Share: Assessing Language Model Adherence to Privacy Preferences
Multimodal Fusion at Three Tiers: Physics-Driven Data Generation and Vision-Language Guidance for Brain Tumor Segmentation
From Sequence to Structure: Uncovering Substructure Reasoning in Transformers
Adaptive Policy Synchronization for Scalable Reinforcement Learning
ReDi: Rectified Discrete Flow
Why and How Auxiliary Tasks Improve JEPA Representations
Pursuing Minimal Sufficiency in Spatial Reasoning
On the Granularity of Causal Effect Identifiability
Natural Language Processing Applications in Cardiology: A Narrative Review
HumanCM: One Step Human Motion Prediction
The Chameleon Nature of LLMs: Quantifying Multi-Turn Stance Instability in Search-Enabled Language Models
Eliciting Grounded Chain-of-Thought Reasoning in 3D Scenes
Beacon: Single-Turn Diagnosis and Mitigation of Latent Sycophancy in Large Language Models
SAMOSA: Sharpness Aware Minimization for Open Set Active learning
Region in Context: Text-condition Image editing with Human-like semantic reasoning
Learning to play: A Multimodal Agent for 3D Game-Play
EMRRG: Efficient Fine-Tuning Pre-trained X-ray Mamba Networks for Radiology Report Generation
Xiaoice: Training-Free Video Understanding via Self-Supervised Spatio-Temporal Clustering of Semantic Features
LC-Eval: A Bilingual Multi-Task Evaluation Benchmark for Long-Context Understanding
More with Less: An Empirical Study of Turn-Control Strategies for Efficient Coding Agents
MOSAIC: Masked Objective with Selective Adaptation for In-domain Contrastive Learning
Mixed-Precision Quantization for Language Models: Techniques and Prospects
Improving Model Representation and Reducing KV Cache via Skip Connections with First Value Heads
When Many-Shot Prompting Fails: An Empirical Study of LLM Code Translation
Needles in the Landscape: Semi-Supervised Pseudolabeling for Archaeological Site Discovery under Label Scarcity
Knowing the Facts but Choosing the Shortcut: Understanding How Large Language Models Compare Entities
Efficient High-Accuracy PDEs Solver with the Linear Attention Neural Operator
ReefNet: A Large scale, Taxonomically Enriched Dataset and Benchmark for Hard Coral Classification
Who's Asking? Simulating Role-Based Questions for Conversational AI Evaluation
Schr\"odinger Bridge Mamba for One-Step Speech Enhancement
FinSight: Towards Real-World Financial Deep Research
Neuronal Group Communication for Efficient Neural representation
Agentic Inequality
ArmFormer: Lightweight Transformer Architecture for Real-Time Multi-Class Weapon Segmentation and Classification
DrivAerStar: An Industrial-Grade CFD Dataset for Vehicle Aerodynamic Optimization
Fly-CL: A Fly-Inspired Framework for Enhancing Efficient Decorrelation and Reduced Training Time in Pre-trained Model-based Continual Representation Learning
Utility-Diversity Aware Online Batch Selection for LLM Supervised Fine-tuning
Investigating Safety Vulnerabilities of Large Audio-Language Models Under Speaker Emotional Variations
Adaptive Online Learning with LSTM Networks for Energy Price Prediction
SNOMED CT-powered Knowledge Graphs for Structured Clinical Data and Diagnostic Reasoning
A Lightweight DL Model for Smart Grid Power Forecasting with Feature and Resolution Mismatch
Domain Generalizable Continual Learning
SAKE: Towards Editing Auditory Attribute Knowledge of Large Audio-Language Models
UNDREAM: Bridging Differentiable Rendering and Photorealistic Simulation for End-to-end Adversarial Attacks
Tutoring LLM into a Better CUDA Optimizer
A Primer on Kolmogorov-Arnold Networks (KANs) for Probabilistic Time Series Forecasting
Peering Inside the Black Box: Uncovering LLM Errors in Optimization Modelling through Component-Level Evaluation
Quantile Regression, Variational Autoencoders, and Diffusion Models for Uncertainty Quantification: A Spatial Analysis of Sub-seasonal Wind Speed Prediction
Leave It to the Experts: Detecting Knowledge Distillation via MoE Expert Signatures
Foundation Models in Medical Image Analysis: A Systematic Review and Meta-Analysis
One-step Diffusion Models with Bregman Density Ratio Matching
Parameter-Efficient Fine-Tuning for Low-Resource Languages: A Comparative Study of LLMs for Bengali Hate Speech Detection
CARE: Contrastive Alignment for ADL Recognition from Event-Triggered Sensor Streams
ReclAIm: A multi-agent framework for degradation-aware performance tuning of medical imaging AI
Justitia: Fair and Efficient Scheduling for LLM Applications
Curiosity-driven RL for symbolic equation solving
DINO-CVA: A Multimodal Goal-Conditioned Vision-to-Action Model for Autonomous Catheter Navigation
Video Reasoning without Training
The Ends Justify the Thoughts: RL-Induced Motivated Reasoning in LLMs
Bitwidth-Specific Logarithmic Arithmetic for Future Hardware-Accelerated Training
Investigating Thinking Behaviours of Reasoning-Based Language Models for Social Bias Mitigation
Explainable Heterogeneous Anomaly Detection in Financial Networks via Adaptive Expert Routing
Can Transformer Memory Be Corrupted? Investigating Cache-Side Vulnerabilities in Large Language Models
Verification-Aware Planning for Multi-Agent Systems
Efficient Vision-Language-Action Models for Embodied Manipulation: A Systematic Survey
DVAGen: Dynamic Vocabulary Augmented Generation
GOOD: Training-Free Guided Diffusion Sampling for Out-of-Distribution Detection
Do LLMs Recognize Your Latent Preferences? A Benchmark for Latent Information Discovery in Personalized Interaction
GACO-CAD: Geometry-Augmented and Conciseness-Optimized CAD Model Generation from Single Image
TREAT: A Code LLMs Trustworthiness / Reliability Evaluation and Testing Framework
Benchmarking Out-of-Distribution Detection for Plankton Recognition: A Systematic Evaluation of Advanced Methods in Marine Ecological Monitoring
SimpleVSF: VLM-Scoring Fusion for Trajectory Prediction of End-to-End Autonomous Driving
Understanding and Improving Length Generalization in Hierarchical Sparse Attention Models
ZSPAPrune: Zero-Shot Prompt-Aware Token Pruning for Vision-Language Models
From Pixels to People: Satellite-Based Mapping and Quantification of Riverbank Erosion and Lost Villages in Bangladesh
Round Outcome Prediction in VALORANT Using Tactical Features from Video Analysis
Soft-Masked Diffusion Language Models
D2C-HRHR: Discrete Actions with Double Distributional Critics for High-Risk-High-Return Tasks
Diagnosis of Fuel Cell Health Status with Deep Sparse Auto-Encoder Neural Network
When One Moment Isn't Enough: Multi-Moment Retrieval with Cross-Moment Interactions
Taming Modality Entanglement in Continual Audio-Visual Segmentation
Visibility Allocation Systems: How Algorithmic Design Shapes Online Visibility and Societal Outcomes
How News Feels: Understanding Affective Bias in Multilingual Headlines for Human-Centered Media Design
Augmented Web Usage Mining and User Experience Optimization with CAWAL's Enriched Analytics Data
FineVision: Open Data Is All You Need
MemoryBench: A Benchmark for Memory and Continual Learning in LLM Systems
Comprehending Spatio-temporal Data via Cinematic Storytelling using Large Language Models
Auto-Rubric: Learning to Extract Generalizable Criteria for Reward Modeling
CharDiff: A Diffusion Model with Character-Level Guidance for License Plate Image Restoration
DDSC: Dynamic Dual-Signal Curriculum for Data-Efficient Acoustic Scene Classification under Domain Shift
TopSeg: A Multi-Scale Topological Framework for Data-Efficient Heart Sound Segmentation
Towards Mixed-Modal Retrieval for Universal Retrieval-Augmented Generation
Localist LLMs with Recruitment Learning
Bridging Embodiment Gaps: Deploying Vision-Language-Action Models on Soft Robots
Optimizing Energy Management of Smart Grid using Reinforcement Learning aided by Surrogate models built using Physics-informed Neural Networks
TabR1: Taming GRPO for tabular reasoning LLMs
Inference of Deterministic Finite Automata via Q-Learning
EduAdapt: A Question Answer Benchmark Dataset for Evaluating Grade-Level Adaptability in LLMs
Leveraging Group Relative Policy Optimization to Advance Large Language Models in Traditional Chinese Medicine
AFRICAPTION: Establishing a New Paradigm for Image Captioning in African Languages
BenCao: An Instruction-Tuned Large Language Model for Traditional Chinese Medicine
Navigating the Alignment-Calibration Trade-off: A Pareto-Superior Frontier via Model Merging
From Spatial to Actions: Grounding Vision-Language-Action Model in Spatial Foundation Priors
The Parameterized Complexity of Computing the VC-Dimension
Layer Specialization Underlying Compositional Reasoning in Transformers
DAMSDAN: Distribution-Aware Multi-Source Domain Adaptation Network for Cross-Domain EEG-based Emotion Recognition
SparseWorld: A Flexible, Adaptive, and Efficient 4D Occupancy World Model Powered by Sparse and Dynamic Queries
I-RAVEN-X: Benchmarking Generalization and Robustness of Analogical and Mathematical Reasoning in Large Language and Reasoning Models
Context-Aware Pseudo-Label Scoring for Zero-Shot Video Summarization
The Graphon Limit Hypothesis: Understanding Neural Network Pruning via Infinite Width Analysis
SimBench: Benchmarking the Ability of Large Language Models to Simulate Human Behaviors
MUG-V 10B: High-efficiency Training Pipeline for Large Video Generation Models
MambaX-Net: Dual-Input Mamba-Enhanced Cross-Attention Network for Longitudinal MRI Segmentation
An Empirical Study of Lagrangian Methods in Safe Reinforcement Learning
Intent-Driven LLM Ensemble Planning for Flexible Multi-Robot Disassembly: Demonstration on EV Batteries
CEPerFed: Communication-Efficient Personalized Federated Learning for Multi-Pulse MRI Classification
HGAdapter: Hypergraph-based Adapters in Language Models for Code Summarization and Clone Detection
GUIDE: Enhancing Gradient Inversion Attacks in Federated Learning with Denoising Models
CaMiT: A Time-Aware Car Model Dataset for Classification and Generation
RESample: A Robust Data Augmentation Framework via Exploratory Sampling for Robotic Manipulation
Frugal Federated Learning for Violence Detection: A Comparison of LoRA-Tuned VLMs and Personalized CNNs
On-the-Fly OVD Adaptation with FLAME: Few-shot Localization via Active Marginal-Samples Exploration
LILO: Bayesian Optimization with Interactive Natural Language Feedback
PICABench: How Far Are We from Physically Realistic Image Editing?
Intelligent Communication Mixture-of-Experts Boosted-Medical Image Segmentation Foundation Model
Multilingual Text-to-Image Person Retrieval via Bidirectional Relation Reasoning and Aligning
CrossGuard: Safeguarding MLLMs against Joint-Modal Implicit Malicious Attacks
Improving Cross-Patient Generalization in Parkinson's Disease Detection through Chunk-Based Analysis of Hand-Drawn Patterns
Closing the Sim2Real Performance Gap in RL
PANER: A Paraphrase-Augmented Framework for Low-Resource Named Entity Recognition
MT-Video-Bench: A Holistic Video Understanding Benchmark for Evaluating Multimodal LLMs in Multi-Turn Dialogues
Signature Forgery Detection: Improving Cross-Dataset Generalization
AcademicEval: Live Long-Context LLM Benchmark
A Multi-Threading Kernel for Enabling Neuromorphic Edge Applications
Human-AI Interactions: Cognitive, Behavioral, and Emotional Impacts
Prediction of Sea Ice Velocity and Concentration in the Arctic Ocean using Physics-informed Neural Network
Towards Explainable Skin Cancer Classification: A Dual-Network Attention Model with Lesion Segmentation and Clinical Metadata Fusion
Mapping Post-Training Forgetting in Language Models at Scale
SoftMimic: Learning Compliant Whole-body Control from Examples
Foundational Automatic Evaluators: Scaling Multi-Task Generative Evaluator Training for Reasoning-Centric Domains
Executable Knowledge Graphs for Replicating AI Research
Enterprise Deep Research: Steerable Multi-Agent Deep Research for Enterprise Analytics
Unbiased Gradient Low-Rank Projection
A Survey on Self-play Methods in Reinforcement Learning
Fully Autonomous AI Agents Should Not be Developed
Robust Search with Uncertainty-Aware Value Models for Language Model Reasoning
Automated Knowledge Component Generation for Interpretable Knowledge Tracing in Coding Problems
Online Feedback Efficient Active Target Discovery in Partially Observable Environments
RealMath: A Continuous Benchmark for Evaluating Language Models on Research-Level Mathematics
Ineq-Comp: Benchmarking Human-Intuitive Compositional Reasoning in Automated Theorem Proving on Inequalities
Visual Instruction Bottleneck Tuning
Smart Traffic Signals: Comparing MARL and Fixed-Time Strategies
Enumerate-Conjecture-Prove: Formally Solving Answer-Construction Problems in Math Competitions
AgentAuditor: Human-Level Safety and Security Evaluation for LLM Agents
macOSWorld: A Multilingual Interactive Benchmark for GUI Agents
CTR-LoRA: Curvature-Aware and Trust-Region Guided Low-Rank Adaptation for Large Language Models
ESCA: Contextualizing Embodied Agents via Scene-Graph Generation
Long Exposure: Accelerating Parameter-Efficient Fine-Tuning for LLMs under Shadowy Sparsity
One Token Embedding Is Enough to Deadlock Your Large Reasoning Model
Gains: Fine-grained Federated Domain Adaptation in Open Set
Self-Attention to Operator Learning-based 3D-IC Thermal Simulation
LinearizeLLM: An Agent-Based Framework for LLM-Driven Exact Linear Reformulation of Nonlinear Optimization Problems
Predict Training Data Quality via Its Geometry in Metric Space
A Graph-Attentive LSTM Model for Malicious URL Detection
Quantum NLP models on Natural Language Inference
Safeguarding Efficacy in Large Language Models: Evaluating Resistance to Human-Written and Algorithmic Adversarial Prompts
Learning to Watermark: A Selective Watermarking Framework for Large Language Models via Multi-Objective Optimization
Bolster Hallucination Detection via Prompt-Guided Data Augmentation
DAWP: A framework for global observation forecasting via Data Assimilation and Weather Prediction in satellite observation space
Cog-Rethinker: Hierarchical Metacognitive Reinforcement Learning for LLM Reasoning
AMiD: Knowledge Distillation for LLMs with $\alpha$-mixture Assistant Distribution
MEET-Sepsis: Multi-Endogenous-View Enhanced Time-Series Representation Learning for Early Sepsis Prediction Representation Learning for Early Sepsis Prediction
Algorithmic Primitives and Compositional Geometry of Reasoning in Language Models
Can GRPO Help LLMs Transcend Their Pretraining Origin?
Stratos: An End-to-End Distillation Pipeline for Customized LLMs under Distributed Cloud Environments
MCP Security Bench (MSB): Benchmarking Attacks Against Model Context Protocol in LLM Agents
Using Kolmogorov-Smirnov Distance for Measuring Distribution Shift in Machine Learning
AMStraMGRAM: Adaptive Multi-cutoff Strategy Modification for ANaGRAM
Breaking Guardrails, Facing Walls: Insights on Adversarial AI for Defenders & Researchers
Layer-Aware Influence for Online Data Valuation Estimation
InfraGPT Smart Infrastructure: An End-to-End VLM-Based Framework for Detecting and Managing Urban Defects
On-Chain Decentralized Learning and Cost-Effective Inference for DeFi Attack Mitigation
Nondeterminism-Aware Optimistic Verification for Floating-Point Neural Networks
Disaster Management in the Era of Agentic AI Systems: A Vision for Collective Human-Machine Intelligence for Augmented Resilience
RoBCtrl: Attacking GNN-Based Social Bot Detectors via Reinforced Manipulation of Bots Control Interaction
Membership Inference over Diffusion-models-based Synthetic Tabular Data
Vector Quantization in the Brain: Grid-like Codes in World Models
Kelle: Co-design KV Caching and eDRAM for Efficient LLM Serving in Edge Computing
Does Capital Dream of Artificial Labour?
AMS-QUANT: Adaptive Mantissa Sharing for Floating-point Quantization
Open Shouldn't Mean Exempt: Open-Source Exceptionalism and Generative AI
In the Mood to Exclude: Revitalizing Trespass to Chattels in the Era of GenAI Scraping
GUIrilla: A Scalable Framework for Automated Desktop UI Exploration
FUSE-Traffic: Fusion of Unstructured and Structured Data for Event-aware Traffic Forecasting
Algorithmic Fairness in AI Surrogates for End-of-Life Decision-Making
Fusion-Augmented Large Language Models: Boosting Diagnostic Trustworthiness via Model Consensus
Beyond Accuracy: Are Time Series Foundation Models Well-Calibrated?
Can LLMs Correct Themselves? A Benchmark of Self-Correction in LLMs
Learning a Generalized Model for Substation Level Voltage Estimation in Distribution Networks
Residual Correction Models for AC Optimal Power Flow Using DC Optimal Power Flow Solutions
FedPURIN: Programmed Update and Reduced INformation for Sparse Personalized Federated Learning
Cash Flow Underwriting with Bank Transaction Data: Advancing MSME Financial Inclusion in Malaysia
Co-Designing Interdisciplinary Design Projects with AI
Human or AI? Comparing Design Thinking Assessments by Teaching Assistants and Bots
Effect of Reporting Mode and Clinical Experience on Radiologists' Gaze and Image Analysis Behavior in Chest Radiography
MNO: Multiscale Neural Operator for Computational Fluid Dynamics with 3D Point Cloud Data
Data-Driven Analysis of Intersectional Bias in Image Classification: A Framework with Bias-Weighted Augmentation
Early-stopping for Transformer model training
Optimization of the quantization of dense neural networks from an exact QUBO formulation
BPL: Bias-adaptive Preference Distillation Learning for Recommender System
Continual Knowledge Consolidation LORA for Domain Incremental Learning
ISO/IEC-Compliant Match-on-Card Face Verification with Short Binary Templates
EvolveR: Self-Evolving LLM Agents through an Experience-Driven Lifecycle
TriAgent: Automated Biomarker Discovery with Deep Research Grounding for Triage in Acute Care by LLM-Based Multi-Agent Collaboration
SARHAchat: An LLM-Based Chatbot for Sexual and Reproductive Health Counseling
Interpretable RNA-Seq Clustering with an LLM-Based Agentic Evidence-Grounded Framework
PassREfinder-FL: Privacy-Preserving Credential Stuffing Risk Prediction via Graph-Based Federated Learning for Representing Password Reuse between Websites
MoPHES:Leveraging on-device LLMs as Agent for Mobile Psychological Health Evaluation and Support
STABLE: Gated Continual Learning for Large Language Models
Evaluating Prompting Strategies and Large Language Models in Systematic Literature Review Screening: Relevance and Task-Stage Classification
Compressing Many-Shots in In-Context Learning
Narrowing Action Choices with AI Improves Human Sequential Decisions
Aria Gen 2 Pilot Dataset
GuideFlow3D: Optimization-Guided Rectified Flow For Appearance Transfer
Agentic AI for Ultra-Modern Networks: Multi-Agent Framework for RAN Autonomy and Assurance
Publication Trend Analysis and Synthesis via Large Language Model: A Case Study of Engineering in PNAS
AsyncVoice Agent: Real-Time Explanation for LLM Planning and Reasoning
Bridging Symmetry and Robustness: On the Role of Equivariance in Enhancing Adversarial Robustness
The Formalism-Implementation Gap in Reinforcement Learning Research
Expressive Reward Synthesis with the Runtime Monitoring Language
Zero-Shot Coordination in Ad Hoc Teams with Generalized Policy Improvement and Difference Rewards
Seeing Through the Brain: New Insights from Decoding Visual Stimuli with fMRI
Revealing Low-Dimensional Structure in 2D Richtmyer-Meshkov Instabilities via Parametric Reduced-Order Modeling
SentinelNet: Safeguarding Multi-Agent Collaboration Through Credit-Based Dynamic Threat Detection
What Can String Probability Tell Us About Grammaticality?
Machine Learning for Climate Policy: Understanding Policy Progression in the European Green Deal
Protein Folding with Neural Ordinary Differential Equations
Detecting Adversarial Fine-tuning with Auditing Agents
NEBULA: Do We Evaluate Vision-Language-Action Agents Correctly?
MuseTok: Symbolic Music Tokenization for Generation and Semantic Understanding
Do What You Say: Steering Vision-Language-Action Models via Runtime Reasoning-Action Alignment Verification
Disentangling Hyperedges through the Lens of Category Theory
Synergizing chemical and AI communities for advancing laboratories of the future
OpenLVLM-MIA: A Controlled Benchmark Revealing the Limits of Membership Inference Attacks on Large Vision-Language Models
Scaffold-Aware Generative Augmentation and Reranking for Enhanced Virtual Screening
Lung Cancer Classification from CT Images Using ResNet
Time-Embedded Algorithm Unrolling for Computational MRI
Thinking About Thinking: Evaluating Reasoning in Post-Trained Language Models
Manual2Skill++: Connector-Aware General Robotic Assembly from Instruction Manuals via Vision-Language Models
End-to-End Argument Mining through Autoregressive Argumentative Structure Prediction
Cataract-LMM: Large-Scale, Multi-Source, Multi-Task Benchmark for Deep Learning in Surgical Video Analysis
Navigating through the hidden embedding space: steering LLMs to improve mental health assessment
Conformal Prediction in The Loop: A Feedback-Based Uncertainty Model for Trajectory Optimization
MoReBench: Evaluating Procedural and Pluralistic Moral Reasoning in Language Models, More than Outcomes
ATA: A Neuro-Symbolic Approach to Implement Autonomous and Trustworthy Agents
Probing the Hidden Talent of ASR Foundation Models for L2 English Oral Assessment
SPLite Hand: Sparsity-Aware Lightweight 3D Hand Pose Estimation
Modeling Expert Interactions in Sparse Mixture of Experts via Graph Structures
SSL4RL: Revisiting Self-supervised Learning as Intrinsic Reward for Visual-Language Reasoning
EDVD-LLaMA: Explainable Deepfake Video Detection via Multimodal Large Language Model Reasoning
Input Domain Aware MoE: Decoupling Routing Decisions from Task Optimization in Mixture of Experts
Declarative Techniques for NL Queries over Heterogeneous Data
Automated Composition of Agents: A Knapsack Approach for Agentic Component Selection
Structured Temporal Causality for Interpretable Multivariate Time Series Anomaly Detection
Image Categorization and Search via a GAT Autoencoder and Representative Models
DIV-Nav: Open-Vocabulary Spatial Relationships for Multi-Object Navigation
Few-Label Multimodal Modeling of SNP Variants and ECG Phenotypes Using Large Language Models for Cardiovascular Risk Stratification
Enhancing Compositional Reasoning in CLIP via Reconstruction and Alignment of Text Descriptions
Watch Where You Move: Region-aware Dynamic Aggregation and Excitation for Gait Recognition
Predicting life satisfaction using machine learning and explainable AI
LANPO: Bootstrapping Language and Numerical Feedback for Reinforcement Learning in LLMs
Toward Understanding Security Issues in the Model Context Protocol Ecosystem
Language over Content: Tracing Cultural Understanding in Multilingual Large Language Models
AI-Generated Text Detection in Low-Resource Languages: A Case Study on Urdu
Atom-anchored LLMs speak Chemistry: A Retrosynthesis Demonstration
Symmetry and Generalisation in Neural Approximations of Renormalisation Transformations
SHIELD: Suppressing Hallucinations In LVLM Encoders via Bias and Vulnerability Defense
Asymptotically Stable Quaternion-valued Hopfield-structured Neural Network with Periodic Projection-based Supervised Learning Rules
Prior Makes It Possible: From Sublinear Graph Algorithms to LLM Test-Time Methods
A Deep Learning Framework for Real-Time Image Processing in Medical Diagnostics: Enhancing Accuracy and Speed in Clinical Applications
Prompt Optimization via Retrieved Reasoning Assets and Multi-Agent Analysis
Structured Interfaces for Automated Reasoning with 3D Scene Graphs
Unleashing Diverse Thinking Modes in LLMs through Multi-Agent Collaboration
Safire: Similarity Framework for Visualization Retrieval
All You Need is One: Capsule Prompt Tuning with a Single Vector
Renaissance of RNNs in Streaming Clinical Time Series: Compact Recurrence Remains Competitive with Transformers
VisuoAlign: Safety Alignment of LVLMs with Multimodal Tree Search
Executable Epistemology: The Structured Cognitive Loop as an Architecture of Intentional Understanding
Exploring the Potential of Citiverses for Regulatory Learning
PISA: A Pragmatic Psych-Inspired Unified Memory System for Enhanced AI Agency
Limits of Emergent Reasoning of Large Language Models in Agentic Frameworks for Deterministic Games
Cognitive Load Traces as Symbolic and Visual Accounts of Deep Model Cognition
ProofFlow: A Dependency Graph Approach to Faithful Proof Autoformalization
Ontologies in Motion: A BFO-Based Approach to Knowledge Graph Construction for Motor Performance Research Data in Sports Science
A Non-overlap-based Conflict Measure for Random Permutation Sets
PAINT: Parallel-in-time Neural Twins for Dynamical System Reconstruction
Global-focal Adaptation with Information Separation for Noise-robust Transfer Fault Diagnosis
Algorithms for dynamic scheduling in manufacturing, towards digital factories Improving Deadline Feasibility and Responsiveness via Temporal Networks
Reliability of Large Language Model Generated Clinical Reasoning in Assisted Reproductive Technology: Blinded Comparative Evaluation Study
Operationalising Extended Cognition: Formal Metrics for Corporate Knowledge and Legal Accountability
Towards Automatic Evaluation and Selection of PHI De-identification Models via Multi-Agent Collaboration
The Right to Be Remembered: Preserving Maximally Truthful Digital Memory in the Age of AI
ScholarEval: Research Idea Evaluation Grounded in Literature
Distractor Injection Attacks on Large Reasoning Models: Characterization and Defense
What Limits Agentic Systems Efficiency?
DTKG: Dual-Track Knowledge Graph-Verified Reasoning Framework for Multi-Hop QA
MedRule-KG: A Knowledge-Graph--Steered Scaffold for Mathematical Reasoning with a Lightweight Verifier
Beyond Fixed Anchors: Precisely Erasing Concepts with Sibling Exclusive Counterparts
The Burden of Interactive Alignment with Inconsistent Preferences
Before you , monitor: Implementing Flavell's metacognitive framework in LLMs
Humanoid-inspired Causal Representation Learning for Domain Generalization
RGMem: Renormalization Group-based Memory Evolution for Language Agent User Profile
ReviewSense: Transforming Customer Review Dynamics into Actionable Business Insights
NP-Engine: Empowering Optimization Reasoning in Large Language Models with Verifiable Synthetic NP Problems
Hey Pentti, We Did It Again!: Differentiable vector-symbolic types that prove polynomial termination
Urban-R1: Reinforced MLLMs Mitigate Geospatial Biases for Urban General Intelligence
BuildArena: A Physics-Aligned Interactive Benchmark of LLMs for Engineering Construction
Ripple Effect Protocol: Coordinating Agent Populations
Can Knowledge-Graph-based Retrieval Augmented Generation Really Retrieve What You Need?
Uncertain Knowledge Graph Completion via Semi-Supervised Confidence Distribution Learning
Count Counts: Motivating Exploration in LLM Reasoning with Count-based Intrinsic Rewards
Foundation and Large-Scale AI Models in Neuroscience: A Comprehensive Review
An Agentic Framework with LLMs for Solving Complex Vehicle Routing Problems
Beyond Pipelines: A Survey of the Paradigm Shift toward Model-Native Agentic AI
A Comprehensive Survey on Reinforcement Learning-based Agentic Search: Foundations, Roles, Optimizations, Evaluations, and Applications
Surrogate Modeling and Explainable Artificial Intelligence for Complex Systems: A Workflow for Automated Simulation Exploration
ELMM: Efficient Lightweight Multimodal Large Language Models for Multimodal Knowledge Graph Completion
End-to-end Listen, Look, Speak and Act
See or Say Graphs: Agent-Driven Scalable Graph Understanding with Vision-Language Models
Domain-Contextualized Concept Graphs: A Computable Framework for Knowledge Representation
DeepAnalyze: Agentic Large Language Models for Autonomous Data Science
VAGEN: Reinforcing World Model Reasoning for Multi-Turn VLM Agents
A Comparative User Evaluation of XRL Explanations using Goal Identification
STARK: Strategic Team of Agents for Refining Kernels
ToolCritic: Detecting and Correcting Tool-Use Errors in Dialogue Systems
A Brain Cell Type Resource Created by Large Language Models and a Multi-Agent AI System for Collaborative Community Annotation
Structured Debate Improves Corporate Credit Reasoning in Financial AI
Enhanced Fish Freshness Classification with Incremental Handcrafted Feature Fusion
Physics-Informed Large Language Models for HVAC Anomaly Detection with Autonomous Rule Generation
Which LLM Multi-Agent Protocol to Choose?
Combining ECG Foundation Model and XGBoost to Predict In-Hospital Malignant Ventricular Arrhythmias in AMI Patients
Offline Policy Evaluation of Multi-Turn LLM Health Coaching with Real Users
Temporally Detailed Hypergraph Neural ODEs for Type 2 Diabetes Progression Modeling
Coinvisor: An RL-Enhanced Chatbot Agent for Interactive Cryptocurrency Investment Analysis
RubiSCoT: A Framework for AI-Supported Academic Assessment
Graph Attention-Guided Search for Dense Multi-Agent Pathfinding
Diverse Planning with Simulators via Linear Temporal Logic
Active Inference for an Intelligent Agent in Autonomous Reconnaissance Missions
Label Indeterminacy in AI & Law
MIRAGE: Agentic Framework for Multimodal Misinformation Detection with Web-Grounded Reasoning
Reasoning Distillation and Structural Alignment for Improved Code Generation
OG-Rank: Learning to Rank Fast and Slow with Uncertainty and Reward-Trend Guided Adaptive Exploration
LLM-as-a-Prophet: Understanding Predictive Intelligence with Prophet Arena
A Principle of Targeted Intervention for Multi-Agent Reinforcement Learning
Contextual Attention Modulation: Towards Efficient Multi-Task Adaptation in Large Language Models
Seeing but Not Believing: Probing the Disconnect Between Visual Attention and Answer Correctness in VLMs
A Semantic Generalization of Shannon's Information Theory and Applications
Multimodal Chip Physical Design Engineer Assistant
FlexLink: Boosting your NVLink Bandwidth by 27% without accuracy concern
FinFlowRL: An Imitation-Reinforcement Learning Framework for Adaptive Stochastic Control in Finance
Mitigating Harmful Erraticism in LLMs Through Dialectical Behavior Therapy Based De-Escalation Strategies
A Real-Time BCI for Stroke Hand Rehabilitation Using Latent EEG Features from Healthy Subjects
Detecting and Preventing Harmful Behaviors in AI Companions: Development and Evaluation of the SHIELD Supervisory System
Accelerating Frontier MoE Training with 3D Integrated Optics
BREATH: A Bio-Radar Embodied Agent for Tonal and Human-Aware Diffusion Music Generation
From Coordination to Personalization: A Trust-Aware Simulation Framework for Emergency Department Decision Support
"She's Like a Person but Better": Characterizing Companion-Assistant Dynamics in Human-AI Relationships
FVDebug: An LLM-Driven Debugging Assistant for Automated Root Cause Analysis of Formal Verification Failures
Sleeping Kelly is a Thirder
VeriGRAG: Enhancing LLM-Based Verilog Code Generation with Structure-Aware Soft Prompts
Intent-Driven Storage Systems: From Low-Level Tuning to High-Level Understanding
Comparing LLMs for Sentiment Analysis in Financial Market News
Impl\'ementation Efficiente de Fonctions de Convolution sur FPGA \`a l'Aide de Blocs Param\'etrables et d'Approximations Polynomiales
Lean Finder: Semantic Search for Mathlib That Understands User Intents
Lyapunov-Stable Adaptive Control for Multimodal Concept Drift
BEACON: Bayesian Optimal Stopping for Efficient LLM Sampling
Learning from Mistakes: Enhancing Harmful Meme Detection via Misjudgment Risk Patterns
WaveNet's Precision in EEG Classification
ATLAS: Adaptive Trading with LLM AgentS Through Dynamic Prompt Optimization and Multi-Agent Coordination
Cross-dataset Multivariate Time-series Model for Parkinson's Diagnosis via Keyboard Dynamics
How Good Are LLMs at Processing Tool Outputs?
Interpretable Graph-Language Modeling for Detecting Youth Illicit Drug Use

Research Sources: 1024 | Generated: 10/21/2025