AI Research News Feeds for September 25th, 2025

AI RESEARCH PAPERS & ACADEMIC SOURCES

Challenges and Trends in Egocentric Vision: A Survey
RG-Attn: Radian Glue Attention for Multi-modality Multi-agent Cooperative Perception
SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation
Deciphering Functions of Neurons in Vision-Language Models
Generating 360° Video is What You Need For a 3D Scene
Imaging Biomarkers for Neurodegenerative Diseases from Detailed Segmentation of Medial Temporal Lobe Subregions on in vivo Brain MRI Using Upsampling Strategy Guided by High-resolution ex vivo MRI
EDBench: Large-Scale Electron Density Data for Molecular Modeling
SEM: Enhancing Spatial Understanding for Robust Robot Manipulation
CellCLIP -- Learning Perturbation Effects in Cell Painting via Text-Guided Contrastive Learning
HAZEMATCHING: Dehazing Light Microscopy Images with Guided Conditional Flow Matching
Infrared Image Super-Resolution: Systematic Review, and Future Trends
To Fold or Not to Fold: Graph Regularized Tensor Train for Visual Data Completion
VLM See, Robot Do: Human Demo Video to Robot Action Plan via Vision Language Model
Probabilistic Online Event Downsampling
A Quad-Step Approach to Uncertainty-Aware Deep Learning for Skin Cancer Classification
NERO: Explainable Out-of-Distribution Detection with Neuron-level Relevance
SurgVidLM: Towards Multi-grained Surgical Video Understanding with Large Language Model
Intervening in Black Box: Concept Bottleneck Model for Enhancing Human Neural Network Mutual Understanding
Diffusion models for multivariate subsurface generation and efficient probabilistic inversion
AAPO: Enhancing the Reasoning Capabilities of LLMs with Advantage Momentum
WikiGap: Promoting Epistemic Equity by Surfacing Knowledge Gaps Between English Wikipedia and other Language Editions
Localized LoRA: A Structured Low-Rank Approximation for Efficient Fine-Tuning
OmniSpatial: Towards Comprehensive Spatial Reasoning Benchmark for Vision Language Models
Urania: Differentially Private Insights into AI Use
CLOSP: A Unified Semantic Space for SAR, MSI, and Text in Remote Sensing
Latent Wavelet Diffusion For Ultra-High-Resolution Image Synthesis
Macroeconomic Forecasting with Large Language Models
Unifying Symbolic Music Arrangement: Track-Aware Reconstruction and Structured Tokenization
A GEN AI Framework for Medical Note Generation
GraphEQA: Using 3D Semantic Scene Graphs for Real-time Embodied Question Answering
HawkBench: Investigating Resilience of RAG Methods on Stratified Information-Seeking Tasks
Stepwise Guided Policy Optimization: Coloring your Incorrect Reasoning in GRPO
LEMUR Neural Network Dataset: Towards Seamless AutoML
MAME: Multidimensional Adaptive Metamer Exploration with Human Perceptual Feedback
Enhanced uncertainty quantification variational autoencoders for the solution of Bayesian inverse problems
Diffusion Classifier-Driven Reward for Offline Preference-based Reinforcement Learning
A Transformer Model for Predicting Chemical Products from Generic SMARTS Templates with Data Augmentation
Examining the robustness of Physics-Informed Neural Networks to noise for Inverse Problems
Deep learning for exoplanet detection and characterization by direct imaging at high contrast
Pretrained deep models outperform GBDTs in Learning-To-Rank under label scarcity
Robust Training of Neural Networks at Arbitrary Precision and Sparsity
Survey of Deep Learning and Physics-Based Approaches in Computational Wave Imaging
Model-Agnostic AI Framework with Explicit Time Integration for Long-Term Fluid Dynamics Prediction
LLMs for Cold-Start Cutting Plane Separator Configuration
Multimodal Representation-disentangled Information Bottleneck for Multimodal Recommendation
AnchDrive: Bootstrapping Diffusion Policies with Hybrid Trajectory Anchors for End-to-End Driving
Investigating Security Implications of Automatically Generated Code on the Software Supply Chain
RAG Security and Privacy: Formalizing the Threat Model and Attack Surface
Adaptive Event-Triggered Policy Gradient for Multi-Agent Reinforcement Learning
Tree Search for Language Model Agents
Reinforcement Learning and Machine ethics: a systematic review
Multi-Agents are Social Groups: Investigating Social Influence of Multiple Agents in Human-Agent Interactions
Enhancing Crash Frequency Modeling Based on Augmented Multi-Type Data by Hybrid VAE-Diffusion-Based Generative Neural Networks
STRIVE: Structured Reasoning for Self-Improvement in Claim Verification
AutoEval: A Practical Framework for Autonomous Evaluation of Mobile Agents
Causal Inference under Threshold Manipulation: Bayesian Mixture Modeling and Heterogeneous Treatment Effects
Eliminating stability hallucinations in llm-based tts models via attention guidance
CollaPipe: Adaptive Segment-Optimized Pipeline Parallelism for Collaborative LLM Training in Heterogeneous Edge Networks
Choosing to Be Green: Advancing Green AI via Dynamic Model Selection
Affective Computing and Emotional Data: Challenges and Implications in Privacy Regulations, The AI Act, and Ethics in Large Language Models
CyberSOCEval: Benchmarking LLMs Capabilities for Malware Analysis and Threat Intelligence Reasoning
How People Manage Knowledge in their "Second Brains": A Case Study with Industry Researchers Using Obsidian
STAF: Leveraging LLMs for Automated Attack Tree-Based Security Test Generation
Wrapped Gaussian on the manifold of Symmetric Positive Definite Matrices
Stein's unbiased risk estimate and Hyvärinen's score matching
Error Propagation in Dynamic Programming: From Stochastic Control to Option Pricing
Chiseling: Powerful and Valid Subgroup Selection via Interactive Machine Learning
Hierarchical Bayesian Operator-induced Symbolic Regression Trees for Structural Learning of Scientific Expressions
Generalized Nonnegative Structured Kruskal Tensor Regression
Statistical Inference Leveraging Synthetic Data with Distribution-Free Guarantees
Differentially Private Bootstrap: New Privacy Analysis and Inference Strategies
Sparse Max-Affine Regression
A Scalable Nyström-Based Kernel Two-Sample Test with Permutations
Beyond Grids: Multi-objective Bayesian Optimization With Adaptive Discretization
The 2020 United States Decennial Census Is More Private Than You (Might) Think
Unsupervised Cross-Domain 3D Human Pose Estimation via Pseudo-Label-Guided Global Transforms
ChartQA-X: Generating Explanations for Visual Chart Reasoning
Revisiting Residual Connections: Orthogonal Updates for Stable and Efficient Deep Networks
LiDAR MOT-DETR: A LiDAR-based Two-Stage Transformer for 3D Multiple Object Tracking
Redemption Score: A Multi-Modal Evaluation Framework for Image Captioning via Distributional, Perceptual, and Linguistic Signal Triangulation
Sparse VideoGen2: Accelerate Video Generation with Sparse Attention via Semantic-Aware Permutation
EndoBench: A Comprehensive Evaluation of Multi-Modal Large Language Models for Endoscopy Analysis
To Trust Or Not To Trust Your Vision-Language Model's Prediction
SpaRC: Sparse Radar-Camera Fusion for 3D Object Detection
SafeEraser: Enhancing Safety in Multimodal Large Language Models through Multimodal Machine Unlearning
GaussianSeal: Rooting Adaptive Watermarks for 3D Gaussian Generation Model
Robust Computer-Vision based Construction Site Detection for Assistive-Technology Applications
LEDiT: Your Length-Extrapolatable Diffusion Transformer without Positional Encoding
Adversarial Robustness of Discriminative Self-Supervised Learning in Vision
Cross-Domain Underwater Image Enhancement Guided by No-Reference Image Quality Assessment: A Transfer Learning Approach
Multimodal Reference Visual Grounding
Towards Visual Text Grounding of Multimodal Large Language Model
Ctrl-Room: Controllable Text-to-3D Room Meshes Generation with Layout Constraints
Long Video Understanding with Learnable Retrieval in Video-Language Models
CLIP Can Understand Depth
MCPDepth: Omnidirectional Depth Estimation via Stereo Matching from Multi-Cylindrical Panoramas
Positional Prompt Tuning for Efficient 3D Representation Learning
Lagrangian Motion Fields for Long-term Motion Generation
Diffusion Curriculum: Synthetic-to-Real Generative Curriculum Learning via Image-Guided Diffusion
Replay-Free Continual Low-Rank Adaptation with Dynamic Memory
SMLNet: A SPD Manifold Learning Network for Infrared and Visible Image Fusion
DreamMix: Decoupling Object Attributes for Enhanced Editability in Customized Image Inpainting
MeshMosaic: Scaling Artist Mesh Generation via Local-to-Global Assembly
MultiSoundGen: Video-to-Audio Generation for Multi-Event Scenarios via SlowFast Contrastive Audio-Visual Pretraining and Direct Preference Optimization
Ensuring Reliable Participation in Subjective Video Quality Tests Across Platforms
Queryable 3D Scene Representation: A Multi-Modal Framework for Semantic Reasoning and Robotic Task Planning
KSDiff: Keyframe-Augmented Speech-Aware Dual-Path Diffusion for Facial Animation
VisualMimic: Visual Humanoid Loco-Manipulation via Motion Tracking and Generation
An Optimized PatchMatch for Multi-scale and Multi-feature Label Fusion
Robust superpixels using color and contour features along linear path
Texture Superpixel Clustering from Patch-based Nearest Neighbor Matching
Multi-Scale Superpatch Matching using Dual Superpixel Descriptors
PerFace: Metric Learning in Perceptual Facial Similarity for Enhanced Face Anonymization
FAST: Foreground-aware Diffusion with Accelerated Sampling Trajectory for Segmentation-oriented Anomaly Synthesis
A Comprehensive Evaluation of YOLO-based Deer Detection Performance on Edge Devices
Efficient Encoder-Free Pose Conditioning and Pose Control for Virtual Try-On
PhysCtrl: Generative Physics for Controllable and Physics-Grounded Video Generation
EditVerse: Unifying Image and Video Editing and Generation with In-Context Learning
Frequency-Aware Ensemble Learning for BraTS 2025 Pediatric Brain Tumor Segmentation
Agentic Scene Policies: Unifying Space, Semantics, and Affordances for Robot Action
AJAHR: Amputated Joint Aware 3D Human Mesh Recovery
PU-Gaussian: Point Cloud Upsampling using 3D Gaussian Representation
ImageNet-trained CNNs are not biased towards texture: Revisiting feature reliance through controlled suppression
An Anisotropic Cross-View Texture Transfer with Multi-Reference Non-Local Attention for CT Slice Interpolation
4D Driving Scene Generation With Stereo Forcing
A Versatile Foundation Model for AI-enabled Mammogram Interpretation
A co-evolving agentic AI system for medical imaging analysis
HiPerformer: A High-Performance Global-Local Segmentation Model with Modular Hierarchical Fusion Strategy
SHMoAReg: Spark Deformable Image Registration via Spatial Heterogeneous Mixture of Experts and Attention Heads
Unleashing the Potential of the Semantic Latent Space in Diffusion Models for Image Dehazing
A Simple Data Augmentation Strategy for Text-in-Image Scientific VQA
EchoBench: Benchmarking Sycophancy in Medical Large Vision-Language Models
Smaller is Better: Enhancing Transparency in Vehicle AI Systems via Pruning
C²MIL: Synchronizing Semantic and Topological Causalities in Multiple Instance Learning for Robust and Interpretable Survival Analysis
U-Mamba2-SSL for Semi-Supervised Tooth and Pulp Segmentation in CBCT
Optical Ocean Recipes: Creating Realistic Datasets to Facilitate Underwater Vision Research
Universal Camouflage Attack on Vision-Language Models for Autonomous Driving
Interpreting ResNet-based CLIP via Neuron-Attention Decomposition
When Words Can't Capture It All: Towards Video-Based User Complaint Text Generation with Multimodal Video Complaint Dataset
SynchroRaMa: Lip-Synchronized and Emotion-Aware Talking Face Generation via Multi-Modal Emotion Embedding
OmniScene: Attention-Augmented Multimodal 4D Scene Understanding for Autonomous Driving
CamPVG: Camera-Controlled Panoramic Video Generation with Epipolar-Aware Diffusion
SDE-DET: A Precision Network for Shatian Pomelo Detection in Complex Orchard Environments
Improving Generalizability and Undetectability for Targeted Adversarial Attacks on Multimodal Pre-trained Models
Does the Manipulation Process Matter? RITA: Reasoning Composite Image Manipulations via Reversely-Ordered Incremental-Transition Autoregression
PS3: A Multimodal Transformer Integrating Pathology Reports with Histology Images and Biological Pathways for Cancer Survival Prediction
Generative Adversarial Networks Applied for Privacy Preservation in Biometric-Based Authentication and Identification
PersONAL: Towards a Comprehensive Benchmark for Personalized Embodied Agents
FreezeVLA: Action-Freezing Attacks against Vision-Language-Action Models
Adaptive Guidance Semantically Enhanced via Multimodal LLM for Edge-Cloud Object Detection
Generalized Shortest Path-based Superpixels for 3D Spherical Image Segmentation
Efficient Cell Painting Image Representation Learning via Cross-Well Aligned Masked Siamese Network
Aerial-Ground Image Feature Matching via 3D Gaussian Splatting-based Intermediate View Rendering
CapStARE: Capsule-based Spatiotemporal Architecture for Robust and Efficient Gaze Estimation
GS-RoadPatching: Inpainting Gaussians via 3D Searching and Placing for Driving Scenes
nnFilterMatch: A Unified Semi-Supervised Learning Framework with Uncertainty-Aware Pseudo-Label Filtering for Efficient Medical Segmentation
Talking Head Generation via AU-Guided Landmark Prediction
Logics-Parsing Technical Report
Sex-based Bias Inherent in the Dice Similarity Coefficient: A Model Independent Analysis for Multiple Anatomical Structures
EfficienT-HDR: An Efficient Transformer-Based Framework via Multi-Exposure Fusion for HDR Reconstruction
BiTAA: A Bi-Task Adversarial Attack for Object Detection and Depth Estimation via 3D Gaussian Splatting
StrCGAN: A Generative Framework for Stellar Image Restoration
Adaptive Model Ensemble for Continual Learning
ThinkFake: Reasoning in Multimodal Large Language Models for AI-Generated Image Detection
Anatomically Constrained Transformers for Cardiac Amyloidosis Classification
Learning to Stop: Reinforcement Learning for Efficient Patient-Level Echocardiographic Classification
Towards Robust In-Context Learning for Medical Image Segmentation via Data Synthesis
VIMD: Monocular Visual-Inertial Motion and Depth Estimation
Frequency-domain Multi-modal Fusion for Language-guided Medical Image Segmentation
PolGS: Polarimetric Gaussian Splatting for Fast Reflective Surface Reconstruction
CAMILA: Context-Aware Masking for Image Editing with Language Alignment
Robust RGB-T Tracking via Learnable Visual Fourier Prompt Fine-tuning and Modality Fusion Prompt Generation
Rectified Decoupled Dataset Distillation: A Closer Look for Fair and Comprehensive Evaluation
CURE: Centroid-guided Unsupervised Representation Erasure for Facial Recognition Systems
Synthesizing Artifact Dataset for Pixel-level Detection
Parameter-Efficient Multi-Task Learning via Progressive Task-Specific Adaptation
Raw-JPEG Adapter: Efficient Raw Image Compression with JPEG
The Impact of 2D Segmentation Backbones on Point Cloud Predictions Using 4D Radar
Bias in the Picture: Benchmarking VLMs with Social-Cue News Images and LLM-as-Judge Assessment
Enhancing Transformer-Based Vision Models: Addressing Feature Map Anomalies Through Novel Optimization Strategies
From Prompt to Progression: Taming Video Diffusion Models for Seamless Attribute Transition
RealitySummary: Exploring On-Demand Mixed Reality Text Summarization and Question Answering using Large Language Models
Overview of LifeCLEF Plant Identification task 2020
iFinder: Structured Zero-Shot Vision-Based LLM Grounding for Dash-Cam Video Reasoning
How Well Can Reasoning Models Identify and Recover from Unhelpful Thoughts?
Augmenting Multi-Agent Communication with State Delta Trajectory
The Medium Is Not the Message: Deconfounding Document Embeddings via Linear Concept Erasure
Detecting Token-Level Hallucinations Using Variance Signals: A Reference-Free Approach
VisualTrap: A Stealthy Backdoor Attack on GUI Agents via Visual Grounding Manipulation
Dynamic Parameter Memory: Temporary LoRA-Enhanced LLM for Long-Sequence Emotion Recognition in Conversation
Date Fragments: A Hidden Bottleneck of Tokenization for Temporal Reasoning
Safeguarding Privacy of Retrieval Data against Membership Inference Attacks: Is This Query Too Close to Home?
LASER: Stratified Selective Sampling for Instruction Tuning with Dedicated Scoring Strategy
Advancing Expert Specialization for Better MoE
DyePack: Provably Flagging Test Set Contamination in LLMs Using Backdoors
Threading the Needle: Reweaving Chain-of-Thought Reasoning to Explain Human Label Variation
LegalSearchLM: Rethinking Legal Case Retrieval as Legal Elements Generation
RadialRouter: Structured Representation for Efficient and Robust Large Language Models Routing
DISCO Balances the Scales: Adaptive Domain- and Difficulty-Aware Reinforcement Learning on Imbalanced Data
Aligned Probing: Relating Toxic Behavior and Model Internals
Unifying Text Semantics and Graph Structures for Temporal Text-attributed Graphs with Large Language Models
Inverse Reinforcement Learning with Dynamic Reward Scaling for LLM Alignment
Playpen: An Environment for Exploring Learning Through Conversational Interaction
Small or Large? Zero-Shot or Finetuned? Guiding Language Model Choice for Specialized Applications in Healthcare
Meeseeks: A Feedback-Driven, Iterative Self-Correction Benchmark evaluating LLMs' Instruction Following Capability
Scent of Knowledge: Optimizing Search-Enhanced Reasoning with Information Foraging
SAFE: Improving LLM Systems using Sentence-Level In-generation Attribution
LLMs Reproduce Stereotypes of Sexual and Gender Minorities
BAP v2: An Enhanced Task Framework for Instruction Following in Minecraft Dialogues
LLMs as a synthesis between symbolic and distributed approaches to language
Bridging Information Gaps with Comprehensive Answers: Improving the Diversity and Informativeness of Follow-Up Questions
HoT: Highlighted Chain of Thought for Referencing Supporting Facts from Inputs
Large Language Models for Multilingual Previously Fact-Checked Claim Detection
Language Models Fail to Introspect About Their Knowledge of Language
Modeling Subjectivity in Cognitive Appraisal with Language Models
Discrete Diffusion for Reflective Vision-Language-Action Models in Autonomous Driving
Muse-it: A Tool for Analyzing Music Discourse on Reddit
TALEC: Teach Your LLM to Evaluate in Specific Domain with In-house Criteria by Criteria Division and Zero-shot Plus Few-shot
Context-Masked Meta-Prompting for Privacy-Preserving LLM Adaptation in Finance
Efficient Fine-Tuning of Large Language Models for Automated Medical Documentation
Evading Toxicity Detection with ASCII-art: A Benchmark of Spatial Attacks on Moderation Systems
UNComp: Can Matrix Entropy Uncover Sparsity? -- A Compressor Design from an Uncertainty-Aware Perspective
Blind Men and the Elephant: Diverse Perspectives on Gender Stereotypes in Benchmark Datasets
Understanding Before Reasoning: Enhancing Chain-of-Thought with Iterative Summarization Pre-Prompting
SIM-CoT: Supervised Implicit Chain-of-Thought
Z-Scores: A Metric for Linguistically Assessing Disfluency Removal
DRES: Benchmarking LLMs for Disfluency Removal
Morphological Synthesizer for Ge'ez Language: Addressing Morphological Complexity and Resource Limitations
EmbeddingGemma: Powerful and Lightweight Text Representations
Language Models that Think, Chat Better
STARQA: A Question Answering Dataset for Complex Analytical Reasoning over Structured Databases
Multimodal Language Models with Modality-Specific Experts for Financial Forecasting from Interleaved Sequences of Text and Time Series
Human-AI Narrative Synthesis to Foster Shared Understanding in Civic Decision-Making
Probing Gender Bias in Multilingual LLMs: A Case Study of Stereotypes in Persian
Thinking Augmented Pre-training
Play by the Type Rules: Inferring Constraints for LLM Functions in Declarative Programs
Low-Resource English-Tigrinya MT: Leveraging Multilingual Models, Custom Tokenizers, and Clean Evaluation Benchmarks
Investigating the Representation of Backchannels and Fillers in Fine-tuned Language Models
Instruction Boundary: Quantifying Biases in LLM Reasoning under Various Coverage
Feeding Two Birds or Favoring One? Adequacy-Fluency Tradeoffs in Evaluation and Meta-Evaluation of Machine Translation
Multilingual Hope Speech Detection: A Comparative Study of Logistic Regression, mBERT, and XLM-RoBERTa with Active Learning
From Input Perception to Predictive Insight: Modeling Model Blind Spots Before They Become Errors
From Text to Talk: Audio-Language Model Needs Non-Autoregressive Joint Training
Can Constructions "SCAN" Compositionality?
OLaPh: Optimal Language Phonemizer
Causal Understanding by LLMs: The Role of Uncertainty
Integrated Framework for LLM Evaluation with Answer Generation
Less is More: The Effectiveness of Compact Typological Language Representations
Embedding Domain Knowledge for Large Language Models via Reinforcement Learning from Augmented Generation
SwissGPC v1.0 -- The Swiss German Podcasts Corpus
Do Before You Judge: Self-Reference as a Pathway to Better LLM Evaluation
Future Policy Aware Preference Learning for Mathematical Reasoning
WEST: LLM based Speech Toolkit for Speech Understanding, Generation, and Interaction
CorIL: Towards Enriching Indian Language to Indian Language Parallel Corpora and Machine Translation Systems
The Knowledge-Behaviour Disconnect in LLM-based Chatbots
DiffNator: Generating Structured Explanations of Time-Series Differences
Tokenization and Representation Biases in Multilingual Models on Dialectal NLP Tasks
Responsible AI Technical Report
Personality Vector: Modulating Personality of Large Language Models by Model Merging
PART: Progressive Alignment Representation Training for Multilingual Speech-To-Text with LLMs
CHURRO: Making History Readable with an Open-Weight Large Vision-Language Model for High-Accuracy, Low-Cost Historical Text Recognition
EnAnchored-X2X: English-Anchored Optimization for Many-to-Many Translation
bi-GRPO: Bidirectional Optimization for Jailbreak Backdoor Injection on LLMs
Polarity Detection of Sustainable Development Goals in News Text
TianHui: A Domain-Specific Large Language Model for Diverse Traditional Chinese Medicine Scenarios
Mahānāma: A Unique Testbed for Literary Entity Discovery and Linking
Benchmarking Gaslighting Attacks Against Speech Large Language Models
SINAI at eRisk@CLEF 2025: Transformer-Based and Conversational Strategies for Depression Detection
Benchmarking ChatGPT and DeepSeek in April 2025: A Novel Dual Perspective Sentiment Analysis Using Lexicon-Based and Deep Learning Approaches
Characterizing Knowledge Graph Tasks in LLM Benchmarks Using Cognitive Complexity Frameworks
A Pipeline to Assess Merging Methods via Behavior and Internals
Do LLMs Encode Frame Semantics? Evidence from Frame Identification
Retrieval Augmented Generation based context discovery for ASR
ExPe: Exact Positional Encodings for Generative Transformer Models with Extrapolating Capabilities
LLMs4All: A Review on Large Language Models for Research and Applications in Academic Disciplines
Anatomy of a Feeling: Narrating Embodied Emotions via Large Vision-Language Models
Evaluating Language Translation Models by Playing Telephone
AutoSpec: An Agentic Framework for Automatically Drafting Patent Specification
Projective Kolmogorov Arnold Neural Networks (P-KANs): Entropy-Driven Functional Space Discovery for Interpretable Machine Learning
A Novel Short-Term Anomaly Prediction for IIoT with Software Defined Twin Network
First-Extinction Law for Resampling Processes
Hyperspectral Adapter for Semantic Segmentation with Vision Foundation Models
How Much of Your Data Can Suck? Thresholds for Domain Performance and Emergent Misalignment in LLMs
How Model Size, Temperature, and Prompt Style Affect LLM-Human Assessment Score Alignment
Performance of Large Language Models in Answering Critical Care Medicine Questions
Diffusion and Flow-based Copulas: Forgetting and Remembering Dependencies
Convex Regression with a Penalty
High-Dimensional Statistical Process Control via Manifold Fitting and Learning
Modeling and Control of Deep Sign-Definite Dynamics with Application to Hybrid Powertrain Control
Geometric Autoencoder Priors for Bayesian Inversion: Learn First Observe Later
BioBO: Biology-informed Bayesian Optimization for Perturbation Design
Anomaly Detection by Clustering DINO Embeddings using a Dirichlet Process Mixture
Table Detection with Active Learning
The Syntax and Semantics of einsum
Predictive Quality Assessment for Mobile Secure Graphics
Confidence Calibration in Large Language Model-Based Entity Matching
Stochastic Path Planning in Correlated Obstacle Fields
Uncertainty in Semantic Language Modeling with PIXELS
MAGIC: Multi-task Gaussian process for joint imputation and classification in healthcare time series
Discovery of Sustainable Refrigerants through Physics-Informed RL Fine-Tuning of Sequence Models
Graph-based Neural Space Weather Forecasting
EgoBridge: Domain Adaptation for Generalizable Imitation from Egocentric Human Data
Deep Learning for Clouds and Cloud Shadow Segmentation in Methane Satellite and Airborne Imaging Spectroscopy
Efficient Online Large-Margin Classification via Dual Certificates
Formal Safety Verification and Refinement for Generative Motion Planners via Certified Local Stabilization
A Statistical Mixture-of-Experts Framework for EMG Artifact Removal in EEG: Empirical Insights and a Proof-of-Concept Application
Hybrid Pipeline SWD Detection in Long-Term EEG Signals
SpellerSSL: Self-Supervised Learning with P300 Aggregation for Speller BCIs
Poster: ChatIYP: Enabling Natural Language Access to the Internet Yellow Pages Database
The Pareto Frontier of Resilient Jet Tagging
HUNT: High-Speed UAV Navigation and Tracking in Unstructured Environments via Instantaneous Relative Frames
The Platonic Universe: Do Foundation Models See the Same Sky?
Anchored Langevin Algorithms
Quantum Harmonic Analysis and the Structure in Data: Augmentation
OmniVLA: An Omni-Modal Vision-Language-Action Model for Robot Navigation
AnySafe: Adapting Latent Safety Filters at Runtime via Safety Constraint Parameterization in the Latent Space
STL-FFT-STFT-TCN-LSTM: An Effective Wave Height High Accuracy Prediction Model Fusing Time-Frequency Domain Features
Electric Vehicle Identification from Behind Smart Meter Data
A Spatio-Temporal Feature Fusion EEG Virtual Channel Signal Generation Network and Its Application in Anxiety Assessment
A Measurement Report Data-Driven Framework for Localized Statistical Channel Modeling
ShinkaEvolve: Towards Open-Ended And Sample-Efficient Program Evolution
LLM-Assisted Topic Reduction for BERTopic on Social Media Data
Low-Cost Sensor Fusion Framework for Organic Substance Classification and Quality Control Using Classification Methods
Short-Term Regional Electricity Demand Forecasting in Argentina Using LSTM Networks
Vision-Based Perception for Autonomous Vehicles in Off-Road Environment Using Deep Learning
Neural Network Based Framework for Passive Intermodulation Cancellation in MIMO Systems
When Judgment Becomes Noise: How Design Failures in LLM Judge Benchmarks Silently Undermine Validity
Alignment-Sensitive Minimax Rates for Spectral Algorithms with Learned Kernels
Graph Variate Neural Networks
A Recovery Guarantee for Sparse Neural Networks
Video models are zero-shot learners and reasoners
Feature Dynamics as Implicit Data Augmentation: A Depth-Decomposed View on Deep Neural Network Generalization
Uncovering Graph Reasoning in Decoder-only Transformers with Circuit Tracing
Spatio-Temporal Directed Graph Learning for Account Takeover Fraud Detection
Process-Informed Forecasting of Complex Thermal Dynamics in Pharmaceutical Manufacturing
Graph-Based Spatio-temporal Attention and Multi-Scale Fusion for Clinically Interpretable, High-Fidelity Fetal ECG Extraction
Time-adaptive HénonNets for separable Hamiltonian systems
Q-Palette: Fractional-Bit Quantizers Toward Optimal Bit Allocation for Efficient LLM Deployment
Beyond Sharp Minima: Robust LLM Unlearning via Feedback-Guided Multi-Point Optimization
A HyperGraphMamba-Based Multichannel Adaptive Model for ncRNA Classification
Energy Use of AI Inference: Efficiency Pathways and Test-Time Compute
Dynamic Lagging for Time-Series Forecasting in E-Commerce Finance: Mitigating Information Loss with A Hybrid ML Architecture
Failure Modes of Maximum Entropy RLHF
Predictive Coding-based Deep Neural Network Fine-tuning for Computationally Efficient Domain Adaptation
Extended Low-Rank Approximation Accelerates Learning of Elastic Response in Heterogeneous Materials
PGCLODA: Prompt-Guided Graph Contrastive Learning for Oligopeptide-Infectious Disease Association Prediction
One Filters All: A Generalist Filter for State Estimation
You Only Measure Once: On Designing Single-Shot Quantum Machine Learning Models
Incomplete Data, Complete Dynamics: A Diffusion Approach
Discovering Association Rules in High-Dimensional Small Tabular Data
Beyond Slater's Condition in Online CMDPs with Stochastic and Adversarial Constraints
Probability Signature: Bridging Data Semantics and Embedding Structure in Language Models
Generative Model Inversion Through the Lens of the Manifold Hypothesis
An Improved Time Series Anomaly Detection by Applying Structural Similarity
FairEquityFL -- A Fair and Equitable Client Selection in Federated Learning for Heterogeneous IoV Networks
Staying on the Manifold: Geometry-Aware Noise Injection
Practical do-Shapley Explanations with Estimand-Agnostic Causal Inference
Exploration with Foundation Models: Capabilities, Limitations, and Hybrid Approaches
MMSE-Calibrated Few-Shot Prompting for Alzheimer's Detection
TABFAIRGDT: A Fast Fair Tabular Data Generator using Autoregressive Decision Trees
How deep is your network? Deep vs. shallow learning of transfer operators
Learnable Sampler Distillation for Discrete Diffusion Models
From Samples to Scenarios: A New Paradigm for Probabilistic Forecasting
Faster Than SVD, Smarter Than SGD: The OPLoRA Alternating Update
RAD: Towards Trustworthy Retrieval-Augmented Multi-modal Clinical Diagnosis
Pi-Transformer: A Physics-informed Attention Mechanism for Time Series Anomaly Detection
Learning Robust Penetration-Testing Policies under Partial Observability: A systematic evaluation
Diffusion-Augmented Contrastive Learning: A Noise-Robust Encoder for Biosignal Representations
BoreaRL: A Multi-Objective Reinforcement Learning Environment for Climate-Adaptive Boreal Forest Management
Analyzing Generalization in Pre-Trained Symbolic Regression
Oversampling and Downsampling with Core-Boundary Awareness: A Data Quality-Driven Approach
Advancing Universal Deep Learning for Electronic-Structure Hamiltonian Prediction of Materials
MCGrad:: Multicalibration at Web Scale
Towards Self-Supervised Foundation Models for Critical Care Time Series
PromptCoT 2.0: Scaling Prompt Synthesis for Large Language Model Reasoning
Pure Exploration via Frank-Wolfe Self-Play
Latent Iterative Refinement Flow: A Geometric-Constrained Approach for Few-Shot Generation
On the Fragility of Contribution Score Computation in Federated Learning
Revisiting Performance Claims for Chest X-Ray Models Using Clinical Context
C²Prompt: Class-aware Client Knowledge Interaction for Federated Continual Learning
Frictional Q-Learning
Sobolev acceleration for neural networks
PPGFlowECG: Latent Rectified Flow with Cross-Modal Encoding for PPG-Guided ECG Generation and Cardiovascular Disease Detection
Faster, Smaller, and Smarter: Task-Aware Expert Merging for Online MoE Inference
RDAR: Reward-Driven Agent Relevance Estimation for Autonomous Driving
VCRL: Variance-based Curriculum Reinforcement Learning for Large Language Models
An Efficient Conditional Score-based Filter for High Dimensional Nonlinear Filtering Problems
On the Rate of Convergence of Kolmogorov-Arnold Network Regression Estimators
Frame-based Equivariant Diffusion Models for 3D Molecular Generation
Metriplectic Conditional Flow Matching for Dissipative Dynamics
Modular Machine Learning with Applications to Genetic Circuit Composition
Improved Therapeutic Antibody Reformatting through Multimodal Machine Learning
Adaptive von Mises-Fisher Likelihood Loss for Supervised Deep Time Series Hashing
TIMED: Adversarial and Autoregressive Refinement of Diffusion-Based Time Series Generation
Toward Scalable and Structured Global Station Weather Forecasting
Symbol-Temporal Consistency Self-supervised Learning for Robust Time Series Classification
Consistent Estimation of Numerical Distributions under Local Differential Privacy by Wavelet Expansion
Dynamicasome: a molecular dynamics-guided and AI-driven pathogenicity prediction catalogue for all genetic mutations
FusedANN: Convexified Hybrid ANN via Attribute-Vector Fusion
Enhancing Credit Default Prediction Using Boruta Feature Selection and DBSCAN Algorithm with Different Resampling Techniques
Analyzing Uncertainty Quantification in Statistical and Deep Learning Models for Probabilistic Electricity Price Forecasting
THINNs: Thermodynamically Informed Neural Networks
Transformer Modeling for Both Scalability and Performance in Multivariate Time Series
Constraint-Reduced MILP with Local Outlier Factor Modeling for Plausible Counterfactual Explanations in Credit Approval
Diffusion-Based Impedance Learning for Contact-Rich Manipulation Tasks
A Unified Noise-Curvature View of Loss of Trainability
Linear Transformers Implicitly Discover Unified Numerical Algorithms
Causal Machine Learning for Surgical Interventions
Intuition to Evidence: Measuring AI's True Impact on Developer Productivity
SMILES-Inspired Transfer Learning for Quantum Operators in Generative Quantum Eigensolver
HiCoLoRA: Addressing Context-Prompt Misalignment via Hierarchical Collaborative LoRA for Zero-Shot DST
Cuffless Blood Pressure Prediction from Speech Sentences using Deep Learning Methods
ExpFace: Exponential Angular Margin Loss for Deep Face Recognition
ARCADE: A Real-Time Data System for Hybrid and Continuous Query Processing across Diverse Data Modalities
Are We Scaling the Right Thing? A System Perspective on Test-Time Scaling
Where 6G Stands Today: Evolution, Enablers, and Research Gaps
Large Language Models for Pedestrian Safety: An Application to Predicting Driver Yielding Behavior at Unsignalized Intersections
RoboSSM: Scalable In-context Imitation Learning via State-Space Models
MoTiC: Momentum Tightness and Contrast for Few-Shot Class-Incremental Learning
Selective Classifier-free Guidance for Zero-shot Text-to-speech
Games Are Not Equal: Classifying Cloud Gaming Contexts for Effective User Experience Measurement
Thinking While Listening: Simple Test Time Scaling For Audio Classification
PolicyPad: Collaborative Prototyping of LLM Policies
DyBBT: Dynamic Balance via Bandit inspired Targeting for Dialog Policy with Cognitive Dual-Systems
DAWM: Diffusion Action World Models for Offline Reinforcement Learning via Action-Inferred Transitions
Learning Dynamics of Deep Learning -- Force Analysis of Deep Neural Networks
A Foundation Chemical Language Model for Comprehensive Fragment-Based Drug Discovery
Reverse Engineering User Stories from Code using Large Language Models
Frame-Stacked Local Transformers For Efficient Multi-Codebook Speech Generation
GuessingGame: Measuring the Informativeness of Open-Ended Questions in Large Language Models
Knowledge Base-Aware Orchestration: A Dynamic, Privacy-Preserving Method for Multi-Agent Systems
Advancing Speech Summarization in Multi-modal LLMs with Reinforcement Learning
Mamba Modulation: On the Length Generalization of Mamba
ROPA: Synthetic Robot Pose Generation for RGB-D Bimanual Data Augmentation
Self-evolved Imitation Learning in Simulated World
A Realistic Evaluation of Cross-Frequency Transfer Learning and Foundation Forecasting Models
Identifying and Addressing User-level Security Concerns in Smart Homes Using "Smaller" LLMs
ArtiFree: Detecting and Reducing Generative Artifacts in Diffusion-based Speech Enhancement
Generative AI as a catalyst for democratic Innovation: Enhancing citizen engagement in participatory budgeting
AIRwaves at CheckThat! 2025: Retrieving Scientific Sources for Implicit Claims on Social Media with Dual Encoders and Neural Re-Ranking
The Heterogeneous Multi-Agent Challenge
A Longitudinal Randomized Control Study of Companion Chatbot Use: Anthropomorphism and Its Mediating Role on Social Impacts
Semantic-Aware Fuzzing: An Empirical Framework for LLM-Guided, Reasoning-Driven Input Mutation
TensLoRA: Tensor Alternatives for Low-Rank Adaptation
OmniFed: A Modular Framework for Configurable Federated Learning from Edge to HPC
Self-Alignment Learning to Improve Myocardial Infarction Detection from Single-Lead ECG
FedOC: Multi-Server FL with Overlapping Client Relays in Wireless Edge Networks
Online Adaptation via Dual-Stage Alignment and Self-Supervision for Fast-Calibration Brain-Computer Interfaces
Improving Outdoor Multi-cell Fingerprinting-based Positioning via Mobile Data Augmentation
TimeMosaic: Temporal Heterogeneity Guided Time Series Forecasting via Adaptive Granularity Patch and Segment-wise Decoding
EngravingGNN: A Hybrid Graph Neural Network for End-to-End Piano Score Engraving
Probabilistic Runtime Verification, Evaluation and Risk Assessment of Visual Deep Learning Systems
Learning from Observation: A Survey of Recent Advances
Data-Driven Reconstruction of Significant Wave Heights from Sparse Observations
Unsupervised Outlier Detection in Audit Analytics: A Case Study Using USA Spending Data
Pipeline Parallelism is All You Need for Optimized Early-Exit Based Self-Speculative Decoding
SLM-Based Agentic AI with P-C-G: Optimized for Korean Tool Use
Meow: End-to-End Outline Writing for Automatic Academic Survey
How to inject knowledge efficiently? Knowledge Infusion Scaling Law for Pre-training Large Language Models
Representation-based Broad Hallucination Detectors Fail to Generalize Out of Distribution
Uncertainty Quantification of Large Language Models using Approximate Bayesian Computation
Solving Freshness in RAG: A Simple Recency Prior and the Limits of Heuristic Trend Detection
The Impact of Structural Changes on Learning Capacity in the Fly Olfactory Neural Circuit
TriSPrompt: A Hierarchical Soft Prompt Model for Multimodal Rumor Detection with Incomplete Modalities
RoadMind: Towards a Geospatial AI Expert for Disaster Response
Benchmarking and Improving LLM Robustness for Personalized Generation
Anti-Money Laundering Systems Using Deep Learning
Semantic Representation Attack against Aligned Large Language Models
DeepACTIF: Efficient Feature Attribution via Activation Traces in Neural Sequence Models
Analyzing the Impact of Credit Card Fraud on Economic Fluctuations of American Households Using an Adaptive Neuro-Fuzzy Inference System
The Inadequacy of Offline LLM Evaluations: A Need to Account for Personalization in Model Behavior
Quantifying Compositionality of Classic and State-of-the-Art Embeddings
Pluralistic Off-policy Evaluation and Alignment
CSIYOLO: An Intelligent CSI-based Scatter Sensing Framework for Integrated Sensing and Communication Systems
Cognitive-Level Adaptive Generation via Capability-Aware Retrieval and Style Adaptation
Radio Propagation Modelling: To Differentiate or To Deep Learn, That Is The Question
Multi-population Ensemble Genetic Programming via Cooperative Coevolution and Multi-view Learning for Classification
Joint Channel Estimation and Computation Offloading in Fluid Antenna-assisted MEC Networks
Fine-Grained AI Model Caching and Downloading With Coordinated Multipoint Broadcasting in Multi-Cell Edge Networks
Part-of-speech tagging for Nagamese Language using CRF
SCORE: A Semantic Evaluation Framework for Generative Document Parsing
Automated Item Neutralization for Non-Cognitive Scales: A Large Language Model Approach to Reducing Social-Desirability Bias
Advancing Few-Shot Pediatric Arrhythmia Classification with a Novel Contrastive Loss and Multimodal Learning
FHIR-AgentBench: Benchmarking LLM Agents for Realistic Interoperable EHR Question Answering
Readme_AI: Dynamic Context Construction for Large Language Models
Magnitude Matters: a Superior Class of Similarity Metrics for Holistic Semantic Understanding
Unveiling the Merits and Defects of LLMs in Automatic Review Generation for Scientific Papers
A systematic review of trial-matching pipelines using large language models
Human Activity Recognition Based on Electrocardiogram Data Only
LibEMER: A novel benchmark and algorithms library for EEG-based Multimodal Emotion Recognition
Holographic Transformers for Complex-Valued Signal Processing: Integrating Phase Interference into Self-Attention
Steerable Adversarial Scenario Generation through Test-Time Preference Alignment
PEPS: Quantum-Inspired Reinforcement Learning for Coherent Reasoning Traces in LLMs
Formal Verification of Minimax Algorithms
Federation of Agents: A Semantics-Aware Communication Fabric for Large-Scale Agentic AI
Design Insights and Comparative Evaluation of a Hardware-Based Cooperative Perception Architecture for Lane Change Prediction
Scan-do Attitude: Towards Autonomous CT Protocol Management using a Large Language Model Agent
LLMs as verification oracles for Solidity
Wavelet Fourier Diffuser: Frequency-Aware Diffusion Model for Reinforcement Learning
A Federated Fine-Tuning Paradigm of Foundation Models in Heterogenous Wireless Networks
E2E Learning Massive MIMO for Multimodal Semantic Non-Orthogonal Transmission and Fusion
Calibrated Reasoning: An Explanatory Verifier for Dynamic and Efficient Problem-Solving
UserRL: Training Interactive User-Centric Agent via Reinforcement Learning
The Conductor and the Engine: A Path Towards Co-Designed Reasoning
Agentic Metacognition: Designing a "Self-Aware" Low-Code Agent for Failure Prediction and Human Handoff
Analysis of approximate linear programming solution to Markov decision problem with log barrier function
LatentGuard: Controllable Latent Steering for Robust Refusal of Attacks and Reliable Response Generation
CON-QA: Privacy-Preserving QA using cloud LLMs in Contract Domain
Embodied AI: From LLMs to World Models
MACD: Multi-Agent Clinical Diagnosis with Self-Learned Knowledge for LLM
From Pheromones to Policies: Reinforcement Learning for Engineered Biological Swarms
The Indispensable Role of User Simulation in the Pursuit of AGI
Evaluation-Aware Reinforcement Learning
Estimating the Self-Consistency of LLMs
Cognitive Load Limits in Large Language Models: Benchmarking Multi-Hop Reasoning
Score the Steps, Not Just the Goal: VLM-Based Subgoal Evaluation for Robotic Manipulation
Nano Bio-Agents (NBA): Small Language Model Agents for Genomics
What Does Your Benchmark Really Measure? A Framework for Robust Inference of AI Capabilities
SteinerSQL: Graph-Guided Mathematical Reasoning for Text-to-SQL Generation

Research Sources: 519 | Generated: 9/25/2025