AI Research News Feeds for September 5th, 2025

AI RESEARCH PAPERS & ACADEMIC SOURCES

We Have It Covered: A Resampling-based Method for Uplift Model Comparison
Estimation of High-Dimensional Markov-Switching VAR Models with an Approximate EM Algorithm
Bootstrapping the Cross-Validation Estimate
FastPart: Over-Parameterized Stochastic Gradient Descent for Sparse optimisation on Measures
Improved sampling algorithms and Poincar\'e inequalities for non-log-concave distributions
Imitating Radiological Scrolling: A Global-Local Attention Model for 3D Chest CT Volumes Multi-Label Anomaly Classification
POET: Supporting Prompting Creativity and Personalization with Automated Expansion of Text-to-Image Generation
Completing Spatial Transcriptomics Data for Gene Expression Prediction Benchmarking
Res-MoCoDiff: Residual-guided diffusion models for motion artifact correction in brain MRI
Deep Learning Advances in Vision-Based Traffic Accident Anticipation: A Comprehensive Review of Methods, Datasets, and Future Directions
From Embeddings to Accuracy: Comparing Foundation Models for Radiographic Classification
Hallo4: High-Fidelity Dynamic Portrait Animation via Direct Preference Optimization and Temporal Motion Modulation
Vision-Based Autonomous MM-Wave Reflector Using ArUco-Driven Angle-of-Arrival Estimation
Towards Controllable Real Image Denoising with Camera Parameters
Ecological Legacies of Pre-Columbian Settlements Evident in Palm Clusters of Neotropical Mountain Forests
Spatial-aware Transformer-GRU Framework for Enhanced Glaucoma Diagnosis from 3D OCT Imaging
A Framework for Supervised and Unsupervised Segmentation and Classification of Materials Microstructure Images
Separate to Collaborate: Dual-Stream Diffusion Model for Coordinated Piano Hand Motion Synthesis
Simulation-based Inference via Langevin Dynamics with Score Matching
Prob-GParareal: A Probabilistic Numerical Parallel-in-Time Solver for Differential Equations
Towards understanding Accelerated Stein Variational Gradient Flow -- Analysis of Generalized Bilinear Kernels for Gaussian target distributions
Sharp Convergence Rates of Empirical Unbalanced Optimal Transport for Spatio-Temporal Point Processes
AnomalyLMM: Bridging Generative Knowledge and Discriminative Retrieval for Text-Based Person Anomaly Search
Aesthetic Image Captioning with Saliency Enhanced MLLMs
Learning neural representations for X-ray ptychography reconstruction with unknown probes
Few-step Flow for 3D Generation via Marginal-Data Transport Distillation
Durian: Dual Reference-guided Portrait Animation with Attribute Transfer
From Lines to Shapes: Geometric-Constrained Segmentation of X-Ray Collimators via Hough Transform
One Flight Over the Gap: A Survey from Perspective to Panoramic Vision
Plot'n Polish: Zero-shot Story Visualization and Disentangled Editing with Text-to-Image Diffusion Models
TRUST-VL: An Explainable News Assistant for General Multimodal Misinformation Detection
Revealing Fine Structure in Protoplanetary Disks with Physics Constrained Neural Fields
ContraGS: Codebook-Condensed and Trainable Gaussian Splatting for Fast, Memory-Efficient Reconstruction
SMooGPT: Stylized Motion Generation using Large Language Models
Hyper Diffusion Avatars: Dynamic Human Avatar Generation using Network Weight Space Diffusion
OVGrasp: Open-Vocabulary Grasping Assistance via Multimodal Intent Detection
Global-to-Local or Local-to-Global? Enhancing Image Retrieval with Efficient Local Search and Effective Global Re-ranking
Accurate and lightweight dehazing via multi-receptive-field non-local network and novel contrastive regularization
Replication Study and Benchmarking of Real-Time Object Detection Models
BOSC: A Backdoor-based Framework for Open Set Synthetic Image Attribution
SPARE: Symmetrized Point-to-Plane Distance for Robust Non-Rigid Registration
FADE: A Dataset for Detecting Falling Objects around Buildings in Video
Enhanced Generative Data Augmentation for Semantic Segmentation via Stronger Guidance
OFTSR: One-Step Flow for Image Super-Resolution with Tunable Fidelity-Realism Trade-offs
Sat-DN: Implicit Surface Reconstruction from Multi-View Satellite Images with Depth and Normal Supervision
ARTalk: Speech-Driven 3D Head Animation via Autoregressive Model
Fast rigid alignment of heterogeneous images in sliced Wasserstein distance
Attn-Adapter: Attention Is All You Need for Online Few-shot Learner of Vision-Language Model
A Generative Foundation Model for Chest Radiography
LMVC: An End-to-End Learned Multiview Video Coding Framework
TopoSculpt: Betti-Steered Topological Sculpting of 3D Fine-grained Tubular Shapes
ANTS: Shaping the Adaptive Negative Textual Space by MLLM for OOD Detection
Improving Vessel Segmentation with Multi-Task Learning and Auxiliary Data Available Only During Model Training
SliceSemOcc: Vertical Slice Based Multimodal 3D Semantic Occupancy Representation
Learning from Majority Label: A Novel Problem in Multi-class Multiple-Instance Learning
Millisecond-Response Tracking and Gazing System for UAVs: A Domestic Solution Based on "Phytium + Cambricon"
A Re-ranking Method using K-nearest Weighted Fusion for Person Re-identification
TEn-CATS: Text-Enriched Audio-Visual Video Parsing with Multi-Scale Category-Aware Temporal Graph
TriLiteNet: Lightweight Model for Multi-Task Visual Perception
DVS-PedX: Synthetic-and-Real Event-Based Pedestrian Dataset
TaleDiffusion: Multi-Character Story Generation with Dialogue Rendering
Revisiting Simple Baselines for In-The-Wild Deepfake Detection
Differential Morphological Profile Neural Networks for Semantic Segmentation
TauGenNet: Plasma-Driven Tau PET Image Synthesis via Text-Guided 3D Diffusion Models
Dual-Scale Volume Priors with Wasserstein-Based Consistency for Semi-Supervised Medical Image Segmentation
PAOLI: Pose-free Articulated Object Learning from Sparse-view Images
Noisy Label Refinement with Semantically Reliable Synthetic Images
Efficient Odd-One-Out Anomaly Detection
GeoArena: An Open Platform for Benchmarking Large Vision-language Models on WorldWide Image Geolocalization
MICACL: Multi-Instance Category-Aware Contrastive Learning for Long-Tailed Dynamic Facial Expression Recognition
Stitching the Story: Creating Panoramic Incident Summaries from Body-Worn Footage
DAPFAM: A Domain-Aware Family-level Dataset to benchmark cross domain patent retrieval
Towards Efficient General Feature Prediction in Masked Skeleton Modeling
Teacher-Student Model for Detecting and Classifying Mitosis in the MIDOG 2025 Challenge
Multi Attribute Bias Mitigation via Representation Learning
Lightweight image segmentation for echocardiography
Reg3D: Reconstructive Geometry Instruction Tuning for 3D Scene Understanding
QuantV2X: A Fully Quantized Multi-Agent System for Cooperative Perception
Transfer Learning-Based CNN Models for Plant Species Identification Using Leaf Venation Patterns
LayoutGKN: Graph Similarity Learning of Floor Plans
SLENet: A Guidance-Enhanced Network for Underwater Camouflaged Object Detection
Fitting Image Diffusion Models on Video Datasets
MedVista3D: Vision-Language Modeling for Reducing Diagnostic Errors in 3D CT Disease Detection, Understanding and Reporting
Causality-guided Prompt Learning for Vision-language Models via Visual Granulation
EGTM: Event-guided Efficient Turbulence Mitigation
Focus Through Motion: RGB-Event Collaborative Token Sparsification for Efficient Object Detection
OccTENS: 3D Occupancy World Model via Temporal Next-Scale Prediction
Weakly-Supervised Learning of Dense Functional Correspondences
Measuring Bias or Measuring the Task: Understanding the Brittle Nature of LLM Gender Biases
Can Language Models Handle a Non-Gregorian Calendar?
Singular Value Few-shot Adaptation of Vision-Language Models
Evaluating the Robustness of Retrieval-Augmented Generation to Adversarial Evidence in the Health Domain
SPECS: Specificity-Enhanced CLIP-Score for Long Image Caption Evaluation
LibriQuote: A Speech Dataset of Fictional Character Utterances for Expressive Zero-Shot Speech Synthesis
Contextualized Token Discrimination for Speech Search Query Correction
Self-adaptive Dataset Construction for Real-World Multimodal Safety Scenarios
The Telephone Game: Evaluating Semantic Drift in Unified Models
MyProfessors: Mining Turkish Student Reviews
Mitigating Bias in Text Classification via Prompt-Based Text Transformation
Exploring Linguistic Features for Turkish Text Readability
R2C2-Coder: Enhancing and Benchmarking Real-world Repository-level Code Completion Abilities of Code Large Language Models
DynaSaur: Large Language Agents Beyond Predefined Actions
Small Changes, Large Consequences: Analyzing the Allocational Fairness of LLMs in Hiring Contexts
HamRaz: A Culture-Based Persian Conversation Dataset for Person-Centered Therapy Using LLM Agents
HalluEntity: Benchmarking and Understanding Entity-Level Hallucination Detection
Autoformalization in the Wild: Assessing LLMs on Real-World Mathematical Definitions
Improving Chain-of-Thought Reasoning via Quasi-Symbolic Abstractions
Explicit Learning and the LLM in Machine Translation
EQ-Knight: A Memory-Augmented LLM Agent for Strategic Affective Gaming in Debt Recovery
Context Reasoner: Incentivizing Reasoning Capability for Contextualized Privacy and Safety Compliance via Reinforcement Learning
MEDUSA: A Multimodal Deep Fusion Multi-Stage Training Framework for Speech Emotion Recognition in Naturalistic Conditions
NoteBar: An AI-Assisted Note-Taking System for Personal Knowledge Management
Semantic Analysis of SNOMED CT Concept Co-occurrences in Clinical Documentation using MIMIC-IV
NE-PADD: Leveraging Named Entity Knowledge for Robust Partial Audio Deepfake Detection via Attention Aggregation
Drivel-ology: Challenging LLMs with Interpreting Nonsense with Depth
False Sense of Security: Why Probing-based Malicious Input Detection Fails to Generalize
MobileRAG: Enhancing Mobile Agent with Retrieval-Augmented Generation
Exploring NLP Benchmarks in an Extremely Low-Resource Setting
A RoBERTa-Based Functional Syntax Annotation Model for Chinese Texts
Synthesizing Sheet Music Problems for Evaluation and Reinforcement Learning
Improving Narrative Classification and Explanation via Fine Tuned Language Models
Towards Stable and Personalised Profiles for Lexical Alignment in Spoken Human-Agent Dialogue
MultiWikiQA: A Reading Comprehension Benchmark in 300+ Languages
Joint Modeling of Entities and Discourse Relations for Coherence Assessment
Explicit and Implicit Data Augmentation for Social Event Detection
Inverse IFEval: Can LLMs Unlearn Stubborn Training Conventions to Follow Real Instructions?
Integrating Intermediate Layer Optimization and Projected Gradient Descent for Solving Inverse Problems with Diffusion Models
The Strong, Weak and Benign Goodhart's law. An independence-free and paradigm-agnostic formalisation
A theoretical basis for model collapse in recursive training
Machine Intelligence on Wireless Edge Networks
Asymptotic convexity of wide and shallow neural networks
Enhancing Speech Large Language Models through Reinforced Behavior Alignment
Reading Between the Signs: Predicting Future Suicidal Ideation from Adolescent Social Media Texts
ResearchPulse: Building Method-Experiment Chains through Multi-Document Scientific Inference
Understanding sparse autoencoder scaling in the presence of feature manifolds
Straighter Flow Matching via a Diffusion-Based Coupling Prior
Vision-based Manipulation from Single Human Video with Open-World Object Graphs
Convergence of Unadjusted Langevin in High Dimensions: Delocalization of Bias
ConServe: Fine-Grained GPU Harvesting for LLM Online and Offline Co-Serving
dsld: A Socially Relevant Tool for Teaching Statistics
Hardware-Friendly Diffusion Models with Fixed-Size Reusable Structures for On-Device Image Generation
Exposing Synthetic Speech: Model Attribution and Detection of AI-generated Speech via Audio Fingerprints
An Unsupervised Natural Language Processing Pipeline for Assessing Referral Appropriateness
Large Language Models for Cryptocurrency Transaction Analysis: A Bitcoin Case Study
T-cell receptor specificity landscape revealed through de novo peptide design
FutureGen: A RAG-based Approach to Generate the Future Work of Scientific Article
Short-video Propagation Influence Rating: A New Real-world Dataset and A New Large Graph Model
Deliberate Planning of 3D Bin Packing on Packing Configuration Trees
Closed-Loop Neural Operator-Based Observer of Traffic Density
Revealing the empirical flexibility of gas units through deep clustering
A dynamic view of some anomalous phenomena in SGD
Enhancing Text2Cypher with Schema Filtering
Text2Cypher: Data Pruning using Hard Example Selection
DUDE: Diffusion-Based Unsupervised Cross-Domain Image Retrieval
Batched Stochastic Matching Bandits
COBRA: Multimodal Sensing Deep Learning Framework for Remote Chronic Obesity Management via Wrist-Worn Activity Monitoring
Sailing Towards Zero-Shot State Estimation using Foundation Models Combined with a UKF
Connections between reinforcement learning with feedback,test-time scaling, and diffusion guidance: An anthology
SAFE--MA--RRT: Multi-Agent Motion Planning with Data-Driven Safety Certificates
Virtual Fitting Room: Generating Arbitrarily Long Videos of Virtual Try-On from a Single Image -- Technical Preview
Reservoir kernels and Volterra series
Towards Robust Graph Structural Learning Beyond Homophily via Preserving Neighbor Similarity
Moco: A Learnable Meta Optimizer for Combinatorial Optimization
Explaining Length Bias in LLM-Based Preference Evaluations
Uncertainty-Guided Likelihood Tree Search
Retrieval-Augmented Generation with Estimation of Source Reliability
Zero-shot Generalization in Inventory Management: Train, then Estimate and Decide
MARS: Unleashing the Power of Variance Reduction for Training Large Models
Multi-Label Bayesian Active Learning with Inter-Label Relationships
Dataset Distillation as Pushforward Optimal Quantization
IC-Cache: Efficient Large Language Model Serving via In-context Caching
Probabilistic QoS Metric Forecasting in Delay-Tolerant Networks Using Conditional Diffusion Models on Latent Dynamics
Technology prediction of a 3D model using Neural Network
Is Random Attention Sufficient for Sequence Modeling? Disentangling Trainable Components in the Transformer
Federated Isolation Forest for Efficient Anomaly Detection on Edge IoT Systems
Plugging Attention into Power Grids: Towards Transparent Forecasting
Recursive Reward Aggregation
Topic Identification in LLM Input-Output Pairs through the Lens of Information Bottleneck
An exact multiple-time-step variational formulation for the committor and the transition rate
Combining feature-based approaches with graph neural networks and symbolic regression for synergistic performance and interpretability
Predicting Antimicrobial Resistance (AMR) in Campylobacter, a Foodborne Pathogen, and Cost Burden Analysis Using Machine Learning
Exoplanetary atmospheres retrieval via a quantum extreme learning machine
Accurate and scalable deep Maxwell solvers using multilevel iterative methods
ACT: Automated Constraint Targeting for Multi-Objective Recommender Systems
Energy-Weighted Flow Matching: Unlocking Continuous Normalizing Flows for Efficient and Scalable Boltzmann Sampling
Hypothesis Selection: A High Probability Conundrum
LLM-based Relevance Assessment for Web-Scale Search Evaluation at Pinterest
Deficiency of equation-finding approach to data-driven modeling of dynamical systems
Testing for correlation between network structure and high-dimensional node covariates
Finetuning AI Foundation Models to Develop Subgrid-Scale Parameterizations: A Case Study on Atmospheric Gravity Waves
Reservoir Predictive Path Integral Control for Unknown Nonlinear Dynamics
Hardware-Aware Data and Instruction Mapping for AI Tasks: Balancing Parallelism, I/O and Memory Tradeoffs
Sample Efficient Certification of Discrete-Time Control Barrier Functions
An invertible generative model for forward and inverse problems
Decoding the Poetic Language of Emotion in Korean Modern Poetry: Insights from a Human-Labeled Dataset and AI Modeling
LMAE4Eth: Generalizable and Robust Ethereum Fraud Detection by Exploring Transaction Semantics and Masked Graph Embedding
Divergence-Kernel method for linear responses and diffusion models
What if I ask in \textit{alia lingua}? Measuring Functional Similarity Across Languages
TensoIS: A Step Towards Feed-Forward Tensorial Inverse Subsurface Scattering for Perlin Distributed Heterogeneous Media
Balancing Signal and Variance: Adaptive Offline RL Post-Training for VLA Flow Models
Gromov-Wasserstein and optimal transport: from assignment problems to probabilistic numeric
Shuffling Heuristic in Variational Inequalities: Establishing New Convergence Guarantees
Unobtrusive In-Situ Measurement of Behavior Change by Deep Metric Similarity Learning of Motion Patterns
KubeGuard: LLM-Assisted Kubernetes Hardening via Configuration Files and Runtime Logs Analysis
Formal Verification of Local Robustness of a Classification Algorithm for a Spatial Use Case
On Aligning Prediction Models with Clinical Experiential Learning: A Prostate Cancer Case Study
FedQuad: Federated Stochastic Quadruplet Learning to Mitigate Data Heterogeneity
Synthetic Counterfactual Labels for Efficient Conformal Counterfactual Inference
Who Pays for Fairness? Rethinking Recourse under Social Burden
Privacy Risks in Time Series Forecasting: User- and Record-Level Membership Inference
Comment on "A Note on Over-Smoothing for Graph Neural Networks"
Set Block Decoding is a Language Model Inference Accelerator
One-Embedding-Fits-All: Efficient Zero-Shot Time Series Forecasting by a Model Zoo
Why Can't I See My Clusters? A Precision-Recall Approach to Dimensionality Reduction Validation
Rethinking the long-range dependency in Mamba/SSM and transformer models
Rethinking Layer-wise Gaussian Noise Injection: Bridging Implicit Objectives and Privacy Budget Allocation
Synthetic Survival Data Generation for Heart Failure Prognosis Using Deep Generative Models
RL's Razor: Why Online Reinforcement Learning Forgets Less
An Interactive Framework for Finding the Optimal Trade-off in Differential Privacy
A Primer on Causal and Statistical Dataset Biases for Fair and Robust Image Analysis
Using causal abstractions to accelerate decision-making in complex bandit problems
Characteristic Energy Behavior Profiling of Non-Residential Buildings
When three experiments are better than two: Avoiding intractable correlated aleatoric uncertainty by leveraging a novel bias--variance tradeoff
PagedEviction: Structured Block-wise KV Cache Pruning for Efficient Large Language Model Inference
Transition Models: Rethinking the Generative Learning Objective
Interpretable Clustering with Adaptive Heterogeneous Causal Structure Learning in Mixed Observational Data
Echo State Networks as State-Space Models: A Systems Perspective
Unveiling the Role of Data Uncertainty in Tabular Deep Learning
Towards Cognitively-Faithful Decision-Making Models to Improve AI Alignment
A Small Dataset May Go a Long Way: Process Duration Prediction in Clinical Settings
The ProLiFIC dataset: Leveraging LLMs to Unveil the Italian Lawmaking Process
First Order Model-Based RL through Decoupled Backpropagation
AImoclips: A Benchmark for Evaluating Emotion Conveyance in Text-to-Music Generation
EZhouNet:A framework based on graph neural network and anchor interval for the respiratory sound event detection
AudioCodecBench: A Comprehensive Benchmark for Audio Codec Evaluation
Nonnegative matrix factorization and the principle of the common cause
Semi-decentralized Federated Time Series Prediction with Client Availability Budgets
AutoGrid AI: Deep Reinforcement Learning Framework for Autonomous Microgrid Management
SharedRep-RLHF: A Shared Representation Approach to RLHF with Diverse Preferences
A Machine Learning-Based Study on the Synergistic Optimization of Supply Chain Management and Financial Supply Chains from an Economic Perspective
A Comprehensive Review of Multi-Agent Reinforcement Learning in Video Games
Graph Random Features for Scalable Gaussian Processes
EmbedOR: Provable Cluster-Preserving Visualizations with Curvature-Based Stochastic Neighbor Embeddings
Online Learning of Optimal Sequential Testing Policies
Mapping on a Budget: Optimizing Spatial Data Collection for ML
Learning functions through Diffusion Maps
Online time series prediction using feature adjustment
Machine Learning for LiDAR-Based Indoor Surface Classification in Intelligent Wireless Environments
Predicting Traffic Accident Severity with Deep Neural Networks
Vehicle-to-Infrastructure Collaborative Spatial Perception via Multimodal Large Language Models
Data-Augmented Quantization-Aware Knowledge Distillation
Topotein: Topological Deep Learning for Protein Representation Learning
Mistake-bounded online learning with operation caps
Breaking the Context Bottleneck on Long Time Series Forecasting
A Survey of Graph Retrieval-Augmented Generation for Customized Large Language Models
Is an Ultra Large Natural Image-Based Foundation Model Superior to a Retina-Specific Model for Detecting Ocular and Systemic Diseases?
Image Embedding Sampling Method for Diverse Captioning
CoDiff: Conditional Diffusion Model for Collaborative 3D Object Detection
FRIDA to the Rescue! Analyzing Synthetic Data Effectiveness in Object-Based Common Sense Reasoning for Disaster Response
KNighter: Transforming Static Analysis with LLM-Synthesized Checkers
Beyond holography: the entropic quantum gravity foundations of image processing
Robust Offline Imitation Learning Through State-level Trajectory Stitching
RBT4DNN: Requirements-based Testing of Neural Networks
Transferable Mask Transformer: Cross-domain Semantic Segmentation with Region-adaptive Transferability Estimation
Optimization of Module Transferability in Single Image Super-Resolution: Universality Assessment and Cycle Residual Blocks
Evaluating the Efficacy of LLM-Based Reasoning for Multiobjective HPC Job Scheduling
MiniCPM4: Ultra-Efficient LLMs on End Devices
Auto-Regressive vs Flow-Matching: a Comparative Study of Modeling Paradigms for Text-to-Music Generation
Stochastic Parameter Decomposition
An Analysis of Action-Value Temporal-Difference Methods That Learn State Values
TriCLIP-3D: A Unified Parameter-Efficient Framework for Tri-Modal 3D Visual Grounding based on CLIP
Conditional Video Generation for High-Efficiency Video Compression
(Ir)rationality in AI: State of the Art, Research Challenges and Open Questions
PIN: A Knowledge-Intensive Dataset for Paired and Interleaved Multimodal Documents
Transferable Belief Model on Quantum Circuits
WASP: A Weight-Space Approach to Detecting Learned Spuriousness
Enhancing FKG.in: automating Indian food composition analysis
Science Across Languages: Assessing LLM Multilingual Translation of Scientific Papers
Computational Basis of LLM's Decision Making in Social Simulation
DMN-Guided Prompting: A Framework for Controlling LLM Behavior
Axiomatics of Restricted Choices by Linear Orders of Sets with Minimum as Fallback
CP-Bench: Evaluating Large Language Models for Constraint Modelling
Autonomation, Not Automation: Activities and Needs of European Fact-checkers as a Basis for Designing Human-Centered AI Systems
Style Transfer to Calvin and Hobbes comics using Stable Diffusion
Diffusion on language model encodings for protein sequence generation
MTP: A Meaning-Typed Language Abstraction for AI-Integrated Programming
Unisolver: PDE-Conditional Transformers Towards Universal Neural PDE Solvers
Long Input Sequence Network for Long Time Series Forecasting
AutoPETIII: The Tracer Frontier. What Frontier?
Learning from 10 Demos: Generalisable and Sample-Efficient Policy Learning with Oriented Affordance Frames
Robust training of implicit generative models for multivariate and heavy-tailed distributions with an invariant statistical loss
Quantifying Calibration Error in Neural Networks Through Evidence-Based Theory
Kolb-Based Experiential Learning for Generalist Agents with Human-Level Kaggle Data Science Performance
ACING: Actor-Critic for Instruction Learning in Black-Box LLMs
Defending LVLMs Against Vision Attacks through Partial-Perception Supervision
EHVC: Efficient Hierarchical Reference and Quality Structure for Neural Video Coding
MEPG:Multi-Expert Planning and Generation for Compositionally-Rich Image Generation
Simplicity Lies in the Eye of the Beholder: A Strategic Perspective on Controllers in Reactive Synthesis
Enhancing Technical Documents Retrieval for RAG
TAGAL: Tabular Data Generation using Agentic LLM Methods
Attention as an Adaptive Filter
YOLO Ensemble for UAV-based Multispectral Defect Detection in Wind Turbine Components
Crossing the Species Divide: Transfer Learning from Speech to Animal Sounds
VisioFirm: Cross-Platform AI-assisted Annotation Tool for Computer Vision
MAGneT: Coordinated Multi-Agent Generation of Synthetic Multi-Turn Mental Health Counseling Sessions
Learning Active Perception via Self-Evolving Preference Optimization for GUI Grounding
How many patients could we save with LLM priors?
An Empirical Study of Vulnerabilities in Python Packages and Their Detection
Reinforcement Learning for Robust Ageing-Aware Control of Li-ion Battery Systems with Data-Driven Formal Verification
HumAIne-Chatbot: Real-Time Personalized Conversational AI via Reinforcement Learning
Facts Fade Fast: Evaluating Memorization of Outdated Medical Knowledge in Large Language Models
Decoupled Entity Representation Learning for Pinterest Ads Ranking
From Editor to Dense Geometry Estimator
AUDETER: A Large-scale Dataset for Deepfake Audio Detection in Open Worlds
PARCO: Phoneme-Augmented Robust Contextual ASR via Contrastive Entity Disambiguation
Parking Availability Prediction via Fusing Multi-Source Data with A Self-Supervised Learning Enhanced Spatio-Temporal Inverted Transformer
SSGaussian: Semantic-Aware and Structure-Preserving 3D Style Transfer
IPA: An Information-Preserving Input Projection Framework for Efficient Foundation Model Adaptation
No Thoughts Just AI: Biased LLM Recommendations Limit Human Agency in Resume Screening
Towards a Unified View of Large Language Model Post-Training
DEXOP: A Device for Robotic Transfer of Dexterous Human Manipulation
Delta Activations: A Representation for Finetuned Large Language Models
ChronoGraph: A Real-World Graph-Based Multivariate Time Series Dataset
Intelligence Primer
Gravity Well Echo Chamber Modeling With An LLM-Based Confirmation Bias Model
From Leiden to Pleasure Island: The Constant Potts Model for Community Detection as a Hedonic Game
INGRID: Intelligent Generative Robotic Design Using Large Language Models
Meta-Inverse Reinforcement Learning for Mean Field Games via Probabilistic Context Variables
MillGNN: Learning Multi-Scale Lead-Lag Dependencies for Multi-Variate Time Series Forecasting
A Comprehensive Survey on Trustworthiness in Reasoning with Large Language Models
SalientFusion: Context-Aware Compositional Zero-Shot Food Recognition
Peptidomic-Based Prediction Model for Coronary Heart Disease Using a Multilayer Perceptron Neural Network
Reactive In-Air Clothing Manipulation with Confidence-Aware Dense Correspondence and Visuotactile Affordance
Diffusion Generative Models Meet Compressed Sensing, with Applications to Image Data and Financial Time Series
MTQA:Matrix of Thought for Enhanced Reasoning in Complex Question Answering
SelfAug: Mitigating Catastrophic Forgetting in Retrieval-Augmented Generation via Distribution Self-Alignment
SPFT-SQL: Enhancing Large Language Model for Text-to-SQL Parsing by Self-Play Fine-Tuning
VoxRole: A Comprehensive Benchmark for Evaluating Speech-Based Role-Playing Agents
Chest X-ray Pneumothorax Segmentation Using EfficientNet-B4 Transfer Learning in a U-Net Architecture
CANDY: Benchmarking LLMs' Limitations and Assistive Potential in Chinese Misinformation Fact-Checking
Multimodal Feature Fusion Network with Text Difference Enhancement for Remote Sensing Change Detection
Expanding Foundational Language Capabilities in Open-Source LLMs through a Korean Case Study
SAC-MIL: Spatial-Aware Correlated Multiple Instance Learning for Histopathology Whole Slide Image Classification
NeuroBreak: Unveil Internal Jailbreak Mechanisms in Large Language Models
Promptception: How Sensitive Are Large Multimodal Models to Prompts?
RTQA : Recursive Thinking for Complex Temporal Knowledge Graph Question Answering with Large Language Models
Detecting Regional Spurious Correlations in Vision Transformers via Token Discarding
NER Retriever: Zero-Shot Named Entity Retrieval with Type-Aware Embeddings
On Robustness and Reliability of Benchmark-Based Evaluation of LLMs
Neural Video Compression with In-Loop Contextual Filtering and Out-of-Loop Reconstruction Enhancement
Keypoint-based Diffusion for Robotic Motion Planning on the NICOL Robot
RepoDebug: Repository-Level Multi-Task and Multi-Language Debugging Evaluation of Large Language Models
QuesGenie: Intelligent Multimodal Question Generation
AR$^2$: Adversarial Reinforcement Learning for Abstract Reasoning in Large Language Models
Improving Factuality in LLMs via Inference-Time Knowledge Graph Construction
A software security review on Uganda's Mobile Money Services: Dr. Jim Spire's tweets sentiment analysis
The Optimiser Hidden in Plain Sight: Training with the Loss Landscape's Induced Metric
E-ARMOR: Edge case Assessment and Review of Multilingual Optical Character Recognition
treeX: Unsupervised Tree Instance Segmentation in Dense Forest Point Clouds
CEHR-GPT: A Scalable Multi-Task Foundation Model for Electronic Health Records
Breaking the Mirror: Activation-Based Mitigation of Self-Preference in LLM Evaluators
Efficient Virtuoso: A Latent Diffusion Transformer Model for Goal-Conditioned Trajectory Planning
Insights from Gradient Dynamics: Gradient Autoscaled Normalization
LuxDiT: Lighting Estimation with Video Diffusion Transformer
Hierarchical Federated Foundation Models over Wireless Networks for Multi-Modal Multi-Task Intelligence: Integration of Edge Learning with D2D/P2P-Enabled Fog Learning Architectures
From Federated Learning to $\mathbb{X}$-Learning: Breaking the Barriers of Decentrality Through Random Walks
MLSD: A Novel Few-Shot Learning Approach to Enhance Cross-Target and Cross-Domain Stance Detection
Differentiable Entropy Regularization for Geometry and Neural Networks
Sparse Autoencoder Neural Operators: Model Recovery in Function Spaces
Designing Gaze Analytics for ELA Instruction: A User-Centered Dashboard with Conversational AI Support
STA-Net: A Decoupled Shape and Texture Attention Network for Lightweight Plant Disease Classification
ARDO: A Weak Formulation Deep Neural Network Method for Elliptic and Parabolic PDEs Based on Random Differences of Test Functions
Learning an Adversarial World Model for Automated Curriculum Generation in MARL
Natural Latents: Latent Variables Stable Across Ontologies
What Fundamental Structure in Reward Functions Enables Efficient Sparse-Reward Learning?
SiLVERScore: Semantically-Aware Embeddings for Sign Language Generation Evaluation
SAMVAD: A Multi-Agent System for Simulating Judicial Deliberation Dynamics in India
Measuring How (Not Just Whether) VLMs Build Common Ground
Align-then-Slide: A complete evaluation framework for Ultra-Long Document-Level Machine Translation
Multilevel Analysis of Cryptocurrency News using RAG Approach with Fine-Tuned Mistral Large Language Model
Multimodal Proposal for an AI-Based Tool to Increase Cross-Assessment of Messages
Real-Time Detection of Hallucinated Entities in Long-Form Generation
A Multidimensional AI-powered Framework for Analyzing Tourist Perception in Historic Urban Quarters: A Case Study in Shanghai
Continuous Monitoring of Large-Scale Generative AI via Deterministic Knowledge Graph Structures
Expedition & Expansion: Leveraging Semantic Representations for Goal-Directed Exploration in Continuous Cellular Automata
FaMA: LLM-Empowered Agentic Assistant for Consumer-to-Consumer Marketplace
A Foundation Model for Chest X-ray Interpretation with Grounded Reasoning via Online Reinforcement Learning
Handling Infinite Domain Parameters in Planning Through Best-First Search with Delayed Partial Expansions
World Model Implanting for Test-time Adaptation of Embodied Agents
Meta-Policy Reflexion: Reusable Reflective Memory and Rule Admissibility for Resource-Efficient LLM Agent
AutoPBO: LLM-powered Optimization for Local Search PBO Solvers
CoT-Space: A Theoretical Framework for Internal Slow-Thinking via Reinforcement Learning
Oruga: An Avatar of Representational Systems Theory
Intermediate Languages Matter: Formal Languages and LLMs affect Neurosymbolic Reasoning
Hybrid Reinforcement Learning and Search for Flight Trajectory Planning
Analysis of Bluffing by DQN and CFR in Leduc Hold'em Poker
The human biological advantage over AI
Towards an Action-Centric Ontology for Cooking Procedures Using Temporal Graphs
Domain size asymptotics for Markov logic networks
Evaluating Quality of Gaming Narratives Co-created with AI
EvoEmo: Towards Evolved Emotional Policies for LLM Agents in Multi-Turn Negotiation
Improving Robustness of AlphaZero Algorithms to Test-Time Environment Changes
Psychologically Enhanced AI Agents
ArcMemo: Abstract Reasoning Composition with Lifelong LLM Memory
BiND: A Neural Discriminator-Decoder for Accurate Bimanual Trajectory Prediction in Brain-Computer Interfaces
Speech-Based Cognitive Screening: A Systematic Evaluation of LLM Adaptation Strategies
PG-Agent: An Agent Powered by Page Graph
Multilinear and Linear Programs for Partially Identifiable Queries in Quasi-Markovian Structural Causal Models
Diffusion-RL Based Air Traffic Conflict Detection and Resolution Method
Learning When to Plan: Efficiently Allocating Test-Time Compute for LLM Agents
Explainable Knowledge Graph Retrieval-Augmented Generation (KG-RAG) with KG-SMILE
CausalARC: Abstract Reasoning with Causal World Models
Towards a Neurosymbolic Reasoning System Grounded in Schematic Representations
Emergent Hierarchical Reasoning in LLMs through Reinforcement Learning
An Empirical Evaluation of Factors Affecting SHAP Explanation of Time Series Classification
PersonaTeaming: Exploring How Introducing Personas Can Improve Automated AI Red-Teaming
The Personality Illusion: Revealing Dissociation Between Self-Reports & Behavior in LLMs
Are LLM Agents Behaviorally Coherent? Latent Profiles for Social Simulation
RAGuard: A Novel Approach for in-context Safe Retrieval Augmented Generation for LLMs
Leveraging LLM-Based Agents for Intelligent Supply Chain Planning
Learning to Deliberate: Meta-policy Collaboration for Agentic LLMs with Multi-agent Reinforcement Learning
What Would an LLM Do? Evaluating Policymaking Capabilities of Large Language Models
An Agentic Model Context Protocol Framework for Medical Concept Standardization

Research Sources: 423 | Generated: 9/5/2025