AI Research News Feeds for September 11th, 2025

AI RESEARCH PAPERS & ACADEMIC SOURCES

Adversarial Attacks Against Automated Fact-Checking: A Survey
Alternating Minimization Schemes for Computing Rate-Distortion-Perception Functions with $f$-Divergence Perception Constraints
A Survey of World Models for Autonomous Driving
Event Camera Meets Resource-Aware Mobile Computing: Abstraction, Algorithm, Acceleration, Application
D\'ej\`a Vu: Efficient Video-Language Query Engine with Learning-based Inter-Frame Computation Reuse
Bias in the Loop: How Humans Evaluate AI-Generated Suggestions
A transport approach to the cutoff phenomenon
On the Sample Complexity of Set Membership Estimation for Linear Systems with Disturbances Bounded by Convex Sets
Identification and Estimation of Simultaneous Equation Models Using Higher-Order Cumulant Restrictions
RewardDance: Reward Scaling in Visual Generation
SAFT: Shape and Appearance of Fabrics from Template via Differentiable Physical Simulations from Monocular Video
Foundation Models for Autonomous Driving Perception: A Survey Through Core Capabilities
Physics-Guided Rectified Flow for Low-light RAW Image Enhancement
Good Deep Features to Track: Self-Supervised Feature Extraction and Tracking in Visual Odometry
CNN-ViT Hybrid for Pneumonia Detection: Theory and Empiric on Limited Data without Pretraining
X-Part: high fidelity and structure coherent shape decomposition
SocialNav-SUB: Benchmarking VLMs for Scene Understanding in Social Robot Navigation
Learning Robust Representations via Bidirectional Transition for Visual Reinforcement Learning
Sigma: Siamese Mamba Network for Multi-Modal Semantic Segmentation
Vision Transformer with Sparse Scan Prior
Have Large Vision-Language Models Mastered Art History?
A Chinese Continuous Sign Language Dataset Based on Complex Environments
ALOcc: Adaptive Lifting-Based 3D Semantic Occupancy and Cost Volume-Based Flow Predictions
GloFinder: AI-empowered QuPath Plugin for WSI-level Glomerular Detection, Visualization, and Curation
TextSSR: Diffusion-based Data Synthesis for Scene Text Recognition
F-Bench: Rethinking Human Preference Evaluation Metrics for Benchmarking Face Generation, Customization, and Restoration
GNF: Gaussian Neural Fields for Multidimensional Signal Representation and Reconstruction
Towards properties of adversarial image perturbations
CamC2V: Context-aware Controllable Video Generation
Physics-Driven Local-Whole Elastic Deformation Modeling for Point Cloud Representation Learning
GenFlow: Interactive Modular System for Image Generation
InsFusion: Rethink Instance-level LiDAR-Camera Fusion for 3D Object Detection
Bitrate-Controlled Diffusion for Disentangling Motion and Content in Video
VRAE: Vertical Residual Autoencoder for License Plate Denoising and Deblurring
Beyond Distribution Shifts: Adaptive Hyperspectral Image Classification at Test Time
First-order State Space Model for Lightweight Image Super-resolution
Maximally Useful and Minimally Redundant: The Key to Self Supervised Learning for Imbalanced Data
Chirality in Action: Time-Aware Video Representation Learning by Latent Straightening
HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning
ViewSparsifier: Killing Redundancy in Multi-View Plant Phenotyping
Vision-Language Semantic Aggregation Leveraging Foundation Model for Generalizable Medical Image Segmentation
Improving Greenland Bed Topography Mapping with Uncertainty-Aware Graph Learning on Sparse Radar Data
EfficientIML: Efficient High-Resolution Image Manipulation Localization
CLAPS: A CLIP-Unified Auto-Prompt Segmentation for Multi-Modal Retinal Imaging
AdsQA: Towards Advertisement Video Understanding
LADB: Latent Aligned Diffusion Bridges for Semi-Supervised Domain Translation
FractalPINN-Flow: A Fractal-Inspired Network for Unsupervised Optical Flow Estimation with Total Variation Regularization
Multi-Modal Robust Enhancement for Coastal Water Segmentation: A Systematic HSV-Guided Framework
Computational Imaging for Enhanced Computer Vision
BcQLM: Efficient Vision-Language Understanding with Distilled Q-Gated Cross-Modal Fusion
CrowdQuery: Density-Guided Query Module for Enhanced 2D and 3D Detection in Crowded Scenes
ArgoTweak: Towards Self-Updating HD Maps through Structured Priors
Quantifying Accuracy of an Event-Based Star Tracker via Earth's Rotation
Handling Multiple Hypotheses in Coarse-to-Fine Dense Image Matching
GeneVA: A Dataset of Human Annotations for Generative Text to Video Artifacts
DomainCQA: Crafting Knowledge-Intensive QA from Domain-Specific Charts
Self-Correcting Decoding with Generative Feedback for Mitigating Hallucinations in Large Vision-Language Models
An Explainable Deep Neural Network with Frequency-Aware Channel and Spatial Refinement for Flood Prediction in Sustainable Cities
Two Stage Context Learning with Large Language Models for Multimodal Stance Detection on Climate Change
Lightweight Deep Unfolding Networks with Enhanced Robustness for Infrared Small Target Detection
Sparse Transformer for Ultra-sparse Sampled Video Compressive Sensing
GTA-Crime: A Synthetic Dataset and Generation Framework for Fatal Violence Detection with Adversarial Snippet-Level Domain Adaptation
Symmetry Interactive Transformer with CNN Framework for Diagnosis of Alzheimer's Disease Using Structural MRI
EVDI++: Event-based Video Deblurring and Interpolation via Self-Supervised Learning
Hyperspectral Mamba for Hyperspectral Object Tracking
Examining Vision Language Models through Multi-dimensional Experiments with Vision and Text Features
Generalized Zero-Shot Learning for Point Cloud Segmentation with Evidence-Based Dynamic Calibration
Dual-Thresholding Heatmaps to Cluster Proposals for Weakly Supervised Object Detection
An Open Benchmark Dataset for GeoAI Foundation Models for Oil Palm Mapping in Indonesia
SimCroP: Radiograph Representation Learning with Similarity-driven Cross-granularity Pre-training
Boosted Training of Lightweight Early Exits for Optimizing CNN Image Classification Inference
AntiDote: Bi-level Adversarial Training for Tamper-Resistant LLMs
SciGPT: A Large Language Model for Scientific Literature Understanding and Knowledge Discovery
No for Some, Yes for Others: Persona Prompts and Other Sources of False Refusal in Language Models
Culturally transmitted color categories in LLMs reflect a learning bias toward efficient compression
MERLIN: Multi-Stage Curriculum Alignment for Multilingual Encoder and LLM Fusion
Verbalized Algorithms
Towards Knowledge-Aware Document Systems: Modeling Semantic Coverage Relations via Answerability Detection
CommonVoice-SpeechRE and RPG-MoGe: Advancing Speech Relation Extraction with a New Dataset and Multi-Order Generative Framework
Acquiescence Bias in Large Language Models
Simulating Identity, Propagating Bias: Abstraction and Stereotypes in LLM-Generated Text
Too Helpful, Too Harmless, Too Honest or Just Right?
CM-Align: Consistency-based Multilingual Alignment for Large Language Models
LLM Ensemble for RAG: Role of Context Length in Zero-Shot Question Answering for BioASQ Challenge
Streaming Sequence-to-Sequence Learning with Delayed Streams Modeling
Do All Autoregressive Transformers Remember Facts the Same Way? A Cross-Architecture Analysis of Recall Mechanisms
Evaluating LLMs Without Oracle Feedback: Agentic Annotation Evaluation Through Unsupervised Consistency Signals
Building High-Quality Datasets for Portuguese LLMs: From Common Crawl Snapshots to Industrial-Grade Corpora
Calibrating MLLM-as-a-judge via Multimodal Bayesian Prompt Ensembles
Baba Is AI: Break the Rules to Beat the Benchmark
TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks
MedS$^3$: Towards Medical Slow Thinking with Self-Evolved Soft Dual-sided Process Supervision
REGen: A Reliable Evaluation Framework for Generative Event Argument Extraction
Maximizing Information in Domain-Invariant Representation Improves Transfer Learning
Damped Proximal Augmented Lagrangian Method for weakly-Convex Problems with Convex Constraints
A single-loop SPIDER-type stochastic subgradient method for expectation-constrained nonconvex nonsmooth optimization
Accelerating Hamiltonian Monte Carlo for Bayesian Inference in Neural Networks and Neural Operators
Reward function compression facilitates goal-dependent reinforcement learning
Gaussian Process Regression -- Neural Network Hybrid with Optimized Redundant Coordinates
Deep Unrolling of Sparsity-Induced RDO for 3D Point Cloud Attribute Coding
Calibrating Transformers via Sparse Gaussian Processes
Generative Example-Based Explanations: Bridging the Gap between Generative Modeling and Explainability
MDDM: A Molecular Dynamics Diffusion Model to Predict Particle Self-Assembly
Investigating Compositional Reasoning in Time Series Foundation Models
Beyond Cosine Decay: On the effectiveness of Infinite Learning Rate Schedule for Continual Pre-training
Task-based Loss Functions in Computer Vision: A Comprehensive Review
Training Deep Morphological Neural Networks as Universal Approximators
Data-driven generative simulation of SDEs using diffusion models
ChemBOMAS: Accelerated BO in Chemistry with LLM-Enhanced Multi-Agent System
PracMHBench: Re-evaluating Model-Heterogeneous Federated Learning Based on Practical Edge Device Constraints
Fourier Learning Machines: Nonharmonic Fourier-Based Neural Networks for Scientific Machine Learning
ADHDeepNet From Raw EEG to Diagnosis: Improving ADHD Diagnosis through Temporal-Spatial Processing, Adaptive Attention Mechanisms, and Explainability in Raw EEG Signals
A Survey of TinyML Applications in Beekeeping for Hive Monitoring and Management
STROKEVISION-BENCH: A Multimodal Video And 2D Pose Benchmark For Tracking Stroke Recovery
Video Parallel Scaling: Aggregating Diverse Frame Subsets for VideoLLMs
Forecasting Generative Amplification
SCA-LLM: Spectral-Attentive Channel Prediction with Large Language Models in MIMO-OFDM
Bias after Prompting: Persistent Discrimination in Large Language Models
Generative Quasi-Continuum Modeling of Confined Fluids at the Nanoscale
RepViT-CXR: A Channel Replication Strategy for Vision Transformers in Chest X-ray Tuberculosis and Pneumonia Classification
LLM-Guided Ans\"atze Design for Quantum Circuit Born Machines in Financial Generative Modeling
LD-ViCE: Latent Diffusion Model for Video Counterfactual Explanations
Behind the Scenes: Mechanistic Interpretability of LoRA-adapted Whisper for Speech Emotion Recognition
Compressing CNN models for resource-constrained systems by channel and layer pruning
Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing
Machine Learning with Multitype Protected Attributes: Intersectional Fairness through Regularisation
The Domain Mixed Unit: A New Neural Arithmetic Layer
Selective Induction Heads: How Transformers Select Causal Structures In Context
ArtifactGen: Benchmarking WGAN-GP vs Diffusion for Label-Aware EEG Artifact Synthesis
Mitigating Catastrophic Forgetting in Large Language Models with Forgetting-aware Pruning
Adaptive Rainfall Forecasting from Multiple Geographical Models Using Matrix Profile and Ensemble Learning
EvolKV: Evolutionary KV Cache Compression for LLM Inference
Rethinking the Backbone in Class Imbalanced Federated Source Free Domain Adaptation: The Utility of Vision Foundation Models
Two Sides of the Same Optimization Coin: Model Degradation and Representation Collapse in Graph Foundation Models
An Interpretable Deep Learning Model for General Insurance Pricing
SHAining on Process Mining: Explaining Event Log Characteristics Impact on Algorithms
Modified Loss of Momentum Gradient Descent: Fine-Grained Analysis
Heart Disease Prediction: A Comparative Study of Optimisers Performance in Deep Neural Networks
Towards Interpretable Deep Neural Networks for Tabular Data
Generative Data Refinement: Just Ask for Better Data
Replicable Reinforcement Learning with Linear Function Approximation
Machine Learning-Based Prediction of Speech Arrest During Direct Cortical Stimulation Mapping
Stopping Criteria for Value Iteration on Concurrent Stochastic Reachability and Safety Games
Whose Name Comes Up? Auditing LLM-Based Scholar Recommendations
From Static to Adaptive Defense: Federated Multi-Agent Deep Reinforcement Learning-Driven Moving Target Defense Against DoS Attacks in UAV Swarm Networks
VIDEE: Visual and Interactive Decomposition, Execution, and Evaluation of Text Analytics with Intelligent Agents
A Nonlinear Low-rank Representation Model with Convolutional Neural Network for Imputing Water Quality Data
Multi-Timescale Hierarchical Reinforcement Learning for Unified Behavior and Control of Autonomous Driving
CyberRAG: An Agentic RAG cyber attack classification and reporting tool
Comprehensive Evaluation of Prototype Neural Networks
Hammer and Anvil: A Principled Defense Against Backdoors in Federated Learning
In-Context Learning Enhanced Credibility Transformer
MMM-fair: An Interactive Toolkit for Exploring and Operationalizing Multi-Fairness Trade-offs
FedComLoc: Communication-Efficient Distributed Training of Sparse and Quantized Models
A Transformer approach for Electricity Price Forecasting
Generative AI for Data Augmentation in Wireless Networks: Analysis, Applications, and Case Study
QR-VC: Leveraging Quantization Residuals for Linear Disentanglement in Zero-Shot Voice Conversion
Traffic-Rule-Compliant Trajectory Repair via Satisfiability Modulo Theories and Reachability Analysis
Mind the Value-Action Gap: Do LLMs Act in Alignment with Their Values?
CoAT: Chain-of-Associated-Thoughts Framework for Enhancing Large Language Models Reasoning
MPO: Boosting LLM Agents with Meta Plan Optimization
To See a World in a Spark of Neuron: Disentangling Multi-task Interference for Training-free Model Merging
LED: LLM Enhanced Open-Vocabulary Object Detection without Human Curated Data Generation
VIPER: Visual Perception and Explainable Reasoning for Sequential Decision-Making
A decision-theoretic approach to dealing with uncertainty in quantum mechanics
Recursive Training Loops in LLMs: How training data properties modulate distribution shift in generated data?
TerraMind: Large-Scale Generative Multimodality for Earth Observation
CoT-RAG: Integrating Chain of Thought and Retrieval-Augmented Generation to Enhance Reasoning in Large Language Models
Reasoning Large Language Model Errors Arise from Hallucinating Critical Problem Features
Prior Prompt Engineering for Reinforcement Fine-Tuning
Your Language Model Can Secretly Write Like Humans: Contrastive Paraphrase Attacks on LLM-Generated Text Detectors
Explainability of CNN Based Classification Models for Acoustic Signal
X-Teaming Evolutionary M2S: Automated Discovery of Multi-turn to Single-turn Jailbreak Templates
Learning Turbulent Flows with Generative Models: Super-resolution, Forecasting, and Sparse Flow Reconstruction
AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning
Using AI to Optimize Patient Transfer and Resource Utilization During Mass-Casualty Incidents: A Simulation Platform
An End-to-End Deep Learning Framework for Arsenicosis Diagnosis Using Mobile-Captured Skin Images
PianoVAM: A Multimodal Piano Performance Dataset
Scaling Truth: The Confidence Paradox in AI Fact-Checking
Large Language Model Hacking: Quantifying the Hidden Risks of Using LLMs for Text Annotation
A Survey of Reinforcement Learning for Large Reasoning Models
Associative Knowledge Graphs for Efficient Sequence Storage and Retrieval
Perovskite-LLM: Knowledge-Enhanced Large Language Models for Perovskite Solar Cell Research
PQMass: Probabilistic Assessment of the Quality of Generative Models using Probability Mass Estimation
Accelerating Mixture-of-Expert Inference with Adaptive Expert Split Mechanism
Toward Subtrait-Level Model Explainability in Automated Writing Evaluation
<think> So let's replace this phrase with insult... </think> Lessons learned from generation of toxic texts with LLMs
An Iterative LLM Framework for SIBT utilizing RAG-based Adaptive Weight Optimization
Spherical Brownian Bridge Diffusion Models for Conditional Cortical Thickness Forecasting
Prompt-Driven Image Analysis with Multimodal Generative AI: Detection, Segmentation, Inpainting, and Interpretation
Send to which account? Evaluation of an LLM-based Scambaiting System
HumanAgencyBench: Scalable Evaluation of Human Agency Support in AI Assistants
Variational Rank Reduction Autoencoders for Generative
Agents of Discovery
Interpretability as Alignment: Making Internal Understanding a Design Principle
Memorization in Large Language Models in Medicine: Prevalence, Characteristics, and Implications
Classification of 24-hour movement behaviors from wrist-worn accelerometer data: from handcrafted features to deep learning techniques
UOPSL: Unpaired OCT Predilection Sites Learning for Fundus Image Diagnosis Augmentation
Architecting Resilient LLM Agents: A Guide to Secure Plan-then-Execute Implementations
Performance Assessment Strategies for Generative AI Applications in Healthcare
APML: Adaptive Probabilistic Matching Loss for Robust 3D Point Cloud Reconstruction
Domain Knowledge is Power: Leveraging Physiological Priors for Self Supervised Representation Learning in Electrocardiography
From Limited Data to Rare-event Prediction: LLM-powered Feature Engineering and Multi-model Learning in Venture Capital
Diffusion-Guided Multi-Arm Motion Planning
Quadrotor Navigation using Reinforcement Learning with Privileged Information
XML Prompting as Grammar-Constrained Interaction: Fixed-Point Semantics, Convergence Guarantees, and Human-AI Protocols
Accelerating AI Development with Cyber Arenas
Componentization: Decomposing Monolithic LLM Responses into Manipulable Semantic Units
Strategies for Improving Communication Efficiency in Distributed and Federated Learning: Compression, Local Training, and Personalization
Combined-distance-based score function of cognitive fuzzy sets and its application in lung cancer pain evaluation
A Systematic Survey on Large Language Models for Evolutionary Optimization: From Modeling to Solving
Segment Transformer: AI-Generated Music Detection via Music Structural Analysis
Game-Theoretic Resilience Framework for Cyber-Physical Microgrids using Multi-Agent Reinforcement Learning
Accelerating Reinforcement Learning Algorithms Convergence using Pre-trained Large Language Models as Tutors With Advice Reusing
Retrieval-Augmented VLMs for Multimodal Melanoma Diagnosis
EnvX: Agentize Everything with Agentic AI
Trust Semantics Distillation for Collaborator Selection via Memory-Augmented Agentic AI
Exploratory Retrieval-Augmented Planning For Continual Embodied Instruction Following
Leveraging AI Agents for Autonomous Networks: A Reference Architecture and Empirical Studies
Co-Investigator AI: The Rise of Agentic AI for Smarter, Trustworthy AML Compliance Narratives
No-Knowledge Alarms for Misaligned LLMs-as-Judges
Automatic Failure Attribution and Critical Step Prediction Method for Multi-Agent Systems Based on Causal Inference
The More You Automate, the Less You See: Hidden Pitfalls of AI Scientist Systems
Narrative-Guided Reinforcement Learning: A Platform for Studying Language Model Influence on Decision Making
ToDMA: Large Model-Driven Token-Domain Multiple Access for Semantic Communications
Expert-Guided Explainable Few-Shot Learning for Medical Image Diagnosis
A New Dataset and Benchmark for Grounding Multimodal Misinformation
The Law-Following AI Framework: Legal Foundations and Technical Constraints. Legal Analogues for AI Actorship and technical feasibility of Law Alignment
Measuring and mitigating overreliance is necessary for building human-compatible AI
Validation of a CT-brain analysis tool for measuring global cortical atrophy in older patient cohorts
MVPBench: A Benchmark and Fine-Tuning Framework for Aligning Large Language Models with Diverse Human Values
NOWJ@COLIEE 2025: A Multi-stage Framework Integrating Embedding Models and Large Language Models for Legal Retrieval and Entailment

Research Sources: 232 | Generated: 9/11/2025