AI Research News Feeds for August 22nd, 2025

AI RESEARCH PAPERS & ACADEMIC SOURCES

Net zero needs AI — five actions to realize its promise
GraphVelo allows for accurate inference of multimodal velocities and molecular mechanisms for single cells
Variable selection for minimum-variance portfolios
Sampling by averaging: A multiscale approach to score estimation
A Unified Framework for Inference with General Missingness Patterns and Machine Learning Imputation
Multiply Robust Conformal Risk Control with Coarsened Data
On Prior Distributions for Orthogonal Function Sequences
Boundary Detection Algorithm Inspired by Locally Linear Embedding
CaLiV: LiDAR-to-Vehicle Calibration of Arbitrary Sensor Setups
Handle-based Mesh Deformation Guided By Vision Language Model
Bidirectional Temporal Information Propagation for Moving Infrared Small Target Detection
A Curated Dataset and Deep Learning Approach for Minor Dent Detection in Vehicles
Aligning Moments in Time using Video Queries
Enhancing Novel View Synthesis from extremely sparse views with SfM-free 3D Gaussian Splatting Framework
MExECON: Multi-view Extended Explicit Clothed humans Optimized via Normal integration
Task-Generalized Adaptive Cross-Domain Learning for Multimodal Image Fusion
ExtraGS: Geometric-Aware Trajectory Extrapolation with Uncertainty-Guided Generative Priors
Multi-Object Sketch Animation with Grouping and Motion Trajectory Priors
D3FNet: A Differential Attention Fusion Network for Fine-Grained Road Structure Extraction in Remote Perception Systems
High-Frequency First: A Two-Stage Approach for Improving Image INR
Fast globally optimal Truncated Least Squares point cloud registration with fixed rotation axis
Multi-perspective monitoring of wildlife and human activities from camera traps and drones with deep learning models
When and What: Diffusion-Grounded VideoLLM with Entity Aware Segmentation for Long Video Understanding
Weakly-Supervised Learning for Tree Instances Segmentation in Airborne Lidar Point Clouds
MapKD: Unlocking Prior Knowledge with Cross-Modal Distillation for Efficient Online HD Map Construction
CM2LoD3: Reconstructing LoD3 Building Models Using Semantic Conflict Maps
LLM-empowered Dynamic Prompt Routing for Vision-Language Models Tuning under Long-Tailed Distributions
WorldWeaver: Generating Long-Horizon Video Worlds via Rich Perception
Fine-grained Multi-class Nuclei Segmentation with Molecular-empowered All-in-SAM Model
Waver: Wave Your Way to Lifelike Video Generation
ATLAS: Decoupling Skeletal and Shape Parameters for Expressive Parametric Human Modeling
Visual Autoregressive Modeling for Instruction-Guided Image Editing
CineScale: Free Lunch in High-Resolution Cinematic Visual Generation
Scalable FPGA Framework for Real-Time Denoising in High-Throughput Imaging: A DRAM-Optimized Pipeline using High-Level Synthesis
adder-viz: Real-Time Visualization Software for Transcoding Event Video
Scalable Event-Based Video Streaming for Machines with MoQ
Zero-shot Volumetric CT Super-Resolution using 3D Gaussian Splatting with Upsampled 2D X-ray Projection Priors
Pathology-Informed Latent Diffusion Model for Anomaly Detection in Lymph Node Metastasis
Lang2Lift: A Framework for Language-Guided Pallet Detection and Pose Estimation Integrated in Autonomous Outdoor Forklift Operation
On the Effectiveness of Graph Reordering for Accelerating Approximate Nearest Neighbor Search on GPU
DoSReMC: Domain Shift Resilient Mammography Classification using Batch Normalization Adaptation
Self-supervised physics-informed generative networks for phase retrieval from a single X-ray hologram
Deep Equilibrium Convolutional Sparse Coding for Hyperspectral Image Denoising
Hessian-based lightweight neural network for brain vessel segmentation on a minimal training dataset
Translating Images to Road Network: A Sequence-to-Sequence Perspective
RESfM: Robust Deep Equivariant Structure from Motion
Learning Motion Blur Robust Vision Transformers for Real-Time UAV Tracking
Vulnerabilities in AI-generated Image Detection: The Challenge of Adversarial Attacks
TripleMixer: A 3D Point Cloud Denoising Model for Adverse Weather
3DGS-LM: Faster Gaussian-Splatting Optimization with Levenberg-Marquardt
Hadamard Attention Recurrent Transformer: A Strong Baseline for Stereo Matching Transformer
Cross multiscale vision transformer for deep fake detection
BannerAgency: Advertising Banner Design with Multimodal LLM Agents
Neuro Symbolic Knowledge Reasoning for Procedural Video Question Answering
TrackID3x3: A Dataset and Algorithm for Multi-Player Tracking with Identification and Pose Estimation in 3x3 Basketball Full-court Videos
Understanding Co-speech Gestures in-the-wild
Omni²: Unifying Omnidirectional Image Generation and Editing in an Omni Model
FastMap: Revisiting Structure from Motion through First-Order Optimization
Creating a Historical Migration Dataset from Finnish Church Records, 1800-1920
Referring Expression Instance Retrieval and A Strong End-to-End Baseline
Omni-Video: Democratizing Unified Video Understanding and Generation
Capturing Stable HDR Videos Using a Dual-Camera System
Hybrid Autoregressive-Diffusion Model for Real-Time Streaming Sign Language Production
Fast-DDPM: Fast Denoising Diffusion Probabilistic Models for Medical Image-to-Image Generation
Parallel transport on matrix manifolds and Exponential Action
NucleiMix: Realistic Data Augmentation for Nuclei Instance Segmentation
Physics-Driven Autoregressive State Space Models for Medical Image Reconstruction
A Study of Privacy-preserving Language Modeling Approaches
M-HELP: Using Social Media Data to Detect Mental Health Help-Seeking Signals
Principle Methods of Rendering Non-equivalent Words from Uzbek and Dari to Russian and English
PyTOD: Programmable Task-Oriented Dialogue with Execution Feedback
SLM4Offer: Personalized Marketing Offer Generation Using Contrastive Learning Based Fine-Tuning
SLM-Bench: A Comprehensive Benchmark of Small Language Models on Environmental Impacts -- Extended Version
HebID: Detecting Social Identities in Hebrew-language Political Text
Dream 7B: Diffusion Large Language Models
The Enemy from Within: A Study of Political Delegitimization Discourse in Israeli Political Speech
SafetyFlow: An Agent-Flow System for Automated LLM Safety Benchmarking
SDGO: Self-Discrimination-Guided Optimization for Consistent Safety in Large Language Models
Position Bias Mitigates Position Bias: Mitigate Position Bias Through Inter-Position Knowledge Distillation
Stemming -- The Evolution and Current State with a Focus on Bangla
Robust Symbolic Reasoning for Visual Narratives via Hierarchical and Semantically Normalized Knowledge Graphs
LLMs and Agentic AI in Insurance Decision-Making: Opportunities and Challenges For Africa
Retrieval-Augmented Review Generation for Poisoning Recommender Systems
AmbiSQL: Interactive Ambiguity Detection and Resolution for Text-to-SQL
Adversarial Attacks against Neural Ranking Models via In-Context Learning
On the Role of Entity and Event Level Conceptualization in Generalizable Reasoning: A Survey of Tasks, Methods, Applications, and Future Directions
Everybody Likes to Sleep: A Computer-Assisted Comparison of Object Naming Data from 30 Languages
Pub-Guard-LLM: Detecting Retracted Biomedical Articles with Reliable Explanations
Robust Bias Detection in MLMs and its Application to Human Trait Ratings
Advancing the Database of Cross-Linguistic Colexifications with New Workflows and Data
Leveraging Large Language Models for Explainable Activity Recognition in Smart Homes: A Critical Evaluation
WebEvolver: Enhancing Web Agent Self-Improvement with Coevolving World Model
One-shot Entropy Minimization
AraReasoner: Evaluating Reasoning-Based LLMs for Arabic NLP
SAND: Boosting LLM Agents with Self-Taught Action Deliberation
Versatile Framework for Song Generation with Prompt-based Control
You Only Pose Once: A Minimalist's Detection Transformer for Monocular RGB Category-level 9D Multi-Object Pose Estimation
Paired-Sampling Contrastive Framework for Joint Physical-Digital Face Attack Detection
GasTwinFormer: A Hybrid Vision Transformer for Livestock Methane Emission Segmentation and Dietary Classification in Optical Gas Imaging
CurveFlow: Curvature-Guided Flow Matching for Image Generation
HiRQA: Hierarchical Ranking and Quality Alignment for Opinion-Unaware Image Quality Assessment
Reliable Multi-view 3D Reconstruction for 'Just-in-time' Edge Environments
XDR-LVLM: An Explainable Vision-Language Large Model for Diabetic Retinopathy Diagnosis
MeSS: City Mesh-Guided Outdoor Scene Generation with Cross-View Consistent Diffusion
Adversarial Agent Behavior Learning in Autonomous Driving Using Deep Reinforcement Learning
DyMorph-B2I: Dynamic and Morphology-Guided Binary-to-Instance Segmentation for Renal Pathology
STAGNet: A Spatio-Temporal Graph and LSTM Framework for Accident Anticipation
Collaborative Multi-Modal Coding for High-Quality 3D Generation
Center-Oriented Prototype Contrastive Clustering
AeroDuo: Aerial Duo for UAV-based Vision and Language Navigation
Comp-X: On Defining an Interactive Learned Image Compression Paradigm With Expert-driven LLM Agent
Normal and Abnormal Pathology Knowledge-Augmented Vision-Language Model for Anomaly Detection in Pathology Images
RATopo: Improving Lane Topology Reasoning via Redundancy Assignment
TPA: Temporal Prompt Alignment for Fetal Congenital Heart Defect Classification
BasketLiDAR: The First LiDAR-Camera Multimodal Dataset for Professional Basketball MOT
RCDINO: Enhancing Radar-Camera 3D Object Detection with DINOv2 Semantic Features
An Empirical Study on How Video-LLMs Answer Video Questions
Transfer learning optimization based on evolutionary selective fine tuning
DriveSplat: Decoupled Driving Scene Reconstruction with Geometry-enhanced Partitioned Neural Gaussians
DIO: Refining Mutual Information and Causal Chain to Enhance Machine Abstract Reasoning Ability
Spiking Variational Graph Representation Inference for Video Summarization
From Linearity to Non-Linearity: How Masked Autoencoders Capture Spatial Correlations
Attribution, Citation, and Quotation: A Survey of Evidence-based Text Generation with Large Language Models
Preliminary Ranking of WMT25 General Machine Translation Systems
Bridging the Culture Gap: A Framework for LLM-Driven Socio-Cultural Localization of Math Word Problems in Low-Resource Languages
Improving LLMs for Machine Translation Using Synthetic Preference Data
Multilingual Datasets for Custom Input Extraction and Explanation Requests Parsing in Conversational XAI Systems
Reward-Shifted Speculative Sampling Is An Efficient Test-Time Weak-to-Strong Aligner
Identifying and Answering Questions with False Assumptions: An Interpretable Approach
ContextualLVLM-Agent: A Holistic Framework for Multi-Turn Visually-Grounded Dialogue and Complex Instruction Following
Fin-PRM: A Domain-Specialized Process Reward Model for Financial Reasoning in Large Language Models
Select to Know: An Internal-External Knowledge Self-Selection Framework for Domain-Specific Question Answering
Self-Guided Function Calling in Large Language Models via Stepwise Experience Recall
Are Checklists Really Useful for Automatic Evaluation of Generative Tasks?
WangchanThaiInstruct: An instruction-following Dataset for Culture-Aware, Multitask, and Multi-domain Evaluation in Thai
UniCoM: A Universal Code-Switching Speech Generator
EMNLP: Educator-role Moral and Normative Large Language Models Profiling
TComQA: Extracting Temporal Commonsense from Text
KG-EDAS: A Meta-Metric Framework for Evaluating Knowledge Graph Completion Models
A Survey on Large Language Model Benchmarks
Confidence-Modulated Speculative Decoding for Large Language Models
Tree-like Pairwise Interaction Networks
Effect Identification and Unit Categorization in the Multi-Score Regression Discontinuity Design with Application to LED Manufacturing
End-to-End Analysis of Charge Stability Diagrams with Transformers
Exploring the Landscape of Non-Equilibrium Memories with Neural Cellular Automata
Scaling Group Inference for Diverse and High-Quality Generation
Robust Sparse Mean Estimation via Incremental Learning
A mathematical perspective on Transformers
Contextual Bandits with Stage-wise Constraints
Wasserstein Distributionally Robust Shallow Convex Neural Networks
Scalable Time-Series Causal Discovery with Approximate Causal Ordering
MATATA: Weakly Supervised End-to-End MAthematical Tool-Augmented Reasoning for Tabular Applications
The Complexity Dynamics of Grokking
Faster Convergence of Riemannian Stochastic Gradient Descent with Increasing Batch Size
Deceptive Sequential Decision-Making via Regularized Policy Optimization
MaskSDM with Shapley values to improve flexibility, robustness, and explainability in species distribution modeling
Pairwise or Pointwise? Evaluating Feedback Protocols for Bias in LLM-Based Evaluation
Bayes Error Rate Estimation in Difficult Situations
Multi-Exit Kolmogorov-Arnold Networks: enhancing accuracy and parsimony
Exploring Modularity of Agentic Systems for Drug Discovery
Physics-Informed Neural Networks with Hard Nonlinear Equality and Inequality Constraints
Causal Modelling of Cryptocurrency Price Movements Using Discretisation-Aware Bayesian Networks
Neural reproducing kernel Banach spaces and representer theorems for deep networks
ILeSiA: Interactive Learning of Robot Situational Awareness from Camera Input
Adaptive Routing of Text-to-Image Generation Requests Between Large Cloud Model and Light-Weight Edge Model
Rethinking Addressing in Language Models via Contexualized Equivariant Positional Encoding
Inverse Problem Sampling in Latent Space Using Sequential Monte Carlo
ABC: Achieving Better Control of Multimodal Embeddings using VLMs
Online Convex Optimization and Integral Quadratic Constraints: An automated approach to regret analysis
Improving Predictions of Convective Storm Wind Gusts through Statistical Post-Processing of Neural Weather Models
Training neural control variates using correlated configurations
Machine Learning Approaches to Vocal Register Classification in Contemporary Male Pop Music
EgoDex: Learning Dexterous Manipulation from Large-Scale Egocentric Video
Scalable Bayesian Monte Carlo: fast uncertainty estimation beyond deep ensembles
Large Language Models Encode Semantics in Low-Dimensional Linear Subspaces
A Robust BERT-Based Deep Learning Model for Automated Cancer Type Extraction from Unstructured Pathology Reports
SafeLLM: Unlearning Harmful Outputs from Large Language Models against Jailbreak Attacks
Revisiting Pre-processing Group Fairness: A Modular Benchmarking Framework
Frequency-adaptive tensor neural networks for high-dimensional multi-scale problems
SleepDIFFormer: Sleep Stage Classification via Multivariate Differential Transformer
See Beyond a Single View: Multi-Attribution Learning Leads to Better Conversion Rate Prediction
Learning ECG Representations via Poly-Window Contrastive Learning
Deep Think with Confidence
Evaluating Knowledge Graph Complexity via Semantic, Spectral, and Structural Metrics for Link Prediction
Saving for the future: Enhancing generalization via partial logic regularization
ExBigBang: A Dynamic Approach for Explainable Persona Classification through Contextualized Hybrid Transformer Analysis
Enhancing Forecasting with a 2D Time Series Approach for Cohort-Based Data
Fairness for the People, by the People: Minority Collective Action
CITE: A Comprehensive Benchmark for Heterogeneous Text-Attributed Graphs on Catalytic Materials
Federated Learning based on Self-Evolving Gaussian Clustering
Measures of Overlapping Multivariate Gaussian Clusters in Unsupervised Online Learning
Mini-Batch Robustness Verification of Deep Neural Networks
Learning Protein-Ligand Binding in Hyperbolic Space
Let's Grow an Unbiased Community: Guiding the Fairness of Graphs via New Links
Jointly Computation- and Communication-Efficient Distributed Learning
Stabilization of Perturbed Loss Function: Differential Privacy without Gradient Noise
AI-Powered Machine Learning Approaches for Fault Diagnosis in Industrial Pumps
Conformalized Exceptional Model Mining: Telling Where Your Model Performs (Not) Well
Inductive Domain Transfer In Misspecified Simulation-Based Inference
Continual Neural Topic Model
Classification errors distort findings in automated speech processing: examples and solutions from child-development research
Correct-By-Construction: Certified Individual Fairness through Neural Network Training
Amortized In-Context Mixed Effect Transformer Models: A Zero-Shot Approach for Pharmacokinetics
Tensorized Multi-Task Learning for Personalized Modeling of Heterogeneous Individuals with High-Dimensional Data
An Efficient Open World Environment for Multi-Agent Social Learning
Conditionally adaptive augmented Lagrangian method for physics-informed learning of forward and inverse problems using artificial neural networks
Investigation of D-Wave quantum annealing for training Restricted Boltzmann Machines and mitigating catastrophic forgetting
Communication Efficient LLM Pre-training with SparseLoCo
Probability Density from Latent Diffusion Models for Out-of-Distribution Detection
Intern-S1: A Scientific Multimodal Foundation Model
Distributed Detection of Adversarial Attacks in Multi-Agent Reinforcement Learning with Continuous Action Space
Computational Resolution of Hadamard Product Factorization for 4 × 4 Matrices
Closing the Performance Gap in Generative Recommenders with Collaborative Tokenization and Efficient Modeling
Personalized Recommendations via Active Utility-based Pairwise Sampling
Denoising by neural network for muzzle blast detection
Human Feedback Driven Dynamic Speech Emotion Recognition
MCPTox: A Benchmark for Tool Poisoning Attack on Real-World MCP Servers
AGP: A Novel Arabidopsis thaliana Genomics-Phenomics Dataset and its HyperGraph Baseline Benchmarking
XAI-Driven Spectral Analysis of Cough Sounds for Respiratory Disease Characterization
Potential and challenges of generative adversarial networks for super-resolution in 4D Flow MRI
CUTE-MRI: Conformalized Uncertainty-based framework for Time-adaptivE MRI
Generative AI models enable efficient and physically consistent sea-ice simulations
A Vision-Based Shared-Control Teleoperation Scheme for Controlling the Robotic Arm of a Four-Legged Robot
Kernel-based Equalized Odds: A Quantification of Accuracy-Fairness Trade-off in Fair Representation Learning
Adaptive Anomaly Detection in Evolving Network Environments
Integrated Sensing, Communication, and Computation for Over-the-Air Federated Edge Learning
GEN2: A Generative Prediction-Correction Framework for Long-time Emulations of Spatially-Resolved Climate Extremes
Pretrained Diffusion Models Are Inherently Skipped-Step Samplers
MMQ: Multimodal Mixture-of-Quantization Tokenization for Semantic ID Generation and User Behavioral Adaptation
CUPE: Contextless Universal Phoneme Encoder for Language-Agnostic Speech Processing
Flow Matching at Scale: A Machine Learning Framework for Efficient Large-Size Sampling of Many-Body Systems
An Enhanced Audio Feature Tailored for Anomalous Sound Detection Based on Pre-trained Models
Bayesian Inference and Learning in Nonlinear Dynamical Systems: A Framework for Incorporating Explicit and Implicit Prior Knowledge
Exploiting Vocabulary Frequency Imbalance in Language Model Pre-training
Foundational Design Principles and Patterns for Building Robust and Adaptive GenAI-Native Systems
JEDI-linear: Fast and Efficient Graph Neural Networks for Jet Tagging on FPGAs
Influence-driven Curriculum Learning for Pre-training on Limited Data
High-dimensional Asymptotics of Generalization Performance in Continual Ridge Regression
BadFU: Backdoor Federated Learning through Adversarial Machine Unlearning
HEAS: Hierarchical Evolutionary Agent Simulation Framework for Cross-Scale Modeling and Multi-Objective Search
Backpropagation-Free Test-Time Adaptation via Probabilistic Gaussian Alignment
Exploiting Policy Idling for Dexterous Manipulation
Bayesian Optimization with Expected Improvement: No Regret and the Choice of Incumbent
Towards Reliable and Generalizable Differentially Private Machine Learning (Extended Version)
Continual Learning for Multimodal Data Fusion of a Soft Gripper
Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Models
Teuken-7B-Base & Teuken-7B-Instruct: Towards European LLMs
Fine-tuning foundational models to code diagnoses from veterinary health records
Knowledge-Guided Prompt Learning for Request Quality Assurance in Public Code Review
Evaluation Agent: Efficient and Promptable Evaluation Framework for Visual Generative Models
Large Language Models for Automated Literature Review: An Evaluation of Reference Generation, Abstract Writing, and Review Composition
Modeling Discrimination with Causal Abstraction
Learning to Generate Unit Tests for Automated Debugging
Self-Supervised Prompt Optimization
RefineCoder: Iterative Improving of Large Language Models via Adaptive Critique Refinement for Code Generation
Ontology-Guided Reverse Thinking Makes Large Language Models Stronger on Knowledge Graph Question Answering
Innamark: A Whitespace Replacement Information-Hiding Method
Synthetic vs. Gold: The Role of LLM Generated Labels and Data in Cyberbullying Detection
Pragmatic Inference Chain (PIC) Improving LLMs' Reasoning of Authentic Implicit Toxic Language
A Case for Specialisation in Non-Human Entities
Revisiting Out-of-Distribution Detection in Real-time Object Detection: From Benchmark Pitfalls to a New Mitigation Paradigm
VerifiAgent: a Unified Verification Agent in Language Model Reasoning
TextSplat: Text-Guided Semantic Fusion for Generalizable Gaussian Splatting
MuSeD: A Multimodal Spanish Dataset for Sexism Detection in Social Media Videos
Kuwain 1.5B: An Arabic SLM via Language Injection
Cequel: Cost-Effective Querying of Large Language Models for Text Clustering
On the Consistency of GNN Explanations for Malware Detection
Annif at SemEval-2025 Task 5: Traditional XMTC augmented by LLMs
Sadeed: Advancing Arabic Diacritization Through Small Language Model
MMiC: Mitigating Modality Incompleteness in Clustered Federated Learning
Mutarjim: Advancing Bidirectional Arabic-English Translation with a Small Language Model
Versatile Cardiovascular Signal Generation with a Unified Diffusion Transformer
Lossless Token Sequence Compression via Meta-Tokens
LaMP-Cap: Personalized Figure Caption Generation With Multimodal Figure Profiles
Deep regularization networks for inverse problems with noisy operators
A Survey of Foundation Models for IoT: Taxonomy and Criteria-Based Analysis
Empirical Evidence for Alignment Faking in a Small LLM and Prompt-Based Mitigation Techniques
KEA Explain: Explanations of Hallucinations using Graph Kernel Analysis
MCA-RG: Enhancing LLMs with Medical Concept Alignment for Radiology Report Generation
Cross-Modality Masked Learning for Survival Prediction in ICI Treated NSCLC Patients
Generation of structure-guided pMHC-I libraries using Diffusion Models
Cohort-Aware Agents for Individualized Lung Cancer Risk Prediction Using a Retrieval-Augmented Model Selection Framework
Structure-Aware Temporal Modeling for Chronic Disease Progression Prediction
HHNAS-AM: Hierarchical Hybrid Neural Architecture Search using Adaptive Mutation Policies
Linear Preference Optimization: Decoupled Gradient Control via Absolute Regularization
Large Foundation Model for Ads Recommendation
CuMoLoS-MAE: A Masked Autoencoder for Remote Sensing Data Reconstruction
Aura-CAPTCHA: A Reinforcement Learning and GAN-Enhanced Multi-Modal CAPTCHA System
Generative Neural Operators of Log-Complexity Can Simultaneously Solve Infinitely Many Convex Programs
TOAST: Fast and scalable auto-partitioning based on principled static analysis
Fragment-Wise Interpretability in Graph Neural Networks via Molecule Decomposition and Contribution Analysis
Nonlinear Federated System Identification
Rethinking the Potential of Layer Freezing for Efficient DNN Training
Robust Estimation Under Heterogeneous Corruption Rates
Enhancing Optimizer Stability: Momentum Adaptation of The NGN Step-size
Evaluating Sparse Autoencoders for Monosemantic Representation
Side Effects of Erasing Concepts from Diffusion Models
Towards Source-Free Machine Unlearning
SurgWound-Bench: A Benchmark for Surgical Wound Diagnosis
SemToken: Semantic-Aware Tokenization for Efficient Long-Context Language Modeling
Survey of Vision-Language-Action Models for Embodied Manipulation
SparK: Query-Aware Unstructured Sparsity with Recoverable KV Cache Channel Pruning
Locally Pareto-Optimal Interpretations for Black-Box Machine Learning Models
GenTune: Toward Traceable Prompts to Improve Controllability of Image Refinement in Environment Design
VocabTailor: Dynamic Vocabulary Selection for Downstream Tasks in Small Language Models
Robust and Efficient Quantum Reservoir Computing with Discrete Time Crystal
Explainable Knowledge Distillation for Efficient Medical Image Classification
Conflict-Aware Soft Prompting for Retrieval-Augmented Generation
M-LLM³REC: A Motivation-Aware User-Item Interaction Framework for Enhancing Recommendation Accuracy with LLMs
Way to Build Native AI-driven 6G Air Interface: Principles, Roadmap, and Outlook
DesignCLIP: Multimodal Learning with CLIP for Design Patent Understanding
IPIGuard: A Novel Tool Dependency Graph-Based Defense Against Indirect Prompt Injection in LLM Agents
First RAG, Second SEG: A Training-Free Paradigm for Camouflaged Object Detection
VideoEraser: Concept Erasure in Text-to-Video Diffusion Models
Predicting Road Crossing Behaviour using Pose Detection and Sequence Modelling
Unveiling Trust in Multimodal Large Language Models: Evaluation, Analysis, and Mitigation
Image-Conditioned 3D Gaussian Splat Quantization
EvoFormer: Learning Dynamic Graph-Level Representations with Structural and Temporal Bias Correction
Bladder Cancer Diagnosis with Deep Learning: A Multi-Task Framework and Online Platform
Hybrid Least Squares/Gradient Descent Methods for DeepONets
When Audio and Text Disagree: Revealing Text Bias in Large Audio-Language Models
Bridging Generalization and Personalization in Wearable Human Activity Recognition via On-Device Few-Shot Learning
LLaSO: A Foundational Framework for Reproducible Research in Large Language and Speech Model
An Empirical Study of Knowledge Distillation for Code Understanding Tasks
Test-time Corpus Feedback: From Retrieval to RAG
Mitigating Hallucinations in LM-Based TTS Models via Distribution Alignment Using GFlowNets
Reliable Unlearning Harmful Information in LLMs with Metamorphosis Representation Projection
A Solvable Molecular Switch Model for Stable Temporal Information Processing
RadReason: Radiology Report Evaluation Metric with Reasons and Sub-Scores
Subjective Behaviors and Preferences in LLM: Language of Browsing
LGMSNet: Thinning a medical image segmentation model via dual-level multiscale fusion
LLM-Driven Self-Refinement for Embodied Drone Task Planning
LoUQAL: Low-fidelity informed Uncertainty Quantification for Active Learning in the chemical configuration space
Are Virtual DES Images a Valid Alternative to the Real Ones?
Trained Miniatures: Low cost, High Efficacy SLMs for Sales & Marketing
GRASPED: Graph Anomaly Detection using Autoencoder with Spectral Encoder and Decoder (Full Version)
Label Uncertainty for Ultrasound Segmentation
Towards a 3D Transfer-based Black-box Attack via Critical Feature Guidance
Benchmarking Computer Science Survey Generation
Mind and Motion Aligned: A Joint Evaluation IsaacSim Benchmark for Task Planning and Low-Level Policies in Mobile Manipulation
Row-Column Hybrid Grouping for Fault-Resilient Multi-Bit Weight Representation on IMC Arrays
Foundation Models for Cross-Domain EEG Analysis Application: A Survey
StreamMem: Query-Agnostic KV Cache Memory for Streaming Video Understanding
Tutorial on the Probabilistic Unification of Estimation Theory, Machine Learning, and Generative AI
EcomMMMU: Strategic Utilization of Visuals for Robust Multimodal E-Commerce Models
Numerical models outperform AI weather forecasts of record-breaking extremes
End-to-End Agentic RAG System Training for Traceable Diagnostic Reasoning
"Does the cafe entrance look accessible? Where is the door?" Towards Geospatial AI Agents for Visual Inquiries
Dissecting Tool-Integrated Reasoning: An Empirical Study and Analysis
Neural Robot Dynamics
LiveMCP-101: Stress Testing and Diagnosing MCP-enabled Agents on Challenging Queries
Discovering Hidden Algebraic Structures via Transformers with Rank-Aware Beam GRPO
SceneGen: Single-Image 3D Scene Generation in One Feedforward Pass
CRISPR-GPT for Agentic Automation of Gene-editing Experiments
Non-linear Welfare-Aware Strategic Learning
Human-Object Interaction from Human-Level Instructions
On Learning Action Costs from Input Plans
Exploring the Effect of Explanation Content and Format on User Comprehension and Trust in Healthcare
VLASCD: A Visual Language Action Model for Simultaneous Chatting and Decision Making
CopyrightShield: Enhancing Diffusion Model Security against Copyright Infringement Attacks
SycEval: Evaluating LLM Sycophancy
PersonaBench: Evaluating AI Models on Understanding Personal Information through Accessing (Synthetic) Private User Data
Automatic Curriculum Design for Zero-Shot Human-AI Coordination
GATES: Cost-aware Dynamic Workflow Scheduling via Graph Attention Networks and Evolution Strategy
Exploring Big Five Personality and AI Capability Effects in LLM-Simulated Negotiation Dialogues
Using a cognitive architecture to consider antiBlackness in design and development of AI systems
Unplug and Play Language Models: Decomposing Experts in Language Models at Inference Time
Generating 3D Terrain with 2D Cellular Automata
CREMA: A Contrastive Regularized Masked Autoencoder for Robust ECG Diagnostics across Clinical Domains
OPDR: Order-Preserving Dimension Reduction for Semantic Embedding of Multimodal Scientific Data
BoostTrack++: using tracklet information to detect more objects in multiple object tracking
A Fully Spectral Neuro-Symbolic Reasoning Architecture with Graph Signal Processing as the Computational Backbone
Goals and the Structure of Experience
Collab-REC: An LLM-based Agentic Framework for Balancing Recommendations in Tourism
Emergent Crowds Dynamics from Language-Driven Multi-Agent Interactions
Don't Think Twice! Over-Reasoning Impairs Confidence Calibration
Demonstrating Onboard Inference for Earth Science Applications with Spectral Analysis Algorithms and Deep Learning
S3LoRA: Safe Spectral Sharpness-Guided Pruning in Adaptation of Agent Planner
Argumentation for Explainable Workforce Optimisation (with Appendix)
Open-Universe Assistance Games
aiXiv: A Next-Generation Open Access Ecosystem for Scientific Discovery Generated by AI Scientists
Mobile-Agent-v3: Foundamental Agents for GUI Automation
PuzzleClone: An SMT-Powered Framework for Synthesizing Verifiable Data
LLM4Sweat: A Trustworthy Large Language Model for Hyperhidrosis Support
R-ConstraintBench: Evaluating LLMs on NP-Complete Scheduling
See it. Say it. Sorted: Agentic System for Compositional Diagram Generation
Computational Intelligence based Land-use Allocation Approaches for Mixed Use Areas
Multiple Memory Systems for Enhancing the Long-term Memory of Agent
Coarse-to-Fine Grounded Memory for LLM Agent Planning
Search-Based Credit Assignment for Offline Preference-Based Reinforcement Learning
RETAIL: Towards Real-world Travel Planning for Large Language Models
DiagECG: An LLM-Driven Framework for Diagnostic Reasoning via Discretized ECG Tokenization
Planning with Minimal Disruption
GraSP: A Unified Graph-Based Framework for Scalable Generation, Quality Tagging, and Management of Synthetic Data for SFT and DPO
From Bits to Boardrooms: A Cutting-Edge Multi-Agent LLM Framework for Business Excellence
Think in Blocks: Adaptive Reasoning from Direct Response to Deep Reasoning
Super-additive Cooperation in Language Model Agents
DeepThink3D: Enhancing Large Language Models with Programmatic Reasoning in Complex 3D Situated Reasoning Tasks
A Dynamical Systems Framework for Reinforcement Learning Safety and Robustness Verification
Transduction is All You Need for Structured Data Workflows
Adapting A Vector-Symbolic Memory for Lisp ACT-R
Understanding Action Effects through Instrumental Empowerment in Multi-Agent Reinforcement Learning
Futurity as Infrastructure: A Techno-Philosophical Interpretation of the AI Lifecycle
GRAFT: GRaPH and Table Reasoning for Textual Alignment -- A Benchmark for Structured Instruction Following and Visual Reasoning
NiceWebRL: a Python library for human subject experiments with reinforcement learning environments
Measuring the environmental impact of delivering AI at Google Scale
Response and Prompt Evaluation to Prevent Parasocial Relationships with Chatbots
Language-Guided Tuning: Enhancing Numeric Optimization with Textual Feedback
SVM/SVR Kernels as Quantum Propagators
Accelerating GenAI Workloads by Enabling RISC-V Microkernel Support in IREE
Efficient Switchable Safety Control in LLMs via Magic-Token-Guided Co-Training
Privacy Preserving Inference of Personalized Content for Out of Matrix Users
Collaborative Filtering using Variational Quantum Hopfield Associative Memory
A Chinese Heart Failure Status Speech Database with Universal and Personalised Classification
Transsion Multilingual Speech Recognition System for MLC-SLM 2025 Challenge
Disentangling the Drivers of LLM Social Conformity: An Uncertainty-Moderated Dual-Process Mechanism
Designing an Interdisciplinary Artificial Intelligence Curriculum for Engineering: Evaluation and Insights from Experts
Fusing Structural Phenotypes with Functional Data for Early Prediction of Primary Angle Closure Glaucoma Progression
A U-Statistic-based random forest approach for genetic interaction study
Learning to Drive Ethically: Embedding Moral Reasoning into Autonomous Driving
AI Testing Should Account for Sophisticated Strategic Behaviour
Heatmap Regression without Soft-Argmax for Facial Landmark Detection
TOM: An Open-Source Tongue Segmentation Method with Multi-Teacher Distillation and Task-Specific Data Augmentation
Inference Time Debiasing Concepts in Diffusion Models
Can synthetic data reproduce real-world findings in epidemiology? A replication study using tree-based generative AI
Quantum Long Short-term Memory with Differentiable Architecture Search
Fast Graph Neural Network for Image Classification
Quantized Neural Networks for Microcontrollers: A Comprehensive Review of Methods, Platforms, and Applications
Twin-Boot: Uncertainty-Aware Optimization via Online Two-Sample Bootstrapping
TAIGen: Training-Free Adversarial Image Generation via Diffusion Models
Reversible Unfolding Network for Concealed Visual Perception with Generative Refinement
A Systematic Survey of Model Extraction Attacks and Defenses: State-of-the-Art and Perspectives
MoEcho: Exploiting Side-Channel Attacks to Compromise User Privacy in Mixture-of-Experts LLMs
Decentralized Vision-Based Autonomous Aerial Wildlife Monitoring
From Basic Affordances to Symbolic Thought: A Computational Phylogenesis of Biological Intelligence
LongRecall: A Structured Approach for Robust Recall Evaluation in Long-Form Text
Wormhole Dynamics in Deep Neural Networks
Mapping the Course for Prompt-based Structured Prediction
Nemotron-CC-Math: A 133 Billion-Token-Scale High Quality Math Pretraining Dataset
Hydra: A 1.6B-Parameter State-Space Language Model with Sparse Attention, Mixture-of-Experts, and Memory
Equi-mRNA: Protein Translation Equivariant Encoding for mRNA Language Models
Enhanced Predictive Modeling for Hazardous Near-Earth Object Detection: A Comparative Analysis of Advanced Resampling Strategies and Machine Learning Algorithms in Planetary Risk Assessment
Universal Reinforcement Learning in Coalgebras: Asynchronous Stochastic Computation via Conduction

Research Sources: 443 | Generated: 8/25/2025