AI Research News Feeds for October 9th, 2025

AI RESEARCH PAPERS & ACADEMIC SOURCES

RAISE: A self-driving laboratory for interfacial property formulation discovery
Safe Obstacle-Free Guidance of Space Manipulators in Debris Removal Missions via Deep Reinforcement Learning
Assist-As-Needed: Adaptive Multimodal Robotic Assistance for Medication Management in Dementia Care
RLinf-VLA: A Unified and Efficient Framework for VLA+RL Training
SanDRA: Safe Large-Language-Model-Based Decision Making for Automated Vehicles Using Reachability Analysis
Distributed 3D Source Seeking via SO(3) Geometric Control of Robot Swarms
Tailoring materials into kirigami robots
Temporal-Prior-Guided View Planning for Periodic 3D Plant Reconstruction
Diffusing Trajectory Optimization Problems for Recovery During Multi-Finger Manipulation
Bring the Apple, Not the Sofa: Impact of Irrelevant Context in Embodied AI Commands on VLA Models
Sampling Strategies for Robust Universal Quadrupedal Locomotion Policies
DPL: Depth-only Perceptive Humanoid Locomotion via Realistic Depth Synthesis and Cross-Attention Terrain Reconstruction
A Narwhal-Inspired Sensing-to-Control Framework for Small Fixed-Wing Aircraft
COMPAct: Computational Optimization and Automated Modular design of Planetary Actuators
Three-dimensional Integrated Guidance and Control for Leader-Follower Flexible Formation of Fixed Wing UAVs
Terrain-Aided Navigation Using a Point Cloud Measurement Sensor
Artists' Views on Robotics Involvement in Painting Productions
M^3RS: Multi-robot, Multi-objective, and Multi-mode Routing and Scheduling
EffiTune: Diagnosing and Mitigating Training Inefficiency for Parameter Tuner in Robot Navigation System
P2 Explore: Efficient Exploration in Unknown Cluttered Environment with Floor Plan Prediction
Generating and Optimizing Topologically Distinct Guesses for Mobile Manipulator Path Planning with Path Constraints
Diffusion Trajectory-guided Policy for Long-horizon Robot Manipulation
Development of a magnetorheological hand exoskeleton featuring a high force-to-power ratio for enhanced grip endurance
Control of Humanoid Robots with Parallel Mechanisms using Differential Actuation Models
Interleave-VLA: Enhancing Robot Manipulation with Interleaved Image-Text Instructions
Touch Speaks, Sound Feels: A Multimodal Approach to Affective and Social Touch from Robots to Humans
UltraHiT: A Hierarchical Transformer Architecture for Generalizable Internal Carotid Artery Robotic Ultrasonography
BIM Informed Visual SLAM for Construction Monitoring
Inducing State Anxiety in LLM Agents Reproduces Human-Like Biases in Consumer Decision-Making
"Grillz on a hijabi": Intersectional Identities in Fostering Critical AI Literacy
Code Semantic Zooming
Back to the Future Museum -- Speculative Design for Virtual Citizen-Curated Museums
AI Eyes on the Road: Cross-Cultural Perspectives on Traffic Surveillance
A Meat-Summer Night's Dream: A Tangible Design Fiction Exploration of Eating Biohybrid Flying Robots
Examining Solidarity Against AI-Enabled Surveillance at the Intersection of Workplace and Carceral Realities
PriorWeaver: Prior Elicitation via Iterative Dataset Construction
RAVEN: Realtime Accessibility in Virtual ENvironments for Blind and Low-Vision People
Investigating Students' Preferences for AI Roles in Mathematical Modelling: Evidence from a Randomized Controlled Trial
"It feels like hard work trying to talk to it": Understanding Older Adults' Experiences of Encountering and Repairing Conversational Breakdowns with AI Systems
"Sometimes You Need Facts, and Sometimes a Hug": Understanding Older Adults' Preferences for Explanations in LLM-Based Conversational AI Systems
Lonely Individuals Show Distinct Patterns of Social Media Engagement
Am I Productive? Exploring the Experience of Remote Workers with Task Management Tools
Prototyping Multimodal GenAI Real-Time Agents with Counterfactual Replays and Hybrid Wizard-of-Oz
The Feature Understandability Scale for Human-Centred Explainable AI: Assessing Tabular Feature Importance
AI for Abolition? A Participatory Design Approach
Exploring the Feasibility of Gaze-Based Navigation Across Path Types
Regulating Social Media: Surveying the Impact of Nepali Government's TikTok Ban
A Review of 10 Years of ProtoSpace: Spacecraft CAD Visualization in Collaborative Augmented Reality
The Stage Comes to You: A Real-Time Tele-Immersive System with 3D Point Clouds and Vibrotactile Feedback
From Neural Sensing to Stimulation: An Interdisciplinary Roadmap for Neurotechnology
A risk model and analysis method for the psychological safety of human and autonomous vehicles interaction
Desirable Unfamiliarity: Insights from Eye Movements on Engagement and Readability of Dictation Interfaces
Geometric Queries on Closed Implicit Surfaces for Walk on Stars
SAR-GS: Gaussian Splatting based SAR Images Rendering and Target Reconstruction
LangSplatV2: High-dimensional 3D Language Gaussian Splatting with 450+ FPS
LLM-Powered Nuanced Video Attribute Annotation for Enhanced Recommendations
Can We Hide Machines in the Crowd? Quantifying Equivalence in LLM-in-the-loop Annotation Tasks
Reproducing and Extending Causal Insights Into Term Frequency Computation in Neural Rankers
Ethical AI prompt recommendations in large language models using collaborative filtering
Reasoning-enhanced Query Understanding through Decomposition and Interpretation
Automated Repeatable Adversary Threat Emulation with Effects Language (EL)
Breaking Precision Time: OS Vulnerability Exploits Against IEEE 1588
Proofs of No Intrusion
BATTLE for Bitcoin: Capital-Efficient Optimistic Bridges with Large Committees
SpyChain: Multi-Vector Supply Chain Attacks on Small Satellite Systems
Auto-Stega: An Agent-Driven System for Lifelong Strategy Evolution in LLM-Based Text Steganography
Code Agent can be an End-to-end System Hacker: Benchmarking Real-world Threats of Computer-use Agent
I Can't Patch My OT Systems! A Look at CISA's KEVC Workarounds & Mitigations for OT
A multi-layered embedded intrusion detection framework for programmable logic controllers
Exposing LLM User Privacy via Traffic Fingerprint Analysis: A Study of Privacy Risks in LLM Agent Interactions
Security-Robustness Trade-offs in Diffusion Steganography: A Comparative Analysis of Pixel-Space and VAE-Based Architectures
Benchmarking Fake Voice Detection in the Fake Voice Generation Arms Race
Representation Gap of the Motzkin Monoid
The Knowledge Complexity of Quantum Problems
Friend or Foe Inside? Exploring In-Process Isolation to Maintain Memory Safety for Unsafe Rust
Streamlining Plug-and-Charge Authorization for Electric Vehicles with OAuth2 and OIDC
WAFFLED: Exploiting Parsing Discrepancies to Bypass Web Application Firewalls
On Univariate Sumcheck
RevealNet: Distributed Traffic Correlation for Attack Attribution on Programmable Networks
Security through the Eyes of AI: How Visualization is Shaping Malware Detection
Securing WiFi Fingerprint-based Indoor Localization Systems from Malicious Access Points
Obfuscated Quantum and Post-Quantum Cryptography
jmstate, a Flexible Python Package for Multi-State Joint Modeling
Efficient reductions from a Gaussian source with applications to statistical-computational tradeoffs
Vi-TacMan: Articulated Object Manipulation via Vision and Touch
A Formal gatekeeper Framework for Safe Dual Control with Active Exploration
What You Don't Know Can Hurt You: How Well do Latent Safety Filters Understand Partially Observable Safety Constraints?
StyleKeeper: Prevent Content Leakage using Negative Visual Query Guidance
Lattice-allocated Real-time Line Segment Feature Detection and Tracking Using Only an Event-based Camera
Continual Action Quality Assessment via Adaptive Manifold-Aligned Graph Regularization
Online Generic Event Boundary Detection
HARP-NeXt: High-Speed and Accurate Range-Point Fusion Network for 3D LiDAR Semantic Segmentation
Lung Infection Severity Prediction Using Transformers with Conditional TransMix Augmentation and Cross-Attention
Label-frugal satellite image change detection with generative virtual exemplar learning
IAR2: Improving Autoregressive Visual Generation with Semantic-Detail Associated Token Prediction
OBJVanish: Physically Realizable Text-to-3D Adv. Generation of LiDAR-Invisible Objects
Addressing the ID-Matching Challenge in Long Video Captioning
No MoCap Needed: Post-Training Motion Diffusion Models with Reinforcement Learning using Only Textual Prompts
Bayesian Modelling of Multi-Year Crop Type Classification Using Deep Neural Networks and Hidden Markov Models
U-Bench: A Comprehensive Understanding of U-Net through 100-Variant Benchmarking
Concept Retrieval -- What and How?
DADO: A Depth-Attention framework for Object Discovery
Enhancing Concept Localization in CLIP-based Concept Bottleneck Models
MoRe: Monocular Geometry Refinement via Graph Optimization for Cross-View Consistency
Validation of Various Normalization Methods for Brain Tumor Segmentation: Can Federated Learning Overcome This Heterogeneity?
Few-Shot Adaptation Benchmark for Remote Sensing Vision-Language Models
Are We Using the Right Benchmark: An Evaluation Framework for Visual Token Compression Methods
MV-Performer: Taming Video Diffusion Model for Faithful and Synchronized Multi-view Performer Synthesis
EigenScore: OOD Detection using Covariance in Diffusion Models
TalkCuts: A Large-Scale Dataset for Multi-Shot Human Speech Video Generation
Evaluating Fundus-Specific Foundation Models for Diabetic Macular Edema Detection
SpecGuard: Spectral Projection-based Advanced Invisible Watermarking
MATRIX: Mask Track Alignment for Interaction-aware Video Generation
WristWorld: Generating Wrist-Views via 4D World Models for Robotic Manipulation
Pixel-Perfect Depth with Semantics-Prompted Diffusion Transformers
Quantum-enhanced Computer Vision: Going Beyond Classical Algorithms
Temporal Prompting Matters: Rethinking Referring Video Object Segmentation
Active Next-Best-View Optimization for Risk-Averse Path Planning
Real-Time Glass Detection and Reprojection using Sensor Fusion Onboard Aerial Robots
UniFField: A Generalizable Unified Neural Feature Field for Visual, Semantic, and Spatial Uncertainties in Any Scene
Bionetta: Efficient Client-Side Zero-Knowledge Machine Learning Proving
Capture and Interact: Rapid 3D Object Acquisition and Rendering with Gaussian Splatting in Unity
LoDisc: Learning Global-Local Discriminative Features for Self-Supervised Fine-Grained Visual Recognition
Decomposed Global Optimization for Robust Point Matching with Low-Dimensional Branching
Guardians of Image Quality: Benchmarking Defenses Against Adversarial Attacks on Image Quality Metrics
CaRDiff: Video Salient Object Ranking Chain of Thought Reasoning for Saliency Prediction with Diffusion
Taming Diffusion Models for Image Restoration: A Review
Erasing More Than Intended? How Concept Erasure Degrades the Generation of Non-Target Concepts
Polyp-Gen: Realistic and Diverse Polyp Image Generation for Endoscopic Dataset Expansion
SubGrapher: Visual Fingerprinting of Chemical Structures
MMPerspective: Do MLLMs Understand Perspective? A Comprehensive Benchmark for Perspective Perception, Reasoning, and Robustness
Fully Spiking Neural Networks for Unified Frame-Event Object Tracking
Uncertainty-Aware Remaining Lifespan Prediction from Images
WAFT: Warping-Alone Field Transforms for Optical Flow
BIM-Constrained Optimization for Accurate Localization and Deviation Correction in Construction Monitoring
OpenStaxQA: A multilingual dataset based on open-source college textbooks
A Comprehensive Survey of Hallucination in Large Language Models: Causes, Detection, and Mitigation
Type and Complexity Signals in Multilingual Question Representations
LLM Bias Detection and Mitigation through the Lens of Desired Distributions
EVALUESTEER: Measuring Reward Model Steerability Towards Values and Preference
Semantic Regexes: Auto-Interpreting LLM Features with a Structured Language
Controllable Stylistic Text Generation with Train-Time Attribute-Regularized Diffusion
Instructional Goal-Aligned Question Generation for Student Evaluation in Virtual Lab Settings: How Closely Do LLMs Actually Align?
FinLFQA: Evaluating Attributed Text Generation of LLMs in Financial Long-Form Question Answering
Bridging Discourse Treebanks with a Unified Rhetorical Structure Parser
MathRobust-LV: Evaluation of Large Language Models' Robustness to Linguistic Variations in Mathematical Reasoning
Linguistically Informed Tokenization Improves ASR for Underresourced Languages
Test-Time Scaling of Reasoning Models for Machine Translation
Flipping the Dialogue: Training and Evaluating User Language Models
TinyScientist: An Interactive, Extensible, and Controllable Framework for Building Research Agents
Do Internal Layers of LLMs Reveal Patterns for Jailbreak Detection?
Aligning Large Language Models via Fully Self-Synthetic Data
ToolMem: Enhancing Multimodal Agents with Learnable Tool Capability Memory
PIKA: Expert-Level Synthetic Datasets for Post-Training Alignment from Scratch
How Language Models Conflate Logical Validity with Plausibility: A Representational Analysis of Content Effects
PTEB: Towards Robust Text Embedding Evaluation via Stochastic Paraphrasing at Evaluation Time with LLMs
AWM: Accurate Weight-Matrix Fingerprint for Large Language Models
TWIST: Training-free and Label-free Short Text Clustering through Iterative Vector Updating with LLMs
A Formal Framework for Fluency-based Multi-Reference Evaluation in Grammatical Error Correction
Gold-Switch: Training-Free Superposition of Slow- and Fast- Thinking LLMs
Adaptive LLM-Symbolic Reasoning via Dynamic Logical Solver Composition
Overview of the Plagiarism Detection Task at PAN 2025
Adaptive Tool Generation with Models as Tools and Reinforcement Learning
Mid-Training of Large Language Models: A Survey
GAMBIT+: A Challenge Set for Evaluating Gender Bias in Machine Translation Quality Estimation Metrics
Unlocking Latent Discourse Translation in LLMs Through Quality-Aware Decoding
$\lambda$-GRPO: Unifying the GRPO Frameworks with Learnable Token Preferences
MeXtract: Light-Weight Metadata Extraction from Scientific Papers
SHANKS: Simultaneous Hearing and Thinking for Spoken Language Models
Probing Social Identity Bias in Chinese LLMs with Gendered Pronouns and Social Groups
Towards Reliable Retrieval in RAG Systems for Large Legal Datasets
Beyond Monolingual Assumptions: A Survey of Code-Switched NLP in the Era of Large Language Models
Does Local News Stay Local?: Online Content Shifts in Sinclair-Acquired Stations
Revisiting Metric Reliability for Fine-grained Evaluation of Machine Translation and Summarization in Indian Languages
Accelerating Diffusion LLM Inference via Local Determinism Propagation
All Claims Are Equal, but Some Claims Are More Equal Than Others: Importance-Sensitive Factuality Evaluation of LLM Generations
Making Machines Sound Sarcastic: LLM-Enhanced and Retrieval-Guided Sarcastic Speech Synthesis
TALENT: Table VQA via Augmented Language-Enhanced Natural-text Transcription
Reasoning for Hierarchical Text Classification: The Case of Patents
More Data or Better Data? A Critical Analysis of Data Selection and Synthesis for Mathematical Reasoning
CARPAS: Towards Content-Aware Refinement of Provided Aspects for Summarization in Large Language Models
Biasless Language Models Learn Unnaturally: How LLMs Fail to Distinguish the Possible from the Impossible
Sunflower: A New Approach To Expanding Coverage of African Languages in Large Language Models
How much speech data is necessary for ASR in African languages? An evaluation of data scaling in Kinyarwanda and Kikuyu
Customer-R1: Personalized Simulation of Human Behaviors via RL-based LLM Agent in Online Shopping
LAD-RAG: Layout-aware Dynamic RAG for Visually-Rich Document Understanding
When Benchmarks Age: Temporal Misalignment through Large Language Model Factuality Evaluation
Red-Bandit: Test-Time Adaptation for LLM Red-Teaming via Bandit-Guided LoRA Experts
Don't Adapt Small Language Models for Tools; Adapt Tool Schemas to the Models
Think Natively: Unlocking Multilingual Reasoning with Consistency-Enhanced Reinforcement Learning
Agent Bain vs. Agent McKinsey: A New Text-to-SQL Benchmark for the Business Domain
CML-Bench: A Framework for Evaluating and Enhancing LLM-Powered Movie Scripts Generation
XLSR-Kanformer: A KAN-Intergrated model for Synthetic Speech Detection
GPT-5 Model Corrected GPT-4V's Chart Reading Errors, Not Prompting
Exposing Citation Vulnerabilities in Generative Engines
Crossing Domains without Labels: Distant Supervision for Term Extraction
RedTWIZ: Diverse LLM Red Teaming via Adaptive Attack Planning
Machines in the Crowd? Measuring the Footprint of Machine-Generated Text on Reddit
LatteReview: A Multi-Agent Framework for Systematic Review Automation Using Large Language Models
Benchmarking Gaslighting Negation Attacks Against Multimodal Large Language Models
Blessing of Multilinguality: A Systematic Analysis of Multilingual In-Context Learning
Diagnosing Moral Reasoning Acquisition in Language Models: Pragmatics and Generalization
Speculative Decoding and Beyond: An In-Depth Survey of Techniques
MCTS-RAG: Enhancing Retrieval-Augmented Generation with Monte Carlo Tree Search
Rethinking Multilingual Continual Pretraining: Data Mixing for Adapting LLMs Across Languages and Resources
GlotEval: A Test Suite for Massively Multilingual Evaluation of Large Language Models
Geometry of Semantics in Next-Token Prediction: How Optimization Implicitly Organizes Linguistic Representations
AutoRev: Multi-Modal Graph Retrieval for Automated Peer-Review Generation
HopWeaver: Cross-Document Synthesis of High-Quality and Authentic Multi-Hop Questions
FlowKV: Enhancing Multi-Turn Conversational Coherence in LLMs via Isolated Key-Value Cache Management
Do RAG Systems Really Suffer From Positional Bias?
MIST: Towards Multi-dimensional Implicit BiaS Evaluation of LLMs via Theory of Mind
PredGen: Accelerated Inference of Large Language Models through Input-Time Speculation for Real-Time Speech Interaction
Do LLMs Overthink Basic Math Reasoning? Benchmarking the Accuracy-Efficiency Tradeoff in Language Models
LLMVA-GEBC: Large Language Model with Video Adapter for Generic Event Boundary Captioning
LaunchpadGPT: Language Model as Music Visualization Designer on Launchpad
Emilia: A Large-Scale, Extensive, Multilingual, and Diverse Dataset for Speech Generation
Enhancing Few-shot Keyword Spotting Performance through Pre-Trained Self-supervised Speech Models
User to Video: A Model for Spammer Detection Inspired by Video Classification Technology
multimodars: A Rust-powered toolkit for multi-modality cardiac image fusion and registration
Does Physics Knowledge Emerge in Frontier Models?
Enhanced Self-Distillation Framework for Efficient Spiking Neural Network Training
Lumina-DiMOO: An Omni Diffusion Large Language Model for Multi-Modal Generation and Understanding
TDiff: Thermal Plug-And-Play Prior with Patch-Based Diffusion
SIGMA-GEN: Structure and Identity Guided Multi-subject Assembly for Image Generation
Superpixel Integrated Grids for Fast Image Segmentation
Text2Interact: High-Fidelity and Diverse Text-to-Two-Person Interaction Generation
From Captions to Keyframes: Efficient Video Summarization via Caption- and Context-Aware Frame Scoring
Limited-Angle Tomography Reconstruction via Projector Guided 3D Diffusion
VUGEN: Visual Understanding priors for GENeration
Through the Perspective of LiDAR: A Feature-Enriched and Uncertainty-Aware Annotation Pipeline for Terrestrial Point Cloud Segmentation
Improving Artifact Robustness for CT Deep Learning Models Without Labeled Artifact Images via Domain Adaptation
Ming-UniVision: Joint Image Understanding and Generation with a Unified Continuous Tokenizer
Adaptive Stain Normalization for Cross-Domain Medical Histology
AIM 2025 Challenge on Real-World RAW Image Denoising
Self-supervised Physics-guided Model with Implicit Representation Regularization for Fast MRI Reconstruction
A Bridge from Audio to Video: Phoneme-Viseme Alignment Allows Every Face to Speak Multiple Languages
MSITrack: A Challenging Benchmark for Multispectral Single Object Tracking
DreamOmni2: Multimodal Instruction-based Editing and Generation
SCas4D: Structural Cascaded Optimization for Boosting Persistent 4D Novel View Synthesis
DeRainMamba: A Frequency-Aware State Space Model with Detail Enhancement for Image Deraining
OBS-Diff: Accurate Pruning For Diffusion Models in One-Shot
Transforming Noise Distributions with Histogram Matching: Towards a Single Denoiser for All
A deep multiple instance learning approach based on coarse labels for high-resolution land-cover mapping
TTRV: Test-Time Reinforcement Learning for Vision Language Models
VA-Adapter: Adapting Ultrasound Foundation Model to Echocardiography Probe Guidance
Covert Quantum Learning: Privately and Verifiably Learning from Quantum Data
Accelerating Inference for Multilayer Neural Networks with Quantum Computers
Hybrid Reinforcement: When Reward Is Sparse, It's Better to Be Dense
On the Convergence of Moral Self-Correction in Large Language Models
Maximising the Utility of Validation Sets for Imbalanced Noisy-label Meta-learning
Differential Privacy for Adaptive Weight Aggregation in Federated Tumor Segmentation
Domain Generalization by Rejecting Extreme Augmentations
Generalizable Physics-Informed Learning for Stochastic Safety-Critical Systems
Want to train KANS at scale? Now UKAN!
Dynamic Learning Rate for Deep Reinforcement Learning: A Bandit Approach
Reinforcement Learning for Dynamic Memory Allocation
Quantum Rationale-Aware Graph Contrastive Learning for Jet Discrimination
VICON: Vision In-Context Operator Networks for Multi-Physics Fluid Dynamics Prediction
Contrastive Graph Condensation: Advancing Data Versatility through Self-Supervised Learning
DPGIIL: Dirichlet Process-Deep Generative Model-Integrated Incremental Learning for Clustering in Transmissibility-based Online Structural Anomaly Detection
Towards the Worst-case Robustness of Large Language Models
Denoising Score Matching with Random Features: Insights on Diffusion Models from Precise Learning Curves
A Differentiable Alignment Framework for Sequence-to-Sequence Modeling via Optimal Transport
A Novel Collaborative Framework for Efficient Synchronization in Split Federated Learning over Wireless Networks
Nonparametric Bellman Mappings for Value Iteration in Distributed Reinforcement Learning
Unveiling the Basin-Like Loss Landscape in Large Language Models
HoPE: Hybrid of Position Embedding for Long Context Vision-Language Models
Dual Natural Gradient Descent for Scalable Training of Physics-Informed Neural Networks
Learning where to learn: Training data distribution optimization for scientific machine learning
Inference-Time Scaling of Discrete Diffusion Models via Importance Weighting and Optimal Proposal Design
AMBER: Adaptive Mesh Generation by Iterative Mesh Resolution Prediction
AbsoluteNet: A Deep Learning Neural Network to Classify Cerebral Hemodynamic Responses of Auditory Processing
Adversarial Surrogate Risk Bounds for Binary Classification
Auto-Compressing Networks
On the necessity of adaptive regularisation:Optimal anytime online learning on $\boldsymbol{\ell_p}$-balls
P3D: Scalable Neural Surrogates for High-Resolution 3D Physics Simulations with Global Context
Spatiotemporal Tile-based Attention-guided LSTMs for Traffic Video Prediction
An Empirical Analysis of the Laplace and Neural Tangent Kernels
Train-Free Segmentation in MRI with Cubical Persistent Homology
Testing Support Size More Efficiently Than Learning Histograms
2 OLMo 2 Furious
Automating RT Planning at Scale: High Quality Data For AI Training
GreedyPixel: Fine-Grained Black-Box Adversarial Attack Via Greedy Algorithm
Bit-Level Discrete Diffusion with Markov Probabilistic Models: An Improved Framework with Sharp Convergence Bounds under Minimal Assumptions
Last-iterate Convergence for Symmetric, General-sum, $2 \times 2$ Games Under The Exponential Weights Dynamic
Jailbreak Attack Initializations as Extractors of Compliance Directions
DiffMI: Breaking Face Recognition Privacy via Diffusion-Driven Training-Free Model Inversion
Is Supervised Learning Really That Different from Unsupervised?
TokenWeave: Efficient Compute-Communication Overlap for Distributed LLM Inference
Guiding Giants: Lightweight Controllers for Weighted Activation Steering in LLMs
Roboflow100-VL: A Multi-Domain Object Detection Benchmark for Vision-Language Models
MetaSlot: Break Through the Fixed Number of Slots in Object-Centric Learning
360-LLaMA-Factory: Plug & Play Sequence Parallelism for Long Post-Training
Estimating the Joint Probability of Scenario Parameters with Gaussian Mixture Copula Models
Probing forced responses and causal mechanisms in large-scale climate dynamics with reduced-order neural models
Making and Evaluating Calibrated Forecasts
The Effect of Label Noise on the Information Content of Neural Representations
Test-Time Efficient Pretrained Model Portfolios for Time Series Forecasting
Nearly Instance-Optimal Parameter Recovery from Many Trajectories via Hellinger Localization
Bayesian Optimization under Uncertainty for Training a Scale Parameter in Stochastic Models
GUIDE: Guided Initialization and Distillation of Embeddings
Text-to-Image Models Leave Identifiable Signatures: Implications for Leaderboard Security
Wide Neural Networks as a Baseline for the Computational No-Coincidence Conjecture
DPA-Net: A Dual-Path Attention Neural Network for Inferring Glycemic Control Metrics from Self-Monitored Blood Glucose Data
POME: Post Optimization Model Edit via Muon-style Projection
Chem-NMF: Multi-layer $\alpha$-divergence Non-Negative Matrix Factorization for Cardiorespiratory Disease Clustering, with Improved Convergence Inspired by Chemical Catalysts and Rigorous Asymptotic Analysis
Three Forms of Stochastic Injection for Improved Distribution-to-Distribution Generative Modeling
StruSR: Structure-Aware Symbolic Regression with Physics-Informed Taylor Guidance
Rethinking Nonlinearity: Trainable Gaussian Mixture Modules for Modern Neural Architectures
The Effect of Attention Head Count on Transformer Approximation
XRPO: Pushing the limits of GRPO with Targeted Exploration and Exploitation
TimeFormer: Transformer with Attention Modulation Empowered by Temporal Characteristics for Time Series Forecasting
Distributed Algorithms for Multi-Agent Multi-Armed Bandits with Collision
AutoBalance: An Automatic Balancing Framework for Training Physics-Informed Neural Networks
Is the Hard-Label Cryptanalytic Model Extraction Really Polynomial?
A Diffusion Model for Regular Time Series Generation from Irregular Data with Completion and Masking
Incorporating Expert Knowledge into Bayesian Causal Discovery of Mixtures of Directed Acyclic Graphs
Function regression using the forward forward training and inferring paradigm
Get RICH or Die Scaling: Profitably Trading Inference Compute for Robustness
The Unreasonable Effectiveness of Randomized Representations in Online Continual Graph Learning
Efficient numeracy in language models through single-token number embeddings
Early wind turbine alarm prediction based on machine learning: AlarmForecasting
Vectorized FlashAttention with Low-cost Exponential Computation in RISC-V Vector Processors
SaFeR-VLM: Toward Safety-aware Fine-grained Reasoning in Multimodal Models
Vacuum Spiker: A Spiking Neural Network-Based Model for Efficient Anomaly Detection in Time Series
Utilizing Large Language Models for Machine Learning Explainability
Revisiting Node Affinity Prediction in Temporal Graphs
Fisher Information, Training and Bias in Fourier Regression Models
From Condensation to Rank Collapse: A Two-Stage Analysis of Transformer Training Dynamics
High-Rate Mixout: Revisiting Mixout for Robust Domain Generalization
Revisiting Mixout: An Overlooked Path to Robust Finetuning
Spiral Model Technique For Data Science & Machine Learning Lifecycle
Sharpness-Aware Data Generation for Zero-shot Quantization
COMPASS: A Multi-Turn Benchmark for Tool-Mediated Planning & Preference Optimization
Enhancing Speech Emotion Recognition via Fine-Tuning Pre-Trained Models and Hyper-Parameter Optimisation
Blind Construction of Angular Power Maps in Massive MIMO Networks
Non-Stationary Online Structured Prediction with Surrogate Losses
Non-Asymptotic Analysis of Efficiency in Conformalized Regression
DPMM-CFL: Clustered Federated Learning via Dirichlet Process Mixture Model Nonparametric Clustering
Bridged Clustering for Representation Learning: Semi-Supervised Sparse Bridging
Poisoning Attacks on LLMs Require a Near-constant Number of Poison Samples
An in-depth look at approximation via deep and narrow neural networks
Guided by the Experts: Provable Feature Learning Dynamic of Soft-Routed Mixture-of-Experts
A Broader View of Thompson Sampling
Discriminative Feature Feedback with General Teacher Classes
Test-Time Graph Search for Goal-Conditioned Reinforcement Learning
Dynamic Regret Bounds for Online Omniprediction with Long Term Constraints
MolGA: Molecular Graph Adaptation with Pre-trained 2D Graph Encoder
Enhancing Resilience for IoE: A Perspective of Networking-Level Safeguard
Layerwise Federated Learning for Heterogeneous Quantum Clients using Quorus
Milestone Determination for Autonomous Railway Operation
Neu-RadBERT for Enhanced Diagnosis of Brain Injuries and Conditions
Toward Uncertainty-Aware and Generalizable Neural Decoding for Quantum LDPC Codes
Developing a Sequential Deep Learning Pipeline to Model Alaskan Permafrost Thaw Under Climate Change
Beyond Static Knowledge Messengers: Towards Adaptive, Fair, and Scalable Federated Learning for Medical AI
A Mixed-Methods Analysis of Repression and Mobilization in Bangladesh's July Revolution Using Machine Learning and Statistical Modeling
Vision Transformer for Transient Noise Classification
General and Efficient Visual Goal-Conditioned Reinforcement Learning using Object-Agnostic Masks
Mass Conservation on Rails -- Rethinking Physics-Informed Learning of Ice Flow Vector Fields
Scalable deep fusion of spaceborne lidar and synthetic aperture radar for global forest structural complexity mapping
Conditional Denoising Diffusion Model-Based Robust MR Image Reconstruction from Highly Undersampled Data
Diffusion-Guided Renormalization of Neural Systems via Tensor Networks
A General Constructive Upper Bound on Shallow Neural Nets Complexity
Road Surface Condition Detection with Machine Learning using New York State Department of Transportation Camera Images and Weather Forecast Data
Online Matching via Reinforcement Learning: An Expert Policy Orchestration Strategy
BACHI: Boundary-Aware Symbolic Chord Recognition Through Masked Iterative Decoding on Pop and Classical Music
From Description to Detection: LLM based Extendable O-RAN Compliant Blind DoS Detection in 5G and Beyond
Cluster Paths: Navigating Interpretability in Neural Networks
From Acceleration to Saturation: Scaling Behavior of Bootstrapped Language Model Pretraining
Adapting Quantum Machine Learning for Energy Dissociation of Bonds
FEAorta: A Fully Automated Framework for Finite Element Analysis of the Aorta From 3D CT Images
Unsupervised Backdoor Detection and Mitigation for Spiking Neural Networks
A Comparative Analysis of Contextual Representation Flow in State-Space and Transformer Architectures
Q-Learning with Fine-Grained Gap-Dependent Regret
Fitzpatrick Thresholding for Skin Image Segmentation
Gaussian Equivalence for Self-Attention: Asymptotic Spectral Analysis of Attention Matrix
Latent Representation Learning in Heavy-Ion Collisions with MaskPoint Transformer
Differentially Private Synthetic Text Generation for Retrieval-Augmented Generation (RAG)
Quantum Computing Methods for Malware Detection
BlackboxNLP-2025 MIB Shared Task: Exploring Ensemble Strategies for Circuit Localization Methods
Efficient Discriminative Joint Encoders for Large Scale Vision-Language Reranking
Reconquering Bell sampling on qudits: stabilizer learning and testing, quantum pseudorandomness bounds, and more
Quantum Sparse Recovery and Quantum Orthogonal Matching Pursuit
Textual interpretation of transient image classifications from large language models
PyCFRL: A Python library for counterfactually fair offline reinforcement learning via sequential data preprocessing
Accelerating Sparse Ternary GEMM for Quantized LLM inference on Apple Silicon
Falsification-Driven Reinforcement Learning for Maritime Motion Planning
Relational Database Distillation: From Structured Tables to Condensed Graph Data
Root Cause Analysis of Outliers in Unknown Cyclic Graphs
Pseudo-MDPs: A Novel Framework for Efficiently Optimizing Last Revealer Seed Manipulations in Blockchains
Explaining Models under Multivariate Bernoulli Distribution via Hoeffding Decomposition
Diffusion-Augmented Reinforcement Learning for Robust Portfolio Optimization under Stress Scenarios
Active Control of Turbulent Airfoil Flows Using Adjoint-based Deep Learning
GNN-enhanced Traffic Anomaly Detection for Next-Generation SDN-Enabled Consumer Electronics
TRIM: Token-wise Attention-Derived Saliency for Data-Efficient Instruction Tuning
Spectral Graph Clustering under Differential Privacy: Balancing Privacy, Accuracy, and Efficiency
NurseLLM: The First Specialized Language Model for Nursing
Quantifying Data Contamination in Psychometric Evaluations of LLMs
Bayesian Portfolio Optimization by Predictive Synthesis
Split Conformal Classification with Unsupervised Calibration
A Multi-Agent Framework for Stateful Inference-Time Search
ELMUR: External Layer Memory with Update/Rewrite for Long-Horizon RL
TIGeR: Tool-Integrated Geometric Reasoning in Vision-Language Models for Robotics
Resolution scaling governs DINOv3 transfer performance in chest radiograph classification
HyPlan: Hybrid Learning-Assisted Planning Under Uncertainty for Safe Autonomous Driving
Language Lives in Sparse Dimensions: Toward Interpretable and Efficient Multilingual Control for Large Language Models
GenPilot: A Multi-Agent System for Test-Time Prompt Optimization in Image Generation
Where to Begin: Efficient Pretraining via Subnetwork Selection and Distillation
Benchmarking LLM Causal Reasoning with Scientifically Validated Relationships
LeMAJ (Legal LLM-as-a-Judge): Bridging Legal Reasoning and LLM Evaluation
On the false election between regulation and innovation. Ideas for regulation through the responsible use of artificial intelligence in research and education.[Spanish version]
Online Rubrics Elicitation from Pairwise Comparisons
GTCN-G: A Residual Graph-Temporal Fusion Network for Imbalanced Intrusion Detection (Preprint)
Evolutionary Profiles for Protein Fitness Prediction
AudioMarathon: A Comprehensive Benchmark for Long-Context Audio Understanding and Efficiency in Audio LLMs
Cocoon: A System Architecture for Differentially Private Training with Correlated Noises
MLE-Smith: Scaling MLE Tasks with Automated Multi-Agent Pipeline
h1: Bootstrapping LLMs to Reason over Longer Horizons via Reinforcement Learning
GyroSwin: 5D Surrogates for Gyrokinetic Plasma Turbulence Simulations
Vibe Checker: Aligning Code Evaluation with Human Preference
Artificial Hippocampus Networks for Efficient Long-Context Modeling
Inferring Capabilities from Task Performance with Bayesian Triangulation
Transparent and Coherent Procedural Mistake Detection
An Illusion of Progress? Assessing the Current State of Web Agents
Empirically evaluating commonsense intelligence in large language models with large-scale human judgments
Controlled Agentic Planning & Reasoning for Mechanism Synthesis
Functional Matching of Logic Subgraphs: Beyond Structural Isomorphism
Dyna-Think: Synergizing Reasoning, Acting, and World Model Simulation in AI Agents
KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality
Attacking the Spike: On the Transferability and Security of Spiking Neural Networks to Adversarial Examples
Multi-modal Segment Assemblage Network for Ad Video Editing with Importance-Coherence Reward
Is My Data in Your AI? Membership Inference Test (MINT) applied to Face Biometrics
Unlocking Dataset Distillation with Diffusion Models
ECLM: Entity Level Language Model for Spoken Language Understanding with Chain of Intent
Empowering LLMs with Pseudo-Untrimmed Videos for Audio-Visual Temporal Understanding
V2Xum-LLM: Cross-Modal Video Summarization with Temporal Prompt Instruction Tuning
A Deep Learning System for Rapid and Accurate Warning of Acute Aortic Syndrome on Non-contrast CT in China
Approximately Aligned Decoding
NAR-*ICP: Neural Execution of Classical ICP-based Pointcloud Registration Algorithms
Error Bounds for Physics-Informed Neural Networks in Fokker-Planck PDEs
SuffixDecoding: Extreme Speculative Decoding for Emerging AI Applications
VidComposition: Can MLLMs Analyze Compositions in Compiled Videos?
Machine Learning and Multi-source Remote Sensing in Forest Aboveground Biomass Estimation: A Review
Sustainable Self-evolution Adversarial Training
Evil twins are not that evil: Qualitative insights into machine-generated prompts
Towards a Multimodal Large Language Model with Pixel-Level Insight for Biomedicine
KunServe: Parameter-centric Memory Management for Efficient Memory Overloading Handling in LLM Serving
Tempo: Compiled Dynamic Deep Learning with Symbolic Dependence Graphs
FedAGHN: Personalized Federated Learning with Attentive Graph HyperNetworks
A Dual-Agent Adversarial Framework for Robust Generalization in Deep Reinforcement Learning
Achieving Hyperbolic-Like Expressiveness with Arbitrary Euclidean Regions: A New Approach to Hierarchical Embeddings
LLM Unlearning via Neural Activation Redirection
MM-PoisonRAG: Disrupting Multimodal RAG with Local and Global Poisoning Attacks
Lossy Neural Compression for Geospatial Analytics: A Review
Mind the (Belief) Gap: Group Identity in the World of LLMs
Improving Neutral Point-of-View Generation with Data- and Parameter-Efficient RL
Mitigating Cross-Modal Distraction and Ensuring Geometric Feasibility via Affordance-Guided and Self-Consistent MLLMs for Task Planning in Instruction-Following Manipulation
NdLinear: Preserving Multi-Dimensional Structure for Parameter-Efficient Neural Networks
Optimizing Breast Cancer Detection in Mammograms: A Comprehensive Study of Transfer Learning, Resolution Reduction, and Multi-View Classification
AerialVG: A Challenging Benchmark for Aerial Visual Grounding by Exploring Positional Relations
Weight Ensembling Improves Reasoning in Language Models
Efficient Flow Matching using Latent Variables
Generative Pre-trained Autoregressive Diffusion Transformer
MONAQ: Multi-Objective Neural Architecture Querying for Time-Series Analysis on Resource-Constrained Devices
AC-LoRA: (Almost) Training-Free Access Control-Aware Multi-Modal LLMs
AdaDim: Dimensionality Adaptation for SSL Representational Dynamics
MoRE-Brain: Routed Mixture of Experts for Interpretable and Generalizable Cross-Subject fMRI Visual Decoding
SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis
The Unreasonable Effectiveness of Model Merging for Cross-Lingual Transfer in LLMs
Performance of machine-learning-assisted Monte Carlo in sampling from simple statistical physics models
Exchangeability in Neural Network and its Application to Dynamic Pruning
CyberGym: Evaluating AI Agents' Real-World Cybersecurity Capabilities at Scale
Learning to Recover: Dynamic Reward Shaping with Wheel-Leg Coordination for Fallen Robots
KramaBench: A Benchmark for AI Systems on Data-to-Insight Pipelines over Data Lakes
AutoMind: Adaptive Knowledgeable Agent for Automated Data Science
Prefilled responses enhance zero-shot detection of AI-generated images
Context Matters! Relaxing Goals with LLMs for Feasible 3D Scene Planning
Understanding Software Engineering Agents: A Study of Thought-Action-Result Trajectories
Structure-Aware Compound-Protein Affinity Prediction via Graph Neural Network with Group Lasso Regularization
Enjoying Non-linearity in Multinomial Logistic Bandits
Token-based Audio Inpainting via Discrete Diffusion
Quantum Machine Learning in Multi-Qubit Phase-Space Part I: Foundations
Community-Centered Spatial Intelligence for Climate Adaptation at Nova Scotia's Eastern Shore
Intelligent Healthcare Imaging Platform: A VLM-Based Framework for Automated Medical Image Analysis and Clinical Report Generation
On knot detection via picture recognition
PIKAN: Physics-Inspired Kolmogorov-Arnold Networks for Explainable UAV Channel Modelling
Lagrangian neural ODEs: Measuring the existence of a Lagrangian with Helmholtz metrics
RareGraph-Synth: Knowledge-Guided Diffusion Models for Generating Privacy-Preserving Synthetic Patient Trajectories in Ultra-Rare Diseases
MCCE: A Framework for Multi-LLM Collaborative Co-Evolution
Reproducibility Study of "XRec: Large Language Models for Explainable Recommendation"
A Total Variation Regularized Framework for Epilepsy-Related MRI Image Segmentation
RVFL-X: A Novel Randomized Network Based on Complex Transformed Real-Valued Tabular Datasets
Surgeons Are Indian Males and Speech Therapists Are White Females: Auditing Biases in Vision-Language Models for Healthcare Professionals
Improving the Spatial Resolution of GONG Solar Images to GST Quality Using Deep Learning
SER-Diff: Synthetic Error Replay Diffusion for Incremental Brain Tumor Segmentation
Soft-Evidence Fused Graph Neural Network for Cancer Driver Gene Identification across Multi-View Biological Graphs
Traj-Transformer: Diffusion Models with Transformer for GPS Trajectory Generation
ChainMPQ: Interleaved Text-Image Reasoning Chains for Mitigating Relation Hallucinations
BlockGPT: Spatio-Temporal Modelling of Rainfall via Frame-Level Autoregression
Efficient High-Resolution Image Editing with Hallucination-Aware Loss and Adaptive Tiling
VeriEquivBench: An Equivalence Score for Ground-Truth-Free Evaluation of Formally Verifiable Code
RGBD Gaze Tracking Using Transformer for Feature Fusion
SDAR: A Synergistic Diffusion-AutoRegression Paradigm for Scalable Sequence Generation
Leveraging Large Language Models for Cybersecurity Risk Assessment -- A Case from Forestry Cyber-Physical Systems
Flexible Swarm Learning May Outpace Foundation Models in Essential Tasks
Asking For It: Question-Answering for Predicting Rule Infractions in Online Content Moderation
TransFIRA: Transfer Learning for Face Image Recognizability Assessment
Constrained Natural Language Action Planning for Resilient Embodied Systems
EverydayMMQA: A Multilingual and Multimodal Framework for Culturally Grounded Spoken Visual QA
Relational Transformer: Toward Zero-Shot Foundation Models for Relational Data
Monte Carlo Permutation Search
Protecting De-identified Documents from Search-based Linkage Attacks
Reward Model Perspectives: Whose Opinions Do Reward Models Reward?
Adaptive Protein Design Protocols and Middleware
Geometry-Aware Backdoor Attacks: Leveraging Curvature in Hyperbolic Embeddings
Context-Aware Inference via Performance Forecasting in Decentralized Learning Networks
A Survey on Agentic Security: Applications, Threats and Defenses
How NOT to benchmark your SITE metric: Beyond Static Leaderboards and Towards Realistic Evaluation
Evaluating Node-tree Interfaces for AI Explainability
Deep Generative Model for Human Mobility Behavior
Attention Sinks and Compression Valleys in LLMs are Two Sides of the Same Coin
Valid Stopping for LLM Generation via Empirical Dynamic Formal Lift
Webscale-RL: Automated Data Pipeline for Scaling RL Data to Pretraining Levels
ATLO-ML: Adaptive Time-Length Optimizer for Machine Learning -- Insights from Air Quality Forecasting
A Median Perspective on Unlabeled Data for Out-of-Distribution Detection
LogSTOP: Temporal Scores over Prediction Sequences for Matching and Retrieval
Visualizing Multimodality in Combinatorial Search Landscapes
CLAQS: Compact Learnable All-Quantum Token Mixer with Shared-ansatz for Text Classification
Scalable Policy-Based RL Algorithms for POMDPs
Incoherence in goal-conditioned autoregressive models
The Markovian Thinker
The Algebra of Meaning: Why Machines Need Montague More Than Moore's Law
HSNet: Heterogeneous Subgraph Network for Single Image Super-resolution
The Framework That Survives Bad Models: Human-AI Collaboration For Clinical Trials
SDQM: Synthetic Data Quality Metric for Object Detection Dataset Evaluation
Reading Between the Lines: Towards Reliable Black-box LLM Fingerprinting via Zeroth-order Gradient Estimation
AI-Driven Forecasting and Monitoring of Urban Water System
Control-Augmented Autoregressive Diffusion for Data Assimilation
StaR-KVQA: Structured Reasoning Traces for Implicit-Knowledge Visual Question Answering
Distilling Lightweight Language Models for C/C++ Vulnerabilities
The False Promise of Zero-Shot Super-Resolution in Machine-Learned Operators
Local Reinforcement Learning with Action-Conditioned Root Mean Squared Q-Functions
Delay Independent Safe Control with Neural Networks: Positive Lur'e Certificates for Risk Aware Autonomy
Automated Neural Architecture Design for Industrial Defect Detection
Heptapod: Language Modeling on Visual Signals
Incremental Summarization for Customer Support via Progressive Note-Taking and Agent Feedback
Semantic Segmentation Algorithm Based on Light Field and LiDAR Fusion
Learning to Rewrite Prompts for Bootstrapping LLMs on Downstream Tasks
AISysRev -- LLM-based Tool for Title-abstract Screening
Dual Goal Representations
LLM Company Policies and Policy Implications in Software Organizations
Scaling LLM Multi-turn RL with End-to-end Summarization-based Context Management
Are LLMs Reliable Rankers? Rank Manipulation via Two-Stage Token Optimization
Evaluating LLMs for Historical Document OCR: A Methodological Framework for Digital Humanities
Modeling COVID-19 Dynamics in German States Using Physics-Informed Neural Networks
Foundations of LLM Knowledge Materialization: Termination, Reproducibility, Robustness
Extreme Amodal Face Detection
FURINA: A Fully Customizable Role-Playing Benchmark via Scalable Multi-Agent Collaboration Pipeline
Recurrence-Complete Frame-based Action Models
CNN-TFT explained by SHAP with multi-head attention weights for time series forecasting
SID: Multi-LLM Debate Driven by Self Signals
OpenJAI-v1.0: An Open Thai Large Language Model
Enhancing Bankruptcy Prediction of Banks through Advanced Machine Learning Techniques: An Innovative Approach and Analysis
Explaining raw data complexity to improve satellite onboard processing
Towards Generalization of Graph Neural Networks for AC Optimal Power Flow
Multi-hop Deep Joint Source-Channel Coding with Deep Hash Distillation for Semantically Aligned Image Retrieval
MoRE-GNN: Multi-omics Data Integration with a Heterogeneous Graph Autoencoder
Multi-Dimensional Autoscaling of Stream Processing Services on Edge Devices
M3Retrieve: Benchmarking Multimodal Retrieval for Medicine
Angular Constraint Embedding via SpherePair Loss for Constrained Clustering
Emotionally Vulnerable Subtype of Internet Gaming Disorder: Measuring and Exploring the Pathology of Problematic Generative AI Use
DecompGAIL: Learning Realistic Traffic Behaviors with Decomposed Multi-Agent Generative Adversarial Imitation Learning
LongRM: Revealing and Unlocking the Context Boundary of Reward Modeling
Bayesian Nonparametric Dynamical Clustering of Time Series
Expressive and Scalable Quantum Fusion for Multimodal Learning
Grouped Differential Attention
Open ASR Leaderboard: Towards Reproducible and Transparent Multilingual and Long-Form Speech Recognition Evaluation
EDUMATH: Generating Standards-aligned Educational Math Word Problems
Generating Surface for Text-to-3D using 2D Gaussian Splatting
Learning Global Representation from Queries for Vectorized HD Map Construction
VelLMes: A high-interaction AI-based deception framework
The Limits of Goal-Setting Theory in LLM-Driven Assessment
Pragyaan: Designing and Curating High-Quality Cultural Post-Training Datasets for Indian Languages
Native Hybrid Attention for Efficient Sequence Modeling
Federated Unlearning in the Wild: Rethinking Fairness and Data Discrepancy
Mining the Mind: What 100M Beliefs Reveal About Frontier LLM Knowledge
Unified Molecule Pre-training with Flexible 2D and 3D Modalities: Single and Paired Modality Integration
Search-R3: Unifying Reasoning and Embedding Generation in Large Language Models
Introspection in Learned Semantic Scene Graph Localisation
LuxInstruct: A Cross-Lingual Instruction Tuning Dataset For Luxembourgish
Vision-Language-Action Models for Robotics: A Review Towards Real-World Applications
HTMformer: Hybrid Time and Multivariate Transformer for Time Series Forecasting
Generative World Modelling for Humanoids: 1X World Model Challenge Technical Report
Opt-ICL at LeWiDi-2025: Maximizing In-Context Signal from Rater Examples via Meta-Learning
Graph Conditioned Diffusion for Controllable Histopathology Image Generation
A Digital Twin Framework for Metamorphic Testing of Autonomous Driving Systems Using Generative Model
TrackVLA++: Unleashing Reasoning and Memory Capabilities in VLA Models for Embodied Visual Tracking
Comparing human and language models sentence processing difficulties on complex structures
AlphaApollo: Orchestrating Foundation Models and Professional Tools into a Self-Evolving System for Deep Agentic Reasoning
Bridging Reasoning to Learning: Unmasking Illusions using Complexity Out of Distribution Generalization
BuilderBench -- A benchmark for generalist agents
Requirements for Game-Based Learning Design Framework for Information System Integration in the Context of Post-Merger Integration
Belief-Calibrated Multi-Agent Consensus Seeking for Complex NLP Tasks
Off-Trajectory Reasoning: Can LLMs Collaborate on Reasoning Trajectory?
Flavonoid Fusion: Creating a Knowledge Graph to Unveil the Interplay Between Food and Health
PuzzlePlex: Benchmarking Foundation Models on Reasoning and Planning with Puzzles
Beneficial Reasoning Behaviors in Agentic Search and Effective Post-training to Obtain Them
Auto-Prompt Ensemble for LLM Judge
WebDART: Dynamic Decomposition and Re-planning for Complex Web Tasks
Fine-Grained Emotion Recognition via In-Context Learning
Agent-in-the-Loop: A Data Flywheel for Continuous Improvement in LLM-based Customer Support
Inefficiencies of Meta Agents for Agent Design
MultiCNKG: Integrating Cognitive Neuroscience, Gene, and Disease Knowledge Graphs Using Large Language Models
Verifying Memoryless Sequential Decision-making of Large Language Models
Evolving and Executing Research Plans via Double-Loop Multi-Agent Collaboration
Autoformalizer with Tool Feedback
TGPR: Tree-Guided Policy Refinement for Robust Self-Debugging of LLMs
LLM-Assisted Modeling of Semantic Web-Enabled Multi-Agents Systems with AJAN
Revisiting the Uniform Information Density Hypothesis in LLM Reasoning Traces
Tool-Augmented Policy Optimization: Synergizing Reasoning and Adaptive Tool Use with Reinforcement Learning
Prompt Optimization Across Multiple Agents for Representing Diverse Human Populations
Inductive Learning for Possibilistic Logic Programs Under Stable Models
VRPAgent: LLM-Driven Discovery of Heuristic Operators for Vehicle Routing Problems
The Cognitive Bandwidth Bottleneck: Shifting Long-Horizon Agent from Planning with Actions to Planning with Schemas
The Contingencies of Physical Embodiment Allow for Open-Endedness and Care
Integrating Domain Knowledge into Process Discovery Using Large Language Models
NewtonBench: Benchmarking Generalizable Scientific Law Discovery in LLM Agents
Multi-Objective Multi-Agent Path Finding with Lexicographic Cost Preferences
Agentic generative AI for media content discovery at the national football league
DeepXPalm: Tilt and Position Rendering using Palm-worn Haptic Display and CNN-based Tactile Pattern Recognition
TiltXter: CNN-based Electro-tactile Rendering of Tilt Angle for Telemanipulation of Pasteur Pipettes
A Multimodal GUI Architecture for Interfacing with LLM-Based Conversational Assistants
Exploring Human-AI Collaboration Using Mental Models of Early Adopters of Multi-Agent Generative AI Tools
Generalized Multi-agent Social Simulation Framework
Stacked Regression using Off-the-shelf, Stimulus-tuned and Fine-tuned Neural Networks for Predicting fMRI Brain Responses to Movies (Algonauts 2025 Report)
Uncertainty Quantification In Surface Landmines and UXO Classification Using MC Dropout
Knowledge Graph-Guided Multi-Agent Distillation for Reliable Industrial Question Answering with Datasets
Transparent Reference-free Automated Evaluation of Open-Ended User Survey Responses
CoT Referring: Improving Referring Expression Tasks with Grounded Reasoning
Evaluating Embedding Frameworks for Scientific Domain
DynBenchmark: Customizable Ground Truths to Benchmark Community Detection and Tracking in Temporal Networks
TRepLiNa: Layer-wise CKA+REPINA Alignment Improves Low-Resource Machine Translation in Aya-23 8B
Scalable multilingual PII annotation for responsible AI in LLMs
Dream2Image : An Open Multimodal EEG Dataset for Decoding and Visualizing Dreams with Artificial Intelligence
LLM-Driven Rubric-Based Assessment of Algebraic Competence in Multi-Stage Block Coding Tasks with Design and Field Evaluation
Ensemble Deep Learning and LLM-Assisted Reporting for Automated Skin Lesion Diagnosis
Prakriti200: A Questionnaire-Based Dataset of 200 Ayurvedic Prakriti Assessments
Dual-stage and Lightweight Patient Chart Summarization for Emergency Physicians
Language models for longitudinal analysis of abusive content in Billboard Music Charts

Research Sources: 650 | Generated: 10/10/2025