AI RESEARCH PAPERS & ACADEMIC SOURCES
- RAISE: A self-driving laboratory for interfacial property formulation discovery
- Safe Obstacle-Free Guidance of Space Manipulators in Debris Removal Missions via Deep Reinforcement Learning
- Assist-As-Needed: Adaptive Multimodal Robotic Assistance for Medication Management in Dementia Care
- RLinf-VLA: A Unified and Efficient Framework for VLA+RL Training
- SanDRA: Safe Large-Language-Model-Based Decision Making for Automated Vehicles Using Reachability Analysis
- Distributed 3D Source Seeking via SO(3) Geometric Control of Robot Swarms
- Tailoring materials into kirigami robots
- Temporal-Prior-Guided View Planning for Periodic 3D Plant Reconstruction
- Diffusing Trajectory Optimization Problems for Recovery During Multi-Finger Manipulation
- Bring the Apple, Not the Sofa: Impact of Irrelevant Context in Embodied AI Commands on VLA Models
- Sampling Strategies for Robust Universal Quadrupedal Locomotion Policies
- DPL: Depth-only Perceptive Humanoid Locomotion via Realistic Depth Synthesis and Cross-Attention Terrain Reconstruction
- A Narwhal-Inspired Sensing-to-Control Framework for Small Fixed-Wing Aircraft
- COMPAct: Computational Optimization and Automated Modular design of Planetary Actuators
- Three-dimensional Integrated Guidance and Control for Leader-Follower Flexible Formation of Fixed Wing UAVs
- Terrain-Aided Navigation Using a Point Cloud Measurement Sensor
- Artists' Views on Robotics Involvement in Painting Productions
- M^3RS: Multi-robot, Multi-objective, and Multi-mode Routing and Scheduling
- EffiTune: Diagnosing and Mitigating Training Inefficiency for Parameter Tuner in Robot Navigation System
- P2 Explore: Efficient Exploration in Unknown Cluttered Environment with Floor Plan Prediction
- Generating and Optimizing Topologically Distinct Guesses for Mobile Manipulator Path Planning with Path Constraints
- Diffusion Trajectory-guided Policy for Long-horizon Robot Manipulation
- Development of a magnetorheological hand exoskeleton featuring a high force-to-power ratio for enhanced grip endurance
- Control of Humanoid Robots with Parallel Mechanisms using Differential Actuation Models
- Interleave-VLA: Enhancing Robot Manipulation with Interleaved Image-Text Instructions
- Touch Speaks, Sound Feels: A Multimodal Approach to Affective and Social Touch from Robots to Humans
- UltraHiT: A Hierarchical Transformer Architecture for Generalizable Internal Carotid Artery Robotic Ultrasonography
- BIM Informed Visual SLAM for Construction Monitoring
- Inducing State Anxiety in LLM Agents Reproduces Human-Like Biases in Consumer Decision-Making
- "Grillz on a hijabi": Intersectional Identities in Fostering Critical AI Literacy
- Code Semantic Zooming
- Back to the Future Museum -- Speculative Design for Virtual Citizen-Curated Museums
- AI Eyes on the Road: Cross-Cultural Perspectives on Traffic Surveillance
- A Meat-Summer Night's Dream: A Tangible Design Fiction Exploration of Eating Biohybrid Flying Robots
- Examining Solidarity Against AI-Enabled Surveillance at the Intersection of Workplace and Carceral Realities
- PriorWeaver: Prior Elicitation via Iterative Dataset Construction
- RAVEN: Realtime Accessibility in Virtual ENvironments for Blind and Low-Vision People
- Investigating Students' Preferences for AI Roles in Mathematical Modelling: Evidence from a Randomized Controlled Trial
- "It feels like hard work trying to talk to it": Understanding Older Adults' Experiences of Encountering and Repairing Conversational Breakdowns with AI Systems
- "Sometimes You Need Facts, and Sometimes a Hug": Understanding Older Adults' Preferences for Explanations in LLM-Based Conversational AI Systems
- Lonely Individuals Show Distinct Patterns of Social Media Engagement
- Am I Productive? Exploring the Experience of Remote Workers with Task Management Tools
- Prototyping Multimodal GenAI Real-Time Agents with Counterfactual Replays and Hybrid Wizard-of-Oz
- The Feature Understandability Scale for Human-Centred Explainable AI: Assessing Tabular Feature Importance
- AI for Abolition? A Participatory Design Approach
- Exploring the Feasibility of Gaze-Based Navigation Across Path Types
- Regulating Social Media: Surveying the Impact of Nepali Government's TikTok Ban
- A Review of 10 Years of ProtoSpace: Spacecraft CAD Visualization in Collaborative Augmented Reality
- The Stage Comes to You: A Real-Time Tele-Immersive System with 3D Point Clouds and Vibrotactile Feedback
- From Neural Sensing to Stimulation: An Interdisciplinary Roadmap for Neurotechnology
- A risk model and analysis method for the psychological safety of human and autonomous vehicles interaction
- Desirable Unfamiliarity: Insights from Eye Movements on Engagement and Readability of Dictation Interfaces
- Geometric Queries on Closed Implicit Surfaces for Walk on Stars
- SAR-GS: Gaussian Splatting based SAR Images Rendering and Target Reconstruction
- LangSplatV2: High-dimensional 3D Language Gaussian Splatting with 450+ FPS
- LLM-Powered Nuanced Video Attribute Annotation for Enhanced Recommendations
- Can We Hide Machines in the Crowd? Quantifying Equivalence in LLM-in-the-loop Annotation Tasks
- Reproducing and Extending Causal Insights Into Term Frequency Computation in Neural Rankers
- Ethical AI prompt recommendations in large language models using collaborative filtering
- Reasoning-enhanced Query Understanding through Decomposition and Interpretation
- Automated Repeatable Adversary Threat Emulation with Effects Language (EL)
- Breaking Precision Time: OS Vulnerability Exploits Against IEEE 1588
- Proofs of No Intrusion
- BATTLE for Bitcoin: Capital-Efficient Optimistic Bridges with Large Committees
- SpyChain: Multi-Vector Supply Chain Attacks on Small Satellite Systems
- Auto-Stega: An Agent-Driven System for Lifelong Strategy Evolution in LLM-Based Text Steganography
- Code Agent can be an End-to-end System Hacker: Benchmarking Real-world Threats of Computer-use Agent
- I Can't Patch My OT Systems! A Look at CISA's KEVC Workarounds & Mitigations for OT
- A multi-layered embedded intrusion detection framework for programmable logic controllers
- Exposing LLM User Privacy via Traffic Fingerprint Analysis: A Study of Privacy Risks in LLM Agent Interactions
- Security-Robustness Trade-offs in Diffusion Steganography: A Comparative Analysis of Pixel-Space and VAE-Based Architectures
- Benchmarking Fake Voice Detection in the Fake Voice Generation Arms Race
- Representation Gap of the Motzkin Monoid
- The Knowledge Complexity of Quantum Problems
- Friend or Foe Inside? Exploring In-Process Isolation to Maintain Memory Safety for Unsafe Rust
- Streamlining Plug-and-Charge Authorization for Electric Vehicles with OAuth2 and OIDC
- WAFFLED: Exploiting Parsing Discrepancies to Bypass Web Application Firewalls
- On Univariate Sumcheck
- RevealNet: Distributed Traffic Correlation for Attack Attribution on Programmable Networks
- Security through the Eyes of AI: How Visualization is Shaping Malware Detection
- Securing WiFi Fingerprint-based Indoor Localization Systems from Malicious Access Points
- Obfuscated Quantum and Post-Quantum Cryptography
- jmstate, a Flexible Python Package for Multi-State Joint Modeling
- Efficient reductions from a Gaussian source with applications to statistical-computational tradeoffs
- Vi-TacMan: Articulated Object Manipulation via Vision and Touch
- A Formal gatekeeper Framework for Safe Dual Control with Active Exploration
- What You Don't Know Can Hurt You: How Well do Latent Safety Filters Understand Partially Observable Safety Constraints?
- StyleKeeper: Prevent Content Leakage using Negative Visual Query Guidance
- Lattice-allocated Real-time Line Segment Feature Detection and Tracking Using Only an Event-based Camera
- Continual Action Quality Assessment via Adaptive Manifold-Aligned Graph Regularization
- Online Generic Event Boundary Detection
- HARP-NeXt: High-Speed and Accurate Range-Point Fusion Network for 3D LiDAR Semantic Segmentation
- Lung Infection Severity Prediction Using Transformers with Conditional TransMix Augmentation and Cross-Attention
- Label-frugal satellite image change detection with generative virtual exemplar learning
- IAR2: Improving Autoregressive Visual Generation with Semantic-Detail Associated Token Prediction
- OBJVanish: Physically Realizable Text-to-3D Adv. Generation of LiDAR-Invisible Objects
- Addressing the ID-Matching Challenge in Long Video Captioning
- No MoCap Needed: Post-Training Motion Diffusion Models with Reinforcement Learning using Only Textual Prompts
- Bayesian Modelling of Multi-Year Crop Type Classification Using Deep Neural Networks and Hidden Markov Models
- U-Bench: A Comprehensive Understanding of U-Net through 100-Variant Benchmarking
- Concept Retrieval -- What and How?
- DADO: A Depth-Attention framework for Object Discovery
- Enhancing Concept Localization in CLIP-based Concept Bottleneck Models
- MoRe: Monocular Geometry Refinement via Graph Optimization for Cross-View Consistency
- Validation of Various Normalization Methods for Brain Tumor Segmentation: Can Federated Learning Overcome This Heterogeneity?
- Few-Shot Adaptation Benchmark for Remote Sensing Vision-Language Models
- Are We Using the Right Benchmark: An Evaluation Framework for Visual Token Compression Methods
- MV-Performer: Taming Video Diffusion Model for Faithful and Synchronized Multi-view Performer Synthesis
- EigenScore: OOD Detection using Covariance in Diffusion Models
- TalkCuts: A Large-Scale Dataset for Multi-Shot Human Speech Video Generation
- Evaluating Fundus-Specific Foundation Models for Diabetic Macular Edema Detection
- SpecGuard: Spectral Projection-based Advanced Invisible Watermarking
- MATRIX: Mask Track Alignment for Interaction-aware Video Generation
- WristWorld: Generating Wrist-Views via 4D World Models for Robotic Manipulation
- Pixel-Perfect Depth with Semantics-Prompted Diffusion Transformers
- Quantum-enhanced Computer Vision: Going Beyond Classical Algorithms
- Temporal Prompting Matters: Rethinking Referring Video Object Segmentation
- Active Next-Best-View Optimization for Risk-Averse Path Planning
- Real-Time Glass Detection and Reprojection using Sensor Fusion Onboard Aerial Robots
- UniFField: A Generalizable Unified Neural Feature Field for Visual, Semantic, and Spatial Uncertainties in Any Scene
- Bionetta: Efficient Client-Side Zero-Knowledge Machine Learning Proving
- Capture and Interact: Rapid 3D Object Acquisition and Rendering with Gaussian Splatting in Unity
- LoDisc: Learning Global-Local Discriminative Features for Self-Supervised Fine-Grained Visual Recognition
- Decomposed Global Optimization for Robust Point Matching with Low-Dimensional Branching
- Guardians of Image Quality: Benchmarking Defenses Against Adversarial Attacks on Image Quality Metrics
- CaRDiff: Video Salient Object Ranking Chain of Thought Reasoning for Saliency Prediction with Diffusion
- Taming Diffusion Models for Image Restoration: A Review
- Erasing More Than Intended? How Concept Erasure Degrades the Generation of Non-Target Concepts
- Polyp-Gen: Realistic and Diverse Polyp Image Generation for Endoscopic Dataset Expansion
- SubGrapher: Visual Fingerprinting of Chemical Structures
- MMPerspective: Do MLLMs Understand Perspective? A Comprehensive Benchmark for Perspective Perception, Reasoning, and Robustness
- Fully Spiking Neural Networks for Unified Frame-Event Object Tracking
- Uncertainty-Aware Remaining Lifespan Prediction from Images
- WAFT: Warping-Alone Field Transforms for Optical Flow
- BIM-Constrained Optimization for Accurate Localization and Deviation Correction in Construction Monitoring
- OpenStaxQA: A multilingual dataset based on open-source college textbooks
- A Comprehensive Survey of Hallucination in Large Language Models: Causes, Detection, and Mitigation
- Type and Complexity Signals in Multilingual Question Representations
- LLM Bias Detection and Mitigation through the Lens of Desired Distributions
- EVALUESTEER: Measuring Reward Model Steerability Towards Values and Preference
- Semantic Regexes: Auto-Interpreting LLM Features with a Structured Language
- Controllable Stylistic Text Generation with Train-Time Attribute-Regularized Diffusion
- Instructional Goal-Aligned Question Generation for Student Evaluation in Virtual Lab Settings: How Closely Do LLMs Actually Align?
- FinLFQA: Evaluating Attributed Text Generation of LLMs in Financial Long-Form Question Answering
- Bridging Discourse Treebanks with a Unified Rhetorical Structure Parser
- MathRobust-LV: Evaluation of Large Language Models' Robustness to Linguistic Variations in Mathematical Reasoning
- Linguistically Informed Tokenization Improves ASR for Underresourced Languages
- Test-Time Scaling of Reasoning Models for Machine Translation
- Flipping the Dialogue: Training and Evaluating User Language Models
- TinyScientist: An Interactive, Extensible, and Controllable Framework for Building Research Agents
- Do Internal Layers of LLMs Reveal Patterns for Jailbreak Detection?
- Aligning Large Language Models via Fully Self-Synthetic Data
- ToolMem: Enhancing Multimodal Agents with Learnable Tool Capability Memory
- PIKA: Expert-Level Synthetic Datasets for Post-Training Alignment from Scratch
- How Language Models Conflate Logical Validity with Plausibility: A Representational Analysis of Content Effects
- PTEB: Towards Robust Text Embedding Evaluation via Stochastic Paraphrasing at Evaluation Time with LLMs
- AWM: Accurate Weight-Matrix Fingerprint for Large Language Models
- TWIST: Training-free and Label-free Short Text Clustering through Iterative Vector Updating with LLMs
- A Formal Framework for Fluency-based Multi-Reference Evaluation in Grammatical Error Correction
- Gold-Switch: Training-Free Superposition of Slow- and Fast- Thinking LLMs
- Adaptive LLM-Symbolic Reasoning via Dynamic Logical Solver Composition
- Overview of the Plagiarism Detection Task at PAN 2025
- Adaptive Tool Generation with Models as Tools and Reinforcement Learning
- Mid-Training of Large Language Models: A Survey
- GAMBIT+: A Challenge Set for Evaluating Gender Bias in Machine Translation Quality Estimation Metrics
- Unlocking Latent Discourse Translation in LLMs Through Quality-Aware Decoding
- $\lambda$-GRPO: Unifying the GRPO Frameworks with Learnable Token Preferences
- MeXtract: Light-Weight Metadata Extraction from Scientific Papers
- SHANKS: Simultaneous Hearing and Thinking for Spoken Language Models
- Probing Social Identity Bias in Chinese LLMs with Gendered Pronouns and Social Groups
- Towards Reliable Retrieval in RAG Systems for Large Legal Datasets
- Beyond Monolingual Assumptions: A Survey of Code-Switched NLP in the Era of Large Language Models
- Does Local News Stay Local?: Online Content Shifts in Sinclair-Acquired Stations
- Revisiting Metric Reliability for Fine-grained Evaluation of Machine Translation and Summarization in Indian Languages
- Accelerating Diffusion LLM Inference via Local Determinism Propagation
- All Claims Are Equal, but Some Claims Are More Equal Than Others: Importance-Sensitive Factuality Evaluation of LLM Generations
- Making Machines Sound Sarcastic: LLM-Enhanced and Retrieval-Guided Sarcastic Speech Synthesis
- TALENT: Table VQA via Augmented Language-Enhanced Natural-text Transcription
- Reasoning for Hierarchical Text Classification: The Case of Patents
- More Data or Better Data? A Critical Analysis of Data Selection and Synthesis for Mathematical Reasoning
- CARPAS: Towards Content-Aware Refinement of Provided Aspects for Summarization in Large Language Models
- Biasless Language Models Learn Unnaturally: How LLMs Fail to Distinguish the Possible from the Impossible
- Sunflower: A New Approach To Expanding Coverage of African Languages in Large Language Models
- How much speech data is necessary for ASR in African languages? An evaluation of data scaling in Kinyarwanda and Kikuyu
- Customer-R1: Personalized Simulation of Human Behaviors via RL-based LLM Agent in Online Shopping
- LAD-RAG: Layout-aware Dynamic RAG for Visually-Rich Document Understanding
- When Benchmarks Age: Temporal Misalignment through Large Language Model Factuality Evaluation
- Red-Bandit: Test-Time Adaptation for LLM Red-Teaming via Bandit-Guided LoRA Experts
- Don't Adapt Small Language Models for Tools; Adapt Tool Schemas to the Models
- Think Natively: Unlocking Multilingual Reasoning with Consistency-Enhanced Reinforcement Learning
- Agent Bain vs. Agent McKinsey: A New Text-to-SQL Benchmark for the Business Domain
- CML-Bench: A Framework for Evaluating and Enhancing LLM-Powered Movie Scripts Generation
- XLSR-Kanformer: A KAN-Integrated model for Synthetic Speech Detection
- GPT-5 Model Corrected GPT-4V's Chart Reading Errors, Not Prompting
- Exposing Citation Vulnerabilities in Generative Engines
- Crossing Domains without Labels: Distant Supervision for Term Extraction
- RedTWIZ: Diverse LLM Red Teaming via Adaptive Attack Planning
- Machines in the Crowd? Measuring the Footprint of Machine-Generated Text on Reddit
- LatteReview: A Multi-Agent Framework for Systematic Review Automation Using Large Language Models
- Benchmarking Gaslighting Negation Attacks Against Multimodal Large Language Models
- Blessing of Multilinguality: A Systematic Analysis of Multilingual In-Context Learning
- Diagnosing Moral Reasoning Acquisition in Language Models: Pragmatics and Generalization
- Speculative Decoding and Beyond: An In-Depth Survey of Techniques
- MCTS-RAG: Enhancing Retrieval-Augmented Generation with Monte Carlo Tree Search
- Rethinking Multilingual Continual Pretraining: Data Mixing for Adapting LLMs Across Languages and Resources
- GlotEval: A Test Suite for Massively Multilingual Evaluation of Large Language Models
- Geometry of Semantics in Next-Token Prediction: How Optimization Implicitly Organizes Linguistic Representations
- AutoRev: Multi-Modal Graph Retrieval for Automated Peer-Review Generation
- HopWeaver: Cross-Document Synthesis of High-Quality and Authentic Multi-Hop Questions
- FlowKV: Enhancing Multi-Turn Conversational Coherence in LLMs via Isolated Key-Value Cache Management
- Do RAG Systems Really Suffer From Positional Bias?
- MIST: Towards Multi-dimensional Implicit BiaS Evaluation of LLMs via Theory of Mind
- PredGen: Accelerated Inference of Large Language Models through Input-Time Speculation for Real-Time Speech Interaction
- Do LLMs Overthink Basic Math Reasoning? Benchmarking the Accuracy-Efficiency Tradeoff in Language Models
- LLMVA-GEBC: Large Language Model with Video Adapter for Generic Event Boundary Captioning
- LaunchpadGPT: Language Model as Music Visualization Designer on Launchpad
- Emilia: A Large-Scale, Extensive, Multilingual, and Diverse Dataset for Speech Generation
- Enhancing Few-shot Keyword Spotting Performance through Pre-Trained Self-supervised Speech Models
- User to Video: A Model for Spammer Detection Inspired by Video Classification Technology
- multimodars: A Rust-powered toolkit for multi-modality cardiac image fusion and registration
- Does Physics Knowledge Emerge in Frontier Models?
- Enhanced Self-Distillation Framework for Efficient Spiking Neural Network Training
- Lumina-DiMOO: An Omni Diffusion Large Language Model for Multi-Modal Generation and Understanding
- TDiff: Thermal Plug-And-Play Prior with Patch-Based Diffusion
- SIGMA-GEN: Structure and Identity Guided Multi-subject Assembly for Image Generation
- Superpixel Integrated Grids for Fast Image Segmentation
- Text2Interact: High-Fidelity and Diverse Text-to-Two-Person Interaction Generation
- From Captions to Keyframes: Efficient Video Summarization via Caption- and Context-Aware Frame Scoring
- Limited-Angle Tomography Reconstruction via Projector Guided 3D Diffusion
- VUGEN: Visual Understanding priors for GENeration
- Through the Perspective of LiDAR: A Feature-Enriched and Uncertainty-Aware Annotation Pipeline for Terrestrial Point Cloud Segmentation
- Improving Artifact Robustness for CT Deep Learning Models Without Labeled Artifact Images via Domain Adaptation
- Ming-UniVision: Joint Image Understanding and Generation with a Unified Continuous Tokenizer
- Adaptive Stain Normalization for Cross-Domain Medical Histology
- AIM 2025 Challenge on Real-World RAW Image Denoising
- Self-supervised Physics-guided Model with Implicit Representation Regularization for Fast MRI Reconstruction
- A Bridge from Audio to Video: Phoneme-Viseme Alignment Allows Every Face to Speak Multiple Languages
- MSITrack: A Challenging Benchmark for Multispectral Single Object Tracking
- DreamOmni2: Multimodal Instruction-based Editing and Generation
- SCas4D: Structural Cascaded Optimization for Boosting Persistent 4D Novel View Synthesis
- DeRainMamba: A Frequency-Aware State Space Model with Detail Enhancement for Image Deraining
- OBS-Diff: Accurate Pruning For Diffusion Models in One-Shot
- Transforming Noise Distributions with Histogram Matching: Towards a Single Denoiser for All
- A deep multiple instance learning approach based on coarse labels for high-resolution land-cover mapping
- TTRV: Test-Time Reinforcement Learning for Vision Language Models
- VA-Adapter: Adapting Ultrasound Foundation Model to Echocardiography Probe Guidance
- Covert Quantum Learning: Privately and Verifiably Learning from Quantum Data
- Accelerating Inference for Multilayer Neural Networks with Quantum Computers
- Hybrid Reinforcement: When Reward Is Sparse, It's Better to Be Dense
- On the Convergence of Moral Self-Correction in Large Language Models
- Maximising the Utility of Validation Sets for Imbalanced Noisy-label Meta-learning
- Differential Privacy for Adaptive Weight Aggregation in Federated Tumor Segmentation
- Domain Generalization by Rejecting Extreme Augmentations
- Generalizable Physics-Informed Learning for Stochastic Safety-Critical Systems
- Want to train KANS at scale? Now UKAN!
- Dynamic Learning Rate for Deep Reinforcement Learning: A Bandit Approach
- Reinforcement Learning for Dynamic Memory Allocation
- Quantum Rationale-Aware Graph Contrastive Learning for Jet Discrimination
- VICON: Vision In-Context Operator Networks for Multi-Physics Fluid Dynamics Prediction
- Contrastive Graph Condensation: Advancing Data Versatility through Self-Supervised Learning
- DPGIIL: Dirichlet Process-Deep Generative Model-Integrated Incremental Learning for Clustering in Transmissibility-based Online Structural Anomaly Detection
- Towards the Worst-case Robustness of Large Language Models
- Denoising Score Matching with Random Features: Insights on Diffusion Models from Precise Learning Curves
- A Differentiable Alignment Framework for Sequence-to-Sequence Modeling via Optimal Transport
- A Novel Collaborative Framework for Efficient Synchronization in Split Federated Learning over Wireless Networks
- Nonparametric Bellman Mappings for Value Iteration in Distributed Reinforcement Learning
- Unveiling the Basin-Like Loss Landscape in Large Language Models
- HoPE: Hybrid of Position Embedding for Long Context Vision-Language Models
- Dual Natural Gradient Descent for Scalable Training of Physics-Informed Neural Networks
- Learning where to learn: Training data distribution optimization for scientific machine learning
- Inference-Time Scaling of Discrete Diffusion Models via Importance Weighting and Optimal Proposal Design
- AMBER: Adaptive Mesh Generation by Iterative Mesh Resolution Prediction
- AbsoluteNet: A Deep Learning Neural Network to Classify Cerebral Hemodynamic Responses of Auditory Processing
- Adversarial Surrogate Risk Bounds for Binary Classification
- Auto-Compressing Networks
- On the necessity of adaptive regularisation: Optimal anytime online learning on $\boldsymbol{\ell_p}$-balls
- P3D: Scalable Neural Surrogates for High-Resolution 3D Physics Simulations with Global Context
- Spatiotemporal Tile-based Attention-guided LSTMs for Traffic Video Prediction
- An Empirical Analysis of the Laplace and Neural Tangent Kernels
- Train-Free Segmentation in MRI with Cubical Persistent Homology
- Testing Support Size More Efficiently Than Learning Histograms
- 2 OLMo 2 Furious
- Automating RT Planning at Scale: High Quality Data For AI Training
- GreedyPixel: Fine-Grained Black-Box Adversarial Attack Via Greedy Algorithm
- Bit-Level Discrete Diffusion with Markov Probabilistic Models: An Improved Framework with Sharp Convergence Bounds under Minimal Assumptions
- Last-iterate Convergence for Symmetric, General-sum, $2 \times 2$ Games Under The Exponential Weights Dynamic
- Jailbreak Attack Initializations as Extractors of Compliance Directions
- DiffMI: Breaking Face Recognition Privacy via Diffusion-Driven Training-Free Model Inversion
- Is Supervised Learning Really That Different from Unsupervised?
- TokenWeave: Efficient Compute-Communication Overlap for Distributed LLM Inference
- Guiding Giants: Lightweight Controllers for Weighted Activation Steering in LLMs
- Roboflow100-VL: A Multi-Domain Object Detection Benchmark for Vision-Language Models
- MetaSlot: Break Through the Fixed Number of Slots in Object-Centric Learning
- 360-LLaMA-Factory: Plug & Play Sequence Parallelism for Long Post-Training
- Estimating the Joint Probability of Scenario Parameters with Gaussian Mixture Copula Models
- Probing forced responses and causal mechanisms in large-scale climate dynamics with reduced-order neural models
- Making and Evaluating Calibrated Forecasts
- The Effect of Label Noise on the Information Content of Neural Representations
- Test-Time Efficient Pretrained Model Portfolios for Time Series Forecasting
- Nearly Instance-Optimal Parameter Recovery from Many Trajectories via Hellinger Localization
- Bayesian Optimization under Uncertainty for Training a Scale Parameter in Stochastic Models
- GUIDE: Guided Initialization and Distillation of Embeddings
- Text-to-Image Models Leave Identifiable Signatures: Implications for Leaderboard Security
- Wide Neural Networks as a Baseline for the Computational No-Coincidence Conjecture
- DPA-Net: A Dual-Path Attention Neural Network for Inferring Glycemic Control Metrics from Self-Monitored Blood Glucose Data
- POME: Post Optimization Model Edit via Muon-style Projection
- Chem-NMF: Multi-layer $\alpha$-divergence Non-Negative Matrix Factorization for Cardiorespiratory Disease Clustering, with Improved Convergence Inspired by Chemical Catalysts and Rigorous Asymptotic Analysis
- Three Forms of Stochastic Injection for Improved Distribution-to-Distribution Generative Modeling
- StruSR: Structure-Aware Symbolic Regression with Physics-Informed Taylor Guidance
- Rethinking Nonlinearity: Trainable Gaussian Mixture Modules for Modern Neural Architectures
- The Effect of Attention Head Count on Transformer Approximation
- XRPO: Pushing the limits of GRPO with Targeted Exploration and Exploitation
- TimeFormer: Transformer with Attention Modulation Empowered by Temporal Characteristics for Time Series Forecasting
- Distributed Algorithms for Multi-Agent Multi-Armed Bandits with Collision
- AutoBalance: An Automatic Balancing Framework for Training Physics-Informed Neural Networks
- Is the Hard-Label Cryptanalytic Model Extraction Really Polynomial?
- A Diffusion Model for Regular Time Series Generation from Irregular Data with Completion and Masking
- Incorporating Expert Knowledge into Bayesian Causal Discovery of Mixtures of Directed Acyclic Graphs
- Function regression using the forward forward training and inferring paradigm
- Get RICH or Die Scaling: Profitably Trading Inference Compute for Robustness
- The Unreasonable Effectiveness of Randomized Representations in Online Continual Graph Learning
- Efficient numeracy in language models through single-token number embeddings
- Early wind turbine alarm prediction based on machine learning: AlarmForecasting
- Vectorized FlashAttention with Low-cost Exponential Computation in RISC-V Vector Processors
- SaFeR-VLM: Toward Safety-aware Fine-grained Reasoning in Multimodal Models
- Vacuum Spiker: A Spiking Neural Network-Based Model for Efficient Anomaly Detection in Time Series
- Utilizing Large Language Models for Machine Learning Explainability
- Revisiting Node Affinity Prediction in Temporal Graphs
- Fisher Information, Training and Bias in Fourier Regression Models
- From Condensation to Rank Collapse: A Two-Stage Analysis of Transformer Training Dynamics
- High-Rate Mixout: Revisiting Mixout for Robust Domain Generalization
- Revisiting Mixout: An Overlooked Path to Robust Finetuning
- Spiral Model Technique For Data Science & Machine Learning Lifecycle
- Sharpness-Aware Data Generation for Zero-shot Quantization
- COMPASS: A Multi-Turn Benchmark for Tool-Mediated Planning & Preference Optimization
- Enhancing Speech Emotion Recognition via Fine-Tuning Pre-Trained Models and Hyper-Parameter Optimisation
- Blind Construction of Angular Power Maps in Massive MIMO Networks
- Non-Stationary Online Structured Prediction with Surrogate Losses
- Non-Asymptotic Analysis of Efficiency in Conformalized Regression
- DPMM-CFL: Clustered Federated Learning via Dirichlet Process Mixture Model Nonparametric Clustering
- Bridged Clustering for Representation Learning: Semi-Supervised Sparse Bridging
- Poisoning Attacks on LLMs Require a Near-constant Number of Poison Samples
- An in-depth look at approximation via deep and narrow neural networks
- Guided by the Experts: Provable Feature Learning Dynamic of Soft-Routed Mixture-of-Experts
- A Broader View of Thompson Sampling
- Discriminative Feature Feedback with General Teacher Classes
- Test-Time Graph Search for Goal-Conditioned Reinforcement Learning
- Dynamic Regret Bounds for Online Omniprediction with Long Term Constraints
- MolGA: Molecular Graph Adaptation with Pre-trained 2D Graph Encoder
- Enhancing Resilience for IoE: A Perspective of Networking-Level Safeguard
- Layerwise Federated Learning for Heterogeneous Quantum Clients using Quorus
- Milestone Determination for Autonomous Railway Operation
- Neu-RadBERT for Enhanced Diagnosis of Brain Injuries and Conditions
- Toward Uncertainty-Aware and Generalizable Neural Decoding for Quantum LDPC Codes
- Developing a Sequential Deep Learning Pipeline to Model Alaskan Permafrost Thaw Under Climate Change
- Beyond Static Knowledge Messengers: Towards Adaptive, Fair, and Scalable Federated Learning for Medical AI
- A Mixed-Methods Analysis of Repression and Mobilization in Bangladesh's July Revolution Using Machine Learning and Statistical Modeling
- Vision Transformer for Transient Noise Classification
- General and Efficient Visual Goal-Conditioned Reinforcement Learning using Object-Agnostic Masks
- Mass Conservation on Rails -- Rethinking Physics-Informed Learning of Ice Flow Vector Fields
- Scalable deep fusion of spaceborne lidar and synthetic aperture radar for global forest structural complexity mapping
- Conditional Denoising Diffusion Model-Based Robust MR Image Reconstruction from Highly Undersampled Data
- Diffusion-Guided Renormalization of Neural Systems via Tensor Networks
- A General Constructive Upper Bound on Shallow Neural Nets Complexity
- Road Surface Condition Detection with Machine Learning using New York State Department of Transportation Camera Images and Weather Forecast Data
- Online Matching via Reinforcement Learning: An Expert Policy Orchestration Strategy
- BACHI: Boundary-Aware Symbolic Chord Recognition Through Masked Iterative Decoding on Pop and Classical Music
- From Description to Detection: LLM based Extendable O-RAN Compliant Blind DoS Detection in 5G and Beyond
- Cluster Paths: Navigating Interpretability in Neural Networks
- From Acceleration to Saturation: Scaling Behavior of Bootstrapped Language Model Pretraining
- Adapting Quantum Machine Learning for Energy Dissociation of Bonds
- FEAorta: A Fully Automated Framework for Finite Element Analysis of the Aorta From 3D CT Images
- Unsupervised Backdoor Detection and Mitigation for Spiking Neural Networks
- A Comparative Analysis of Contextual Representation Flow in State-Space and Transformer Architectures
- Q-Learning with Fine-Grained Gap-Dependent Regret
- Fitzpatrick Thresholding for Skin Image Segmentation
- Gaussian Equivalence for Self-Attention: Asymptotic Spectral Analysis of Attention Matrix
- Latent Representation Learning in Heavy-Ion Collisions with MaskPoint Transformer
- Differentially Private Synthetic Text Generation for Retrieval-Augmented Generation (RAG)
- Quantum Computing Methods for Malware Detection
- BlackboxNLP-2025 MIB Shared Task: Exploring Ensemble Strategies for Circuit Localization Methods
- Efficient Discriminative Joint Encoders for Large Scale Vision-Language Reranking
- Reconquering Bell sampling on qudits: stabilizer learning and testing, quantum pseudorandomness bounds, and more
- Quantum Sparse Recovery and Quantum Orthogonal Matching Pursuit
- Textual interpretation of transient image classifications from large language models
- PyCFRL: A Python library for counterfactually fair offline reinforcement learning via sequential data preprocessing
- Accelerating Sparse Ternary GEMM for Quantized LLM inference on Apple Silicon
- Falsification-Driven Reinforcement Learning for Maritime Motion Planning
- Relational Database Distillation: From Structured Tables to Condensed Graph Data
- Root Cause Analysis of Outliers in Unknown Cyclic Graphs
- Pseudo-MDPs: A Novel Framework for Efficiently Optimizing Last Revealer Seed Manipulations in Blockchains
- Explaining Models under Multivariate Bernoulli Distribution via Hoeffding Decomposition
- Diffusion-Augmented Reinforcement Learning for Robust Portfolio Optimization under Stress Scenarios
- Active Control of Turbulent Airfoil Flows Using Adjoint-based Deep Learning
- GNN-enhanced Traffic Anomaly Detection for Next-Generation SDN-Enabled Consumer Electronics
- TRIM: Token-wise Attention-Derived Saliency for Data-Efficient Instruction Tuning
- Spectral Graph Clustering under Differential Privacy: Balancing Privacy, Accuracy, and Efficiency
- NurseLLM: The First Specialized Language Model for Nursing
- Quantifying Data Contamination in Psychometric Evaluations of LLMs
- Bayesian Portfolio Optimization by Predictive Synthesis
- Split Conformal Classification with Unsupervised Calibration
- A Multi-Agent Framework for Stateful Inference-Time Search
- ELMUR: External Layer Memory with Update/Rewrite for Long-Horizon RL
- TIGeR: Tool-Integrated Geometric Reasoning in Vision-Language Models for Robotics
- Resolution scaling governs DINOv3 transfer performance in chest radiograph classification
- HyPlan: Hybrid Learning-Assisted Planning Under Uncertainty for Safe Autonomous Driving
- Language Lives in Sparse Dimensions: Toward Interpretable and Efficient Multilingual Control for Large Language Models
- GenPilot: A Multi-Agent System for Test-Time Prompt Optimization in Image Generation
- Where to Begin: Efficient Pretraining via Subnetwork Selection and Distillation
- Benchmarking LLM Causal Reasoning with Scientifically Validated Relationships
- LeMAJ (Legal LLM-as-a-Judge): Bridging Legal Reasoning and LLM Evaluation
- On the false election between regulation and innovation. Ideas for regulation through the responsible use of artificial intelligence in research and education. [Spanish version]
- Online Rubrics Elicitation from Pairwise Comparisons
- GTCN-G: A Residual Graph-Temporal Fusion Network for Imbalanced Intrusion Detection (Preprint)
- Evolutionary Profiles for Protein Fitness Prediction
- AudioMarathon: A Comprehensive Benchmark for Long-Context Audio Understanding and Efficiency in Audio LLMs
- Cocoon: A System Architecture for Differentially Private Training with Correlated Noises
- MLE-Smith: Scaling MLE Tasks with Automated Multi-Agent Pipeline
- h1: Bootstrapping LLMs to Reason over Longer Horizons via Reinforcement Learning
- GyroSwin: 5D Surrogates for Gyrokinetic Plasma Turbulence Simulations
- Vibe Checker: Aligning Code Evaluation with Human Preference
- Artificial Hippocampus Networks for Efficient Long-Context Modeling
- Inferring Capabilities from Task Performance with Bayesian Triangulation
- Transparent and Coherent Procedural Mistake Detection
- An Illusion of Progress? Assessing the Current State of Web Agents
- Empirically evaluating commonsense intelligence in large language models with large-scale human judgments
- Controlled Agentic Planning & Reasoning for Mechanism Synthesis
- Functional Matching of Logic Subgraphs: Beyond Structural Isomorphism
- Dyna-Think: Synergizing Reasoning, Acting, and World Model Simulation in AI Agents
- KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality
- Attacking the Spike: On the Transferability and Security of Spiking Neural Networks to Adversarial Examples
- Multi-modal Segment Assemblage Network for Ad Video Editing with Importance-Coherence Reward
- Is My Data in Your AI? Membership Inference Test (MINT) applied to Face Biometrics
- Unlocking Dataset Distillation with Diffusion Models
- ECLM: Entity Level Language Model for Spoken Language Understanding with Chain of Intent
- Empowering LLMs with Pseudo-Untrimmed Videos for Audio-Visual Temporal Understanding
- V2Xum-LLM: Cross-Modal Video Summarization with Temporal Prompt Instruction Tuning
- A Deep Learning System for Rapid and Accurate Warning of Acute Aortic Syndrome on Non-contrast CT in China
- Approximately Aligned Decoding
- NAR-*ICP: Neural Execution of Classical ICP-based Pointcloud Registration Algorithms
- Error Bounds for Physics-Informed Neural Networks in Fokker-Planck PDEs
- SuffixDecoding: Extreme Speculative Decoding for Emerging AI Applications
- VidComposition: Can MLLMs Analyze Compositions in Compiled Videos?
- Machine Learning and Multi-source Remote Sensing in Forest Aboveground Biomass Estimation: A Review
- Sustainable Self-evolution Adversarial Training
- Evil twins are not that evil: Qualitative insights into machine-generated prompts
- Towards a Multimodal Large Language Model with Pixel-Level Insight for Biomedicine
- KunServe: Parameter-centric Memory Management for Efficient Memory Overloading Handling in LLM Serving
- Tempo: Compiled Dynamic Deep Learning with Symbolic Dependence Graphs
- FedAGHN: Personalized Federated Learning with Attentive Graph HyperNetworks
- A Dual-Agent Adversarial Framework for Robust Generalization in Deep Reinforcement Learning
- Achieving Hyperbolic-Like Expressiveness with Arbitrary Euclidean Regions: A New Approach to Hierarchical Embeddings
- LLM Unlearning via Neural Activation Redirection
- MM-PoisonRAG: Disrupting Multimodal RAG with Local and Global Poisoning Attacks
- Lossy Neural Compression for Geospatial Analytics: A Review
- Mind the (Belief) Gap: Group Identity in the World of LLMs
- Improving Neutral Point-of-View Generation with Data- and Parameter-Efficient RL
- Mitigating Cross-Modal Distraction and Ensuring Geometric Feasibility via Affordance-Guided and Self-Consistent MLLMs for Task Planning in Instruction-Following Manipulation
- NdLinear: Preserving Multi-Dimensional Structure for Parameter-Efficient Neural Networks
- Optimizing Breast Cancer Detection in Mammograms: A Comprehensive Study of Transfer Learning, Resolution Reduction, and Multi-View Classification
- AerialVG: A Challenging Benchmark for Aerial Visual Grounding by Exploring Positional Relations
- Weight Ensembling Improves Reasoning in Language Models
- Efficient Flow Matching using Latent Variables
- Generative Pre-trained Autoregressive Diffusion Transformer
- MONAQ: Multi-Objective Neural Architecture Querying for Time-Series Analysis on Resource-Constrained Devices
- AC-LoRA: (Almost) Training-Free Access Control-Aware Multi-Modal LLMs
- AdaDim: Dimensionality Adaptation for SSL Representational Dynamics
- MoRE-Brain: Routed Mixture of Experts for Interpretable and Generalizable Cross-Subject fMRI Visual Decoding
- SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis
- The Unreasonable Effectiveness of Model Merging for Cross-Lingual Transfer in LLMs
- Performance of machine-learning-assisted Monte Carlo in sampling from simple statistical physics models
- Exchangeability in Neural Network and its Application to Dynamic Pruning
- CyberGym: Evaluating AI Agents' Real-World Cybersecurity Capabilities at Scale
- Learning to Recover: Dynamic Reward Shaping with Wheel-Leg Coordination for Fallen Robots
- KramaBench: A Benchmark for AI Systems on Data-to-Insight Pipelines over Data Lakes
- AutoMind: Adaptive Knowledgeable Agent for Automated Data Science
- Prefilled responses enhance zero-shot detection of AI-generated images
- Context Matters! Relaxing Goals with LLMs for Feasible 3D Scene Planning
- Understanding Software Engineering Agents: A Study of Thought-Action-Result Trajectories
- Structure-Aware Compound-Protein Affinity Prediction via Graph Neural Network with Group Lasso Regularization
- Enjoying Non-linearity in Multinomial Logistic Bandits
- Token-based Audio Inpainting via Discrete Diffusion
- Quantum Machine Learning in Multi-Qubit Phase-Space Part I: Foundations
- Community-Centered Spatial Intelligence for Climate Adaptation at Nova Scotia's Eastern Shore
- Intelligent Healthcare Imaging Platform: A VLM-Based Framework for Automated Medical Image Analysis and Clinical Report Generation
- On knot detection via picture recognition
- PIKAN: Physics-Inspired Kolmogorov-Arnold Networks for Explainable UAV Channel Modelling
- Lagrangian neural ODEs: Measuring the existence of a Lagrangian with Helmholtz metrics
- RareGraph-Synth: Knowledge-Guided Diffusion Models for Generating Privacy-Preserving Synthetic Patient Trajectories in Ultra-Rare Diseases
- MCCE: A Framework for Multi-LLM Collaborative Co-Evolution
- Reproducibility Study of "XRec: Large Language Models for Explainable Recommendation"
- A Total Variation Regularized Framework for Epilepsy-Related MRI Image Segmentation
- RVFL-X: A Novel Randomized Network Based on Complex Transformed Real-Valued Tabular Datasets
- Surgeons Are Indian Males and Speech Therapists Are White Females: Auditing Biases in Vision-Language Models for Healthcare Professionals
- Improving the Spatial Resolution of GONG Solar Images to GST Quality Using Deep Learning
- SER-Diff: Synthetic Error Replay Diffusion for Incremental Brain Tumor Segmentation
- Soft-Evidence Fused Graph Neural Network for Cancer Driver Gene Identification across Multi-View Biological Graphs
- Traj-Transformer: Diffusion Models with Transformer for GPS Trajectory Generation
- ChainMPQ: Interleaved Text-Image Reasoning Chains for Mitigating Relation Hallucinations
- BlockGPT: Spatio-Temporal Modelling of Rainfall via Frame-Level Autoregression
- Efficient High-Resolution Image Editing with Hallucination-Aware Loss and Adaptive Tiling
- VeriEquivBench: An Equivalence Score for Ground-Truth-Free Evaluation of Formally Verifiable Code
- RGBD Gaze Tracking Using Transformer for Feature Fusion
- SDAR: A Synergistic Diffusion-AutoRegression Paradigm for Scalable Sequence Generation
- Leveraging Large Language Models for Cybersecurity Risk Assessment -- A Case from Forestry Cyber-Physical Systems
- Flexible Swarm Learning May Outpace Foundation Models in Essential Tasks
- Asking For It: Question-Answering for Predicting Rule Infractions in Online Content Moderation
- TransFIRA: Transfer Learning for Face Image Recognizability Assessment
- Constrained Natural Language Action Planning for Resilient Embodied Systems
- EverydayMMQA: A Multilingual and Multimodal Framework for Culturally Grounded Spoken Visual QA
- Relational Transformer: Toward Zero-Shot Foundation Models for Relational Data
- Monte Carlo Permutation Search
- Protecting De-identified Documents from Search-based Linkage Attacks
- Reward Model Perspectives: Whose Opinions Do Reward Models Reward?
- Adaptive Protein Design Protocols and Middleware
- Geometry-Aware Backdoor Attacks: Leveraging Curvature in Hyperbolic Embeddings
- Context-Aware Inference via Performance Forecasting in Decentralized Learning Networks
- A Survey on Agentic Security: Applications, Threats and Defenses
- How NOT to benchmark your SITE metric: Beyond Static Leaderboards and Towards Realistic Evaluation
- Evaluating Node-tree Interfaces for AI Explainability
- Deep Generative Model for Human Mobility Behavior
- Attention Sinks and Compression Valleys in LLMs are Two Sides of the Same Coin
- Valid Stopping for LLM Generation via Empirical Dynamic Formal Lift
- Webscale-RL: Automated Data Pipeline for Scaling RL Data to Pretraining Levels
- ATLO-ML: Adaptive Time-Length Optimizer for Machine Learning -- Insights from Air Quality Forecasting
- A Median Perspective on Unlabeled Data for Out-of-Distribution Detection
- LogSTOP: Temporal Scores over Prediction Sequences for Matching and Retrieval
- Visualizing Multimodality in Combinatorial Search Landscapes
- CLAQS: Compact Learnable All-Quantum Token Mixer with Shared-ansatz for Text Classification
- Scalable Policy-Based RL Algorithms for POMDPs
- Incoherence in goal-conditioned autoregressive models
- The Markovian Thinker
- The Algebra of Meaning: Why Machines Need Montague More Than Moore's Law
- HSNet: Heterogeneous Subgraph Network for Single Image Super-resolution
- The Framework That Survives Bad Models: Human-AI Collaboration For Clinical Trials
- SDQM: Synthetic Data Quality Metric for Object Detection Dataset Evaluation
- Reading Between the Lines: Towards Reliable Black-box LLM Fingerprinting via Zeroth-order Gradient Estimation
- AI-Driven Forecasting and Monitoring of Urban Water System
- Control-Augmented Autoregressive Diffusion for Data Assimilation
- StaR-KVQA: Structured Reasoning Traces for Implicit-Knowledge Visual Question Answering
- Distilling Lightweight Language Models for C/C++ Vulnerabilities
- The False Promise of Zero-Shot Super-Resolution in Machine-Learned Operators
- Local Reinforcement Learning with Action-Conditioned Root Mean Squared Q-Functions
- Delay Independent Safe Control with Neural Networks: Positive Lur'e Certificates for Risk Aware Autonomy
- Automated Neural Architecture Design for Industrial Defect Detection
- Heptapod: Language Modeling on Visual Signals
- Incremental Summarization for Customer Support via Progressive Note-Taking and Agent Feedback
- Semantic Segmentation Algorithm Based on Light Field and LiDAR Fusion
- Learning to Rewrite Prompts for Bootstrapping LLMs on Downstream Tasks
- AISysRev -- LLM-based Tool for Title-abstract Screening
- Dual Goal Representations
- LLM Company Policies and Policy Implications in Software Organizations
- Scaling LLM Multi-turn RL with End-to-end Summarization-based Context Management
- Are LLMs Reliable Rankers? Rank Manipulation via Two-Stage Token Optimization
- Evaluating LLMs for Historical Document OCR: A Methodological Framework for Digital Humanities
- Modeling COVID-19 Dynamics in German States Using Physics-Informed Neural Networks
- Foundations of LLM Knowledge Materialization: Termination, Reproducibility, Robustness
- Extreme Amodal Face Detection
- FURINA: A Fully Customizable Role-Playing Benchmark via Scalable Multi-Agent Collaboration Pipeline
- Recurrence-Complete Frame-based Action Models
- CNN-TFT explained by SHAP with multi-head attention weights for time series forecasting
- SID: Multi-LLM Debate Driven by Self Signals
- OpenJAI-v1.0: An Open Thai Large Language Model
- Enhancing Bankruptcy Prediction of Banks through Advanced Machine Learning Techniques: An Innovative Approach and Analysis
- Explaining raw data complexity to improve satellite onboard processing
- Towards Generalization of Graph Neural Networks for AC Optimal Power Flow
- Multi-hop Deep Joint Source-Channel Coding with Deep Hash Distillation for Semantically Aligned Image Retrieval
- MoRE-GNN: Multi-omics Data Integration with a Heterogeneous Graph Autoencoder
- Multi-Dimensional Autoscaling of Stream Processing Services on Edge Devices
- M3Retrieve: Benchmarking Multimodal Retrieval for Medicine
- Angular Constraint Embedding via SpherePair Loss for Constrained Clustering
- Emotionally Vulnerable Subtype of Internet Gaming Disorder: Measuring and Exploring the Pathology of Problematic Generative AI Use
- DecompGAIL: Learning Realistic Traffic Behaviors with Decomposed Multi-Agent Generative Adversarial Imitation Learning
- LongRM: Revealing and Unlocking the Context Boundary of Reward Modeling
- Bayesian Nonparametric Dynamical Clustering of Time Series
- Expressive and Scalable Quantum Fusion for Multimodal Learning
- Grouped Differential Attention
- Open ASR Leaderboard: Towards Reproducible and Transparent Multilingual and Long-Form Speech Recognition Evaluation
- EDUMATH: Generating Standards-aligned Educational Math Word Problems
- Generating Surface for Text-to-3D using 2D Gaussian Splatting
- Learning Global Representation from Queries for Vectorized HD Map Construction
- VelLMes: A high-interaction AI-based deception framework
- The Limits of Goal-Setting Theory in LLM-Driven Assessment
- Pragyaan: Designing and Curating High-Quality Cultural Post-Training Datasets for Indian Languages
- Native Hybrid Attention for Efficient Sequence Modeling
- Federated Unlearning in the Wild: Rethinking Fairness and Data Discrepancy
- Mining the Mind: What 100M Beliefs Reveal About Frontier LLM Knowledge
- Unified Molecule Pre-training with Flexible 2D and 3D Modalities: Single and Paired Modality Integration
- Search-R3: Unifying Reasoning and Embedding Generation in Large Language Models
- Introspection in Learned Semantic Scene Graph Localisation
- LuxInstruct: A Cross-Lingual Instruction Tuning Dataset For Luxembourgish
- Vision-Language-Action Models for Robotics: A Review Towards Real-World Applications
- HTMformer: Hybrid Time and Multivariate Transformer for Time Series Forecasting
- Generative World Modelling for Humanoids: 1X World Model Challenge Technical Report
- Opt-ICL at LeWiDi-2025: Maximizing In-Context Signal from Rater Examples via Meta-Learning
- Graph Conditioned Diffusion for Controllable Histopathology Image Generation
- A Digital Twin Framework for Metamorphic Testing of Autonomous Driving Systems Using Generative Model
- TrackVLA++: Unleashing Reasoning and Memory Capabilities in VLA Models for Embodied Visual Tracking
- Comparing human and language models sentence processing difficulties on complex structures
- AlphaApollo: Orchestrating Foundation Models and Professional Tools into a Self-Evolving System for Deep Agentic Reasoning
- Bridging Reasoning to Learning: Unmasking Illusions using Complexity Out of Distribution Generalization
- BuilderBench -- A benchmark for generalist agents
- Requirements for Game-Based Learning Design Framework for Information System Integration in the Context of Post-Merger Integration
- Belief-Calibrated Multi-Agent Consensus Seeking for Complex NLP Tasks
- Off-Trajectory Reasoning: Can LLMs Collaborate on Reasoning Trajectory?
- Flavonoid Fusion: Creating a Knowledge Graph to Unveil the Interplay Between Food and Health
- PuzzlePlex: Benchmarking Foundation Models on Reasoning and Planning with Puzzles
- Beneficial Reasoning Behaviors in Agentic Search and Effective Post-training to Obtain Them
- Auto-Prompt Ensemble for LLM Judge
- WebDART: Dynamic Decomposition and Re-planning for Complex Web Tasks
- Fine-Grained Emotion Recognition via In-Context Learning
- Agent-in-the-Loop: A Data Flywheel for Continuous Improvement in LLM-based Customer Support
- Inefficiencies of Meta Agents for Agent Design
- MultiCNKG: Integrating Cognitive Neuroscience, Gene, and Disease Knowledge Graphs Using Large Language Models
- Verifying Memoryless Sequential Decision-making of Large Language Models
- Evolving and Executing Research Plans via Double-Loop Multi-Agent Collaboration
- Autoformalizer with Tool Feedback
- TGPR: Tree-Guided Policy Refinement for Robust Self-Debugging of LLMs
- LLM-Assisted Modeling of Semantic Web-Enabled Multi-Agents Systems with AJAN
- Revisiting the Uniform Information Density Hypothesis in LLM Reasoning Traces
- Tool-Augmented Policy Optimization: Synergizing Reasoning and Adaptive Tool Use with Reinforcement Learning
- Prompt Optimization Across Multiple Agents for Representing Diverse Human Populations
- Inductive Learning for Possibilistic Logic Programs Under Stable Models
- VRPAgent: LLM-Driven Discovery of Heuristic Operators for Vehicle Routing Problems
- The Cognitive Bandwidth Bottleneck: Shifting Long-Horizon Agent from Planning with Actions to Planning with Schemas
- The Contingencies of Physical Embodiment Allow for Open-Endedness and Care
- Integrating Domain Knowledge into Process Discovery Using Large Language Models
- NewtonBench: Benchmarking Generalizable Scientific Law Discovery in LLM Agents
- Multi-Objective Multi-Agent Path Finding with Lexicographic Cost Preferences
- Agentic generative AI for media content discovery at the national football league
- DeepXPalm: Tilt and Position Rendering using Palm-worn Haptic Display and CNN-based Tactile Pattern Recognition
- TiltXter: CNN-based Electro-tactile Rendering of Tilt Angle for Telemanipulation of Pasteur Pipettes
- A Multimodal GUI Architecture for Interfacing with LLM-Based Conversational Assistants
- Exploring Human-AI Collaboration Using Mental Models of Early Adopters of Multi-Agent Generative AI Tools
- Generalized Multi-agent Social Simulation Framework
- Stacked Regression using Off-the-shelf, Stimulus-tuned and Fine-tuned Neural Networks for Predicting fMRI Brain Responses to Movies (Algonauts 2025 Report)
- Uncertainty Quantification In Surface Landmines and UXO Classification Using MC Dropout
- Knowledge Graph-Guided Multi-Agent Distillation for Reliable Industrial Question Answering with Datasets
- Transparent Reference-free Automated Evaluation of Open-Ended User Survey Responses
- CoT Referring: Improving Referring Expression Tasks with Grounded Reasoning
- Evaluating Embedding Frameworks for Scientific Domain
- DynBenchmark: Customizable Ground Truths to Benchmark Community Detection and Tracking in Temporal Networks
- TRepLiNa: Layer-wise CKA+REPINA Alignment Improves Low-Resource Machine Translation in Aya-23 8B
- Scalable multilingual PII annotation for responsible AI in LLMs
- Dream2Image: An Open Multimodal EEG Dataset for Decoding and Visualizing Dreams with Artificial Intelligence
- LLM-Driven Rubric-Based Assessment of Algebraic Competence in Multi-Stage Block Coding Tasks with Design and Field Evaluation
- Ensemble Deep Learning and LLM-Assisted Reporting for Automated Skin Lesion Diagnosis
- Prakriti200: A Questionnaire-Based Dataset of 200 Ayurvedic Prakriti Assessments
- Dual-stage and Lightweight Patient Chart Summarization for Emergency Physicians
- Language models for longitudinal analysis of abusive content in Billboard Music Charts
Research Sources: 650 | Generated: 10/10/2025