AI RESEARCH PAPERS & ACADEMIC SOURCES
- NaViL: Rethinking Scaling Properties of Native Multimodal Large Language Models under Data Constraints
- D$^2$GS: Depth-and-Density Guided Gaussian Splatting for Stable and Accurate Sparse-View Reconstruction
- ReSplat: Learning Recurrent Gaussian Splats
- FlowLensing: Simulating Gravitational Lensing with Flow Matching
- SatFusion: A Unified Framework for Enhancing Satellite IoT Images via Multi-Temporal and Multi-Source Data Fusion
- SViM3D: Stable Video Material Diffusion for Single Image 3D Generation
- Spectral Prefiltering of Neural Fields
- Splat the Net: Radiance Fields with Splattable Neural Primitives
- X2Video: Adapting Diffusion Models for Multimodal Controllable Neural Video Rendering
- R2RGEN: Real-to-Real 3D Data Generation for Spatially Generalized Manipulation
- DexNDM: Closing the Reality Gap for Dexterous In-Hand Rotation via Joint-Wise Neural Dynamics Model
- Scalable Offline Metrics for Autonomous Driving
- PRVR: Partially Relevant Video Retrieval
- I&S-ViT: An Inclusive & Stable Method for Pushing the Limit of Post-Training ViTs Quantization
- Redundant Semantic Environment Filling via Misleading-Learning for Fair Deepfake Detection
- Surfel-based Gaussian Inverse Rendering for Fast and Relightable Dynamic Human Reconstruction from Monocular Video
- Motion Capture from Inertial and Vision Sensors
- CurvNet: Latent Contour Representation and Iterative Data Engine for Curvature Angle Estimation
- MonoGSDF: Exploring Monocular Geometric Cues for Gaussian Splatting-Guided Implicit Surface Reconstruction
- EFSA: Episodic Few-Shot Adaptation for Text-to-Image Retrieval
- Scalable Cosmic AI Inference using Cloud Serverless Computing
- Self-Training with Dynamic Weighting for Robust Gradual Domain Adaptation
- H3DE-Net: Efficient and Accurate 3D Landmark Detection in Medical Imaging
- TransMamba: Fast Universal Architecture Adaption from Transformers to Mamba
- DICEPTION: A Generalist Diffusion Model for Visual Perceptual Tasks
- Uncertainty-Aware Diffusion Guided Refinement of 3D Scenes
- Targetless LiDAR-Camera Calibration with Neural Gaussian Splatting
- DvD: Unleashing a Generative Paradigm for Document Dewarping via Coordinates-based Diffusion Model
- ThinkGeo: Evaluating Tool-Augmented Agents for Remote Sensing Tasks
- MotionSight: Boosting Fine-Grained Motion Understanding in Multimodal LLMs
- IMAGHarmony: Controllable Image Editing with Consistent Object Quantity and Layout
- OASIS: Online Sample Selection for Continual Visual Instruction Tuning
- Feedback Guidance of Diffusion Models
- ManipGPT: Is Affordance Segmentation by Large Vision Models Enough for Articulated Object Manipulation?
- Language learning shapes visual category-selectivity in deep neural networks
- MAMBO: High-Resolution Generative Approach for Mammography Images
- Which Heads Matter for Reasoning? RL-Guided KV Cache Compression
- PATCH: Mitigating PII Leakage in Language Models with Privacy-Aware Targeted Circuit PatcHing
- Who Stole Your Data? A Method for Detecting Unauthorized RAG Theft
- From Keywords to Clusters: AI-Driven Analysis of YouTube Comments to Reveal Election Issue Salience in 2024
- Pseudo2Real: Task Arithmetic for Pseudo-Label Correction in Automatic Speech Recognition
- ReasonEmbed: Enhanced Text Embeddings for Reasoning-Intensive Document Retrieval
- The Visual Iconicity Challenge: Evaluating Vision-Language Models on Sign Language Form-Meaning Mapping
- SliceFine: The Universal Winning-Slice Hypothesis for Pretrained Networks
- Evaluating LLMs' Mathematical Reasoning in Financial Document Question Answering
- ThinkNote: Enhancing Knowledge Integration and Utilization of Large Language Models via Constructivist Cognition Modeling
- Expert-Token Resonance MoE: Bidirectional Routing with Efficiency Affinity-Driven Active Selection
- Med-R$^2$: Crafting Trustworthy LLM Physicians via Retrieval and Reasoning of Evidence-Based Medicine
- Mitigating Forgetting in LLM Fine-Tuning via Low-Perplexity Token Learning
- Examining Multilingual Embedding Models Cross-Lingually Through LLM-Generated Adversarial Examples
- Less is More: Compact Clue Selection for Efficient Retrieval-Augmented Generation Reasoning
- Beyond Single Frames: Can LMMs Comprehend Temporal and Contextual Narratives in Image Sequences?
- Argument Summarization and its Evaluation in the Era of Large Language Models
- Sherkala-Chat: Building a State-of-the-Art LLM for Kazakh in a Moderately Resourced Setting
- DiMA: An LLM-Powered Ride-Hailing Assistant at DiDi
- UniEDU: A Unified Language and Vision Assistant for Education Applications
- Can LLMs Grasp Implicit Cultural Values? Benchmarking LLMs' Cultural Intelligence with CQ-Bench
- Say It Another Way: Auditing LLMs with a User-Grounded Automated Paraphrasing Framework
- What Media Frames Reveal About Stance: A Dataset and Study about Memes in Climate Change Discourse
- UNCLE: Benchmarking Uncertainty Expressions in Long-Form Generation
- FlashDLM: Accelerating Diffusion Language Model Inference via Efficient KV Caching and Guided Diffusion
- FlowNIB: An Information Bottleneck Analysis of Bidirectional vs. Unidirectional Language Models
- From Handwriting to Feedback: Evaluating VLMs and LLMs for AI-Powered Assessment in Indonesian Classrooms
- Language Surgery in Multilingual Large Language Models
- How Grounded is Wikipedia? A Study on Structured Evidential Support and Retrieval
- The Behavioural Translation Style Space: Towards simulating the temporal dynamics of affect, behaviour, and cognition in human translation production
- Can Vision Language Models Infer Human Gaze Direction? A Controlled Study
- Play to Generalize: Learning to Reason Through Game Play
- DynamicEval: Rethinking Evaluation for Dynamic Text-to-Video Synthesis
- Provably Accelerated Imaging with Restarted Inertia and Score-based Image Priors
- D2RA: Dual Domain Regeneration Attack
- PickStyle: Video-to-Video Style Transfer with Context-Style Adapters
- Cross-Modal Attention Guided Unlearning in Vision-Language Models
- MaizeStandCounting (MaSC): Automated and Accurate Maize Stand Counting from UAV Imagery Using Image Processing and Deep Learning
- Quick-CapsNet (QCN): A fast alternative to Capsule Networks
- Rectified-CFG++ for Flow Based Models
- PIT-QMM: A Large Multimodal Model For No-Reference Point Cloud Quality Assessment
- Dual-Stream Alignment for Action Segmentation
- Once Is Enough: Lightweight DiT-Based Video Virtual Try-On via One-Time Garment Appearance Injection
- MONKEY: Masking ON KEY-Value Activation Adapter for Personalization
- Automatic Text Box Placement for Supporting Typographic Design
- Hybrid CNN-BYOL Approach for Fault Detection in Induction Motors Using Thermal Images
- Mutual Learning for Hashing: Unlocking Strong Hash Functions from Weak Supervision
- RePainter: Empowering E-commerce Object Removal via Spatial-matting Reinforcement Learning
- SyncHuman: Synchronizing 2D and 3D Generative Models for Single-view Human Reconstruction
- ComGS: Efficient 3D Object-Scene Composition via Surface Octahedral Probes
- DEGS: Deformable Event-based 3D Gaussian Splatting from RGB and Event Stream
- Demystifying Deep Learning-based Brain Tumor Segmentation with 3D UNets and Explainable AI (XAI): A Comparative Analysis
- GTR-Bench: Evaluating Geo-Temporal Reasoning in Vision-Language Models
- FMANet: A Novel Dual-Phase Optical Flow Approach with Fusion Motion Attention Network for Robust Micro-expression Recognition
- An End-to-End Room Geometry Constrained Depth Estimation Framework for Indoor Panorama Images
- Enhancing Visual Prompting through Expanded Transformation Space and Overfitting Mitigation
- MMHOI: Modeling Complex 3D Multi-Human Multi-Object Interactions
- PrismGS: Physically-Grounded Anti-Aliasing for High-Fidelity Large-Scale 3D Gaussian Splatting
- IsoSignVid2Aud: Sign Language Video to Audio Conversion without Text Intermediaries
- AlignGS: Aligning Geometry and Semantics for Robust Indoor Reconstruction from Sparse Views
- XYZCylinder: Feedforward Reconstruction for Driving Scenes Based on A Unified Cylinder Lifting Method
- MARC: Memory-Augmented RL Token Compression for Efficient Video Understanding
- ASBench: Image Anomalies Synthesis Benchmark for Anomaly Detection
- CVD-STORM: Cross-View Video Diffusion with Spatial-Temporal Reconstruction Model for Autonomous Driving
- Latent Harmony: Synergistic Unified UHD Image Restoration via Latent Space Regularization and Controllable Refinement
- The impact of abstract and object tags on image privacy classification
- GraphEnet: Event-driven Human Pose Estimation with a Graph Neural Network
- CIR-CoT: Towards Interpretable Composed Image Retrieval via End-to-End Chain-of-Thought Reasoning
- RayFusion: Ray Fusion Enhanced Collaborative Visual Perception
- RASALoRE: Region Aware Spatial Attention with Location-based Random Embeddings for Weakly Supervised Anomaly Detection in Brain MRI Scans
- RetouchLLM: Training-free White-box Image Retouching
- A class-driven hierarchical ResNet for classification of multispectral remote sensing images
- Towards Real-World Deepfake Detection: A Diverse In-the-wild Dataset of Forgery Faces
- DarkHash: A Data-Free Backdoor Attack Against Deep Hashing
- Efficient Label Refinement for Face Parsing Under Extreme Poses Using 3D Gaussian Splatting
- Real-Time Motion-Controllable Autoregressive Video Diffusion
- UniMMVSR: A Unified Multi-Modal Framework for Cascaded Video Super-Resolution
- Beyond Textual CoT: Interleaved Text-Image Chains with Deep Confidence Reasoning for Image Editing
- InstructUDrag: Joint Text Instructions and Object Dragging for Interactive Image Editing
- Fine-grained text-driven dual-human motion generation via dynamic hierarchical interaction
- Adaptive Gradient Calibration for Single-Positive Multi-Label Learning in Remote Sensing Image Scene Classification
- One Stone with Two Birds: A Null-Text-Null Frequency-Aware Diffusion Models for Text-Guided Image Inpainting
- A Multimodal Depth-Aware Method For Embodied Reference Understanding
- LTCA: Long-range Temporal Context Attention for Referring Video Object Segmentation
- Unlocking 3D Affordance Segmentation with 2D Semantic Knowledge
- LinVideo: A Post-Training Framework towards O(n) Attention in Efficient Video Generation
- SPICE: Simple and Practical Image Clarification and Enhancement
- Hyperspectral data augmentation with transformer-based diffusion models
- UniVideo: Unified Understanding, Generation, and Editing for Videos
- Robust Source-Free Domain Adaptation for Medical Image Segmentation based on Curriculum Learning
- VideoVerse: How Far is Your T2V Generator from a World Model?
- Hierarchical Spatial Algorithms for High-Resolution Image Quantization and Feature Extraction
- Video-STAR: Reinforcing Open-Vocabulary Action Recognition with Tools
- InstructX: Towards Unified Visual Editing with MLLM Guidance
- MoA-VR: A Mixture-of-Agents System Towards All-in-One Video Restoration
- Have We Scene It All? Scene Graph-Aware Deep Point Cloud Compression
- FlexTraj: Image-to-Video Generation with Flexible Point Trajectory Control
- MM-HELIX: Boosting Multimodal Long-Chain Reflective Reasoning with Holistic Platform and Adaptive Hybrid Policy Optimization
- ARTDECO: Towards Efficient and High-Fidelity On-the-Fly 3D Reconstruction with Structured Scene Representation
- VideoCanvas: Unified Video Completion from Arbitrary Spatiotemporal Patches via In-Context Conditioning
- MultiCOIN: Multi-Modal COntrollable Video INbetweening
- ResAD: Normalized Residual Trajectory Modeling for End-to-End Autonomous Driving
- Lemma Dilemma: On Lemma Generation Without Domain- or Language-Specific Training Data
- Meaningful Pose-Based Sign Language Evaluation
- Populism Meets AI: Advancing Populism Research with LLMs
- MAPRO: Recasting Multi-Agent Prompt Optimization as Maximum a Posteriori Inference
- AsyncSpade: Efficient Test-Time Scaling with Asynchronous Sparse Decoding
- ParsTranslit: Truly Versatile Tajik-Farsi Transliteration
- IASC: Interactive Agentic System for ConLangs
- Toward Reliable Clinical Coding with Language Models: Verification and Lightweight Adaptation
- Role-Conditioned Refusals: Evaluating Access Control Reasoning in Large Language Models
- Textual Entailment and Token Probability as Bias Evaluation Metrics
- MemWeaver: A Hierarchical Memory from Textual Interactive Behaviors for Personalized Generation
- SUBQRAG: sub-question driven dynamic graph rag
- Multilingual Knowledge Graph Completion via Efficient Multilingual Knowledge Sharing
- OpenRubrics: Towards Scalable Synthetic Rubric Generation for Reward Modeling and LLM Alignment
- Test-Time Reasoners Are Strategic Multiple-Choice Test-Takers
- Curing Miracle Steps in LLM Mathematical Reasoning with Rubric Rewards
- The Unintended Trade-off of AI Alignment:Balancing Hallucination Mitigation and Safety in LLMs
- RCPU: Rotation-Constrained Error Compensation for Structured Pruning of a Large Language Model
- Multilingual Generative Retrieval via Cross-lingual Semantic Compression
- Ready to Translate, Not to Represent? Bias and Performance Gaps in Multilingual LLMs Across Language Families and Domains
- Do LLMs Really Need 10+ Thoughts for "Find the Time 1000 Days Later"? Towards Structural Understanding of LLM Overthinking
- CS3-Bench: Evaluating and Enhancing Speech-to-Speech LLMs for Mandarin-English Code-Switching
- Standard-to-Dialect Transfer Trends Differ across Text and Speech: A Case Study on Intent and Topic Classification in German Dialects
- Metric Calculating Benchmark: Code-Verifiable Complicate Instruction Following Benchmark for Large Language Models
- ACE: Attribution-Controlled Knowledge Editing for Multi-hop Factual Recall
- Comprehensiveness Metrics for Automatic Evaluation of Factual Recall in Text Generation
- Vision-Enabled LLMs in Historical Lexicography: Digitising and Enriching Estonian-German Dictionaries from the 17th and 18th Centuries
- ChatGPT as a Translation Engine: A Case Study on Japanese-English
- Evaluating LLM-Generated Legal Explanations for Regulatory Compliance in Social Media Influencer Marketing
- Mitigating Judgment Preference Bias in Large Language Models through Group-Based Polling
- Beyond Over-Refusal: Scenario-Based Diagnostics and Post-Hoc Mitigation for Exaggerated Refusals in LLMs
- ARM2: Adaptive Reasoning Model with Vision Understanding and Executable Code
- METRICALARGS: A Taxonomy for Studying Metrical Poetry with LLMs
- Training-Free Group Relative Policy Optimization
- SenWave: A Fine-Grained Multi-Language Sentiment Analysis Dataset Sourced from COVID-19 Tweets
- The Alignment Waltz: Jointly Training Agents to Collaborate for Safety
- Beyond Turn Limits: Training Deep Search Agents with Dynamic Context Window
- Neuron-Level Analysis of Cultural Understanding in Large Language Models
- AutoRed: A Free-form Adversarial Prompt Generation Framework for Automated Red Teaming
- Two-Stage Voting for Robust and Efficient Suicide Risk Detection on Social Media
- If Probable, Then Acceptable? Understanding Conditional Acceptability Judgments in Large Language Models
- ARES: Multimodal Adaptive Reasoning via Difficulty-Aware Token-Level Entropy Shaping
- LeWiDi-2025 at NLPerspectives: The Third Edition of the Learning with Disagreements Shared Task
- Neologism Learning for Controllability and Self-Verbalization
- Efficient Prompt Optimisation for Legal Text Classification with Proxy Prompt Evaluator
- WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning
- LLM Fingerprinting via Semantically Conditioned Watermarks
- Continuum Transformers Perform In-Context Learning by Operator Gradient Descent
- Scaling Laws Are Unreliable for Downstream Tasks: A Reality Check
- Multi-Trigger Poisoning Amplifies Backdoor Vulnerabilities in LLMs
- Zebra-CoT: A Dataset for Interleaved Vision Language Reasoning
- Arbitrary Entropy Policy Optimization: Entropy Is Controllable in Reinforcement Finetuning
- Unsupervised Multi-Source Federated Domain Adaptation under Domain Diversity through Group-Wise Discrepancy Minimization
- Beyond Sub-6 GHz: Leveraging mmWave Wi-Fi for Gait-Based Person Identification
- Bidirectional Representations Augmented Autoregressive Biological Sequence Generation:Application in De Novo Peptide Sequencing
- Long-tailed Recognition with Model Rebalancing
- Dual-granularity Sinkhorn Distillation for Enhanced Learning from Long-tailed Noisy Data
- Post-hoc Stochastic Concept Bottleneck Models
- Reinforcement Learning from Probabilistic Forecasts for Safe Decision-Making via Conditional Value-at-Risk Planning
- Enhancing Reasoning for Diffusion LLMs via Distribution Matching Policy Optimization
- Bridging the Physics-Data Gap with FNO-Guided Conditional Flow Matching: Designing Inductive Bias through Hierarchical Physical Constraints
- Dynamic Features Adaptation in Networking: Toward Flexible training and Explainable inference
- Robust and Efficient Collaborative Learning
- To Ask or Not to Ask: Learning to Require Human Feedback
- Guided Star-Shaped Masked Diffusion
- Contrastive Self-Supervised Learning at the Edge: An Energy Perspective
- Characterizing the Multiclass Learnability of Forgiving 0-1 Loss Functions
- Biology-driven assessment of deep learning super-resolution imaging of the porosity network in dentin
- Reinforcing Diffusion Models by Direct Group Preference Optimization
- SummDiff: Generative Modeling of Video Summarization with Diffusion
- In-Context Clustering with Large Language Models
- Better Together: Leveraging Unpaired Multimodal Data for Stronger Unimodal Models
- DYNAMIX: RL-based Adaptive Batch Size Optimization in Distributed Machine Learning Systems
- Convergence Theorems for Entropy-Regularized and Distributional Reinforcement Learning
- Entropy Regularizing Activation: Boosting Continuous Control, Large Language Models, and Image Classification with Activation as Entropy Constraints
- Improving Reasoning for Diffusion Language Models via Group Diffusion Policy Optimization
- Who Said Neural Networks Aren't Linear?
- Geodesics in the Deep Linear Network
- Decoding the dark proteome: Deep learning-enabled discovery of druggable enzymes in Wuchereria bancrofti
- SpotDiff: Spotting and Disentangling Interference in Feature Space for Subject-Preserving Image Generation
- Beyond Grid-Locked Voxels: Neural Response Functions for Continuous Brain Encoding
- Enhancing Maritime Object Detection in Real-Time with RT-DETR and Data Augmentation
- Inconsistent Affective Reaction: Sentiment of Perception and Opinion in Urban Environments
- Bayesian Optimization of Multi-Bit Pulse Encoding in In2O3/Al2O3 Thin-film Transistors for Temporal Data Processing
- VeMo: A Lightweight Data-Driven Approach to Model Vehicle Dynamics
- Comparison of Fully Homomorphic Encryption and Garbled Circuit Techniques in Privacy-Preserving Machine Learning Inference
- Evaluating and Learning Optimal Dynamic Treatment Regimes under Truncation by Death
- Time-Frequency Filtering Meets Graph Clustering
- Beyond independent component analysis: identifiability and algorithms
- Deploying Tiny LVLM Judges for Real-World Evaluation of Chart Models: Lessons Learned and Best Practices
- Locality-Sensitive Hashing-Based Efficient Point Transformer for Charged Particle Reconstruction
- From Data to Rewards: a Bilevel Optimization Perspective on Maximum Likelihood Estimation
- A Honest Cross-Validation Estimator for Prediction Performance
- Large Language Models Meet Virtual Cell: A Survey
- ToolExpander: Extending the Frontiers of Tool-Using Reinforcement Learning to Weak LLMs
- When Robustness Meets Conservativeness: Conformalized Uncertainty Calibration for Balanced Decision Making
- Instance Relation Learning Network with Label Knowledge Propagation for Few-shot Multi-label Intent Detection
- PLUM: Adapting Pre-trained Language Models for Industrial-scale Generative Recommendations
- Adaptive Execution Scheduler for DataDios SmartDiff
- Surrogate Graph Partitioning for Spatial Prediction
- On the Optimality of Tracking Fisher Information in Adaptive Testing with Stochastic Binary Responses
- On the Optimality of the Median-of-Means Estimator under Adversarial Contamination
- Multi-level informed optimization via decomposed Kriging for large design problems under uncertainty
- SimCast: Enhancing Precipitation Nowcasting with Short-to-Long Term Knowledge Distillation
- Stick-Breaking Mixture Normalizing Flows with Component-Wise Tail Adaptation for Variational Inference
- Climate Knowledge in Large Language Models
- Physics-Driven Spatiotemporal Modeling for AI-Generated Video Detection
- Detecting and Mitigating Insertion Hallucination in Video-to-Audio Generation
- Computations and ML for surjective rational maps
- Beyond Real Data: Synthetic Data through the Lens of Regularization
- Random Window Augmentations for Deep Learning Robustness in CT and Liver Tumor Segmentation
- High-dimensional Analysis of Synthetic Data Selection
- Investigating Counterclaims in Causality Extraction from Text
- New Machine Learning Approaches for Intrusion Detection in ADS-B
- PAC Learnability in the Presence of Performativity
- On the Relationship Between the Choice of Representation and In-Context Learning
- Optimal Stopping in Latent Diffusion Models
- Large Scale Diffusion Distillation via Score-Regularized Continuous-Time Consistency
- Navigating Sparsities in High-Dimensional Linear Contextual Bandits
- Wavefunction Flows: Efficient Quantum Simulation of Continuous Flow Models
- Don't Run with Scissors: Pruning Breaks VLA Models but They Can Be Recovered
- Accelerated Aggregated D-Optimal Designs for Estimating Main Effects in Black-Box Models
- DexMan: Learning Bimanual Dexterous Manipulation from Human and Generated Videos
- Implementing Semantic Join Operators Efficiently
- Permutation-Invariant Spectral Learning via Dyson Diffusion
- Computational and statistical lower bounds for low-rank estimation under general inhomogeneous noise
- SPAD: Specialized Prefill and Decode Hardware for Disaggregated LLM Inference
- Where Have All the Kaczmarz Iterates Gone?
- Reconstructing the local density field with combined convolutional and point cloud architecture
- Maintaining Performance with Less Data
- Stochastic Interpolants: A Unifying Framework for Flows and Diffusions
- Graph-SCP: Accelerating Set Cover Problems with Graph Neural Networks
- The Poisson Midpoint Method for Langevin Dynamics: Provably Efficient Discretization for Diffusion Models
- Adaptive Collaborative Correlation Learning-based Semi-Supervised Multi-Label Feature Selection
- Mitigating Noise Detriment in Differentially Private Federated Learning with Model Pre-training
- PFAttack: Stealthy Attack Bypassing Group Fairness in Federated Learning
- Personalized Federated Fine-Tuning for LLMs via Data-Driven Heterogeneous Model Architectures
- Empirical evaluation of normalizing flows in Markov Chain Monte Carlo
- Efficient Graph Condensation via Gaussian Process
- Learning General Causal Structures with Hidden Dynamic Process for Climate Analysis
- Task Vector Bases: A Unified and Scalable Framework for Compressed Task Arithmetic
- InfoPos: A Design Support Framework for ML-Assisted Fault Detection and Identification in Industrial Cyber-Physical Systems
- Uncertainty Comes for Free: Human-in-the-Loop Policies with Diffusion Models
- Learn to Bid as a Price-Maker Wind Power Producer
- Unified Cross-Scale 3D Generation and Understanding via Autoregressive Modeling
- Solving Time-Fractional Partial Integro-Differential Equations Using Tensor Neural Network
- Chisme: Fully Decentralized Differentiated Deep Learning for IoT Intelligence
- Learning Equilibria from Data: Provably Efficient Multi-Agent Imitation Learning
- Can Large Reasoning Models Self-Train?
- Martingale Posterior Neural Networks for Fast Sequential Decision Making
- Little By Little: Continual Learning via Self-Activated Sparse Mixture-of-Rank Adaptive Learning
- Anticipating the Selectivity of Intramolecular Cyclization Reaction Pathways with Neural Network Potentials
- Cost-aware Stopping for Bayesian Optimization
- A Kernel Distribution Closeness Testing
- HyPINO: Multi-Physics Neural Operators via HyperPINNs and the Method of Manufactured Solutions
- TiAda: A Time-scale Adaptive Algorithm for Nonconvex Minimax Optimization
- It's All in the Mix: Wasserstein Classification and Regression with Mixed Features
- Attention based End to end network for Offline Writer Identification on Word level data
- Data-Error Scaling Laws in Machine Learning on Combinatorial Mutation-prone Sets: Proteins and Small Molecules
- Recurrent Natural Policy Gradient for POMDPs
- MeanSparse: Post-Training Robustness Enhancement Through Mean-Centered Feature Sparsification
- BaTCAVe: Trustworthy Explanations for Robot Behaviors
- Latency-Aware Contextual Bandit: Application to Cryo-EM Data Collection
- Rethinking Decoders for Transformer-based Semantic Segmentation: A Compression Perspective
- Learning to Partially Defer for Sequences
- Distribution Transformers: Fast Approximate Bayesian Inference With On-The-Fly Prior Adaptation
- Erasing Without Remembering: Implicit Knowledge Forgetting in Large Language Models
- Markets for Models
- Efficient and Adaptable Overlapping for Computation and Communication via Signaling and Reordering
- Phantora: Maximizing Code Reuse in Simulation-based Machine Learning System Performance Estimation
- Efficient Multi Subject Visual Reconstruction from fMRI Using Aligned Representations
- SmartUT: Receive Beamforming for Spectral Coexistence of NGSO Satellite Systems
- Graphon Mixtures
- PO-Flow: Flow-based Generative Models for Sampling Potential Outcomes and Counterfactuals
- Foundation Models for Structural Health Monitoring
- Objective Features Extracted from Motor Activity Time Series for Food Addiction Analysis Using Machine Learning - A Pilot Study
- Nearest Neighbor CCP-Based Molecular Sequence Analysis
- Multi-Source Knowledge Pruning for Retrieval-Augmented Generation: A Benchmark and Empirical Study
- Language Model Embeddings Can Be Sufficient for Bayesian Optimization
- Multi-Continental Healthcare Modelling Using Blockchain-Enabled Federated Learning
- Matryoshka Pilot: Learning to Drive Black-Box LLMs with LLMs
- RAGDiffusion: Faithful Cloth Generation via External Knowledge Assimilation
- Kernel-Free Universum Quadratic Surface Twin Support Vector Machines for Imbalanced Data
- HiVeGen -- Hierarchical LLM-based Verilog Generation for Scalable Chip Design
- EpiCoder: Encompassing Diversity and Complexity in Code Generation
- BRIGHT: A globally distributed multimodal building damage assessment dataset with very-high-resolution for all-weather disaster response
- Self-Improving Skill Learning for Robust Skill-based Meta-Reinforcement Learning
- Rex: Reversible Solvers for Diffusion Models
- MoM: Linear Sequence Modeling with Mixture-of-Memories
- BixBench: a Comprehensive Benchmark for LLM-based Agents in Computational Biology
- Enhancing LLM Reliability via Explicit Knowledge Boundary Modeling
- LLM Applications: Current Paradigms and the Next Frontier
- Adoption of Watermarking for Generative AI Systems in Practice and Implications under the new EU AI Act
- More Bang for the Buck: Process Reward Modeling with Entropy-Driven Uncertainty
- Adaptive Layer-skipping in Pre-trained LLMs
- $\textit{Agents Under Siege}$: Breaking Pragmatic Multi-Agent LLM Systems with Optimized Prompt Attacks
- PiCo: Jailbreaking Multimodal Large Language Models via Pictorial Code Contextualization
- Hallucination Detection in LLMs with Topological Divergence on Attention Graphs
- T-VEC: A Telecom-Specific Vectorization Model with Enhanced Semantic Understanding via Deep Triplet Loss Fine-Tuning
- Evaluating Evaluation Metrics -- The Mirage of Hallucination Detection
- Understanding In-context Learning of Addition via Activation Subspaces
- Hakim: Farsi Text Embedding Model
- FairSHAP: Preprocessing for Fairness Through Attribution-Based Data Augmentation
- Logic Jailbreak: Efficiently Unlocking LLM Safety Restrictions Through Formal Logical Expression
- LLINBO: Trustworthy LLM-in-the-Loop Bayesian Optimization
- Watch your steps: Dormant Adversarial Behaviors that Activate upon LLM Finetuning
- Search Wisely: Mitigating Sub-optimal Agentic Searches By Reducing Uncertainty
- STOPA: A Database of Systematic VariaTion Of DeePfake Audio for Open-Set Source Tracing and Attribution
- Inference-time Alignment in Continuous Space
- The Shape of Adversarial Influence: Characterizing LLM Latent Spaces with Persistent Homology
- Trans-EnV: A Framework for Evaluating the Linguistic Robustness of LLMs Against English Varieties
- CAST: Contrastive Adaptation and Distillation for Semi-Supervised Instance Segmentation
- GL-PGENet: A Parameterized Generation Framework for Robust Document Image Enhancement
- MAGREF: Masked Guidance for Any-Reference Video Generation with Subject Disentanglement
- Tug-of-war between idioms' figurative and literal interpretations in LLMs
- Dissecting Logical Reasoning in LLMs: A Fine-Grained Evaluation and Supervision Study
- Modality-Balancing Preference Optimization of Large Multimodal Models by Adversarial Negative Mining
- Product of Experts for Visual Generation
- Intention-Conditioned Flow Occupancy Models
- ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning
- Rethinking Losses for Diffusion Bridge Samplers
- Not All Clients Are Equal: Collaborative Model Personalization on Heterogeneous Multi-Modal Clients
- Breaking the Reviewer: Assessing the Vulnerability of Large Language Models in Automated Peer Review Under Textual Adversarial Attacks
- The Role of Model Confidence on Bias Effects in Measured Uncertainties for Vision-Language Models
- LLMs on a Budget? Say HOLA
- Truth, Trust, and Trouble: Medical AI on the Edge
- Efficiency-Effectiveness Reranking FLOPs for LLM-based Rerankers
- ERR@HRI 2.0 Challenge: Multimodal Detection of Errors and Failures in Human-Robot Conversations
- Understanding Teen Overreliance on AI Companion Chatbots Through Self-Reported Reddit Narratives
- Leveraging Personalized PageRank and Higher-Order Topological Structures for Heterophily Mitigation in Graph Neural Networks
- A Modality-Aware Cooperative Co-Evolutionary Framework for Multimodal Graph Neural Architecture Search
- Out-of-Distribution Generalization in Climate-Aware Yield Prediction with Earth Observation Data
- ConCuR: Conciseness Makes State-of-the-Art Kernel Generation
- Best-of-Both Worlds for linear contextual bandits with paid observations
- Learning to Route LLMs from Bandit Feedback: One Policy, Many Trade-offs
- Parameter-Free Federated TD Learning with Markov Noise in Heterogeneous Environments
- metabeta - A fast neural model for Bayesian mixed-effects regression
- Surrogate Modeling for the Design of Optimal Lattice Structures using Tensor Completion
- Reinforcement Learning-based Task Offloading in the Internet of Wearable Things
- Black-box Detection of LLM-generated Text Using Generalized Jensen-Shannon Divergence
- PEAR: Planner-Executor Agent Robustness Benchmark
- Efficient Generalization via Multimodal Co-Training under Data Scarcity and Distribution Shift
- Estimating Fair Graphs from Graph-Stationary Data
- Targeted Digital Twin via Flow Map Learning and Its Application to Fluid Dynamics
- Phase Diagram of Dropout for Two-Layer Neural Networks in the Mean-Field Regime
- EBGAN-MDN: An Energy-Based Adversarial Framework for Multi-Modal Behavior Cloning
- Automated Machine Learning for Unsupervised Tabular Tasks
- Symbolic-Diffusion: Deep Learning Based Symbolic Regression with D3PM Discrete Token Diffusion
- Expanding the Action Space of LLMs to Reason Beyond Language
- Transformer-Based Indirect Structural Health Monitoring of Rail Infrastructure with Attention-Driven Detection and Localization of Transient Defects
- LLM Unlearning Under the Microscope: A Full-Stack View on Methods and Metrics
- Property Classification of Vacation Rental Properties during Covid-19
- Design-Based Bandits Under Network Interference: Trade-Off Between Regret and Statistical Inference
- Continual Learning for Adaptive AI Systems
- Incremental Hybrid Ensemble with Graph Attention and Frequency-Domain Features for Stable Long-Term Credit Risk Modeling
- FedQS: Optimizing Gradient and Model Aggregation for Semi-Asynchronous Federated Learning
- LiveThinking: Enabling Real-Time Efficient Reasoning for AI-Powered Livestreaming via Reinforcement Learning
- Computationally-efficient Graph Modeling with Refined Graph Random Features
- GeoGen: A Two-stage Coarse-to-Fine Framework for Fine-grained Synthetic Location-based Social Network Trajectory Generation
- t-SNE Exaggerates Clusters, Provably
- FedBook: A Unified Federated Graph Foundation Codebook with Intra-domain and Inter-domain Knowledge Modeling
- R\'enyi Sharpness: A Novel Sharpness that Strongly Correlates with Generalization
- FedLAM: Low-latency Wireless Federated Learning via Layer-wise Adaptive Modulation
- Weak Form Learning for Mean-Field Partial Differential Equations: an Application to Insect Movement
- HySim-LLM: Embedding-Weighted Fine-Tuning Bounds and Manifold Denoising for Domain-Adapted LLMs
- Signal-to-Noise Ratio in Scanning Electron Microscopy: A Comprehensive Review
- Adaptive Optimizable Gaussian Process Regression Linear Least Squares Regression Filtering Method for SEM Images
- GRADE: Personalized Multi-Task Fusion via Group-relative Reinforcement Learning with Adaptive Dirichlet Exploratio
- SketchGuard: Scaling Byzantine-Robust Decentralized Federated Learning via Sketch-Based Screening
- Synergy Between the Strong and the Weak: Spiking Neural Networks are Inherently Self-Distillers
- Some theoretical improvements on the tightness of PAC-Bayes risk certificates for neural networks
- PRESCRIBE: Predicting Single-Cell Responses with Bayesian Estimation
- Climate Surrogates for Scalable Multi-Agent Reinforcement Learning: A Case Study with CICERO-SCM
- DemandCast: Global hourly electricity demand forecasting
- Recycling Pretrained Checkpoints: Orthogonal Growth of Mixture-of-Experts for Efficient Large Language Model Pre-Training
- Accelerated Evolving Set Processes for Local PageRank Computation
- Unsupervised Radio Map Construction in Mixed LoS/NLoS Indoor Environments
- Do We Really Need Permutations? Impact of Width Expansion on Linear Mode Connectivity
- From Tokens to Layers: Redefining Stall-Free Scheduling for LLM Serving with Layered Prefill
- Mitigating Subject Dependency in EEG Decoding with Subject-Specific Low-Rank Adapters
- Trajectory Conditioned Cross-embodiment Skill Transfer
- Drift No More? Context Equilibria in Multi-Turn LLM Interactions
- IntentionVLA: Generalizable and Efficient Embodied Intention Reasoning for Human-Robot Interaction
- LLM4Cell: A Survey of Large Language and Agentic Models for Single-Cell Biology
- HiPRAG: Hierarchical Process Rewards for Efficient Agentic Retrieval Augmented Generation
- Dynamic Generation of Multi-LLM Agents Communication Topologies with Graph Diffusion Models
- Effective and Stealthy One-Shot Jailbreaks on Deployed Mobile Vision-Language Agents
- SIMU: Selective Influence Machine Unlearning
- The Rise of the Knowledge Sculptor: A New Archetype for Knowledge Work in the Age of Generative AI
- MetaDefense: Defending Finetuning-based Jailbreak Attack Before and During Generation
- Self-Improving LLM Agents at Test-Time
- AdaSwitch: Adaptive Switching Generation for Knowledge Distillation
- Meta-Learning Based Few-Shot Graph-Level Anomaly Detection
- Self-Supervised Learning Strategies for a Platform to Test the Toxicity of New Chemicals and Materials
- DM1: MeanFlow with Dispersive Regularization for 1-Step Robotic Manipulation
- Team Xiaomi EV-AD VLA: Learning to Navigate Socially Through Proactive Risk Perception - Technical Report for IROS 2025 RoboSense Challenge Social Navigation Track
- Contrastive Weak-to-strong Generalization
- MMM: Quantum-Chemical Molecular Representation Learning for Combinatorial Drug Recommendation
- Towards Human-Like Grading: A Unified LLM-Enhanced Framework for Subjective Question Evaluation
- STEPER: Step-wise Knowledge Distillation for Enhancing Reasoning Ability in Multi-Step Retrieval-Augmented Language Models
- TTOM: Test-Time Optimization and Memorization for Compositional Video Generation
- A Large-scale Dataset for Robust Complex Anime Scene Text Detection
- A$^2$Search: Ambiguity-Aware Question Answering with Reinforcement Learning
- DISCO: Diversifying Sample Condensation for Efficient Model Evaluation
- A Systematic Evaluation of Self-Supervised Learning for Label-Efficient Sleep Staging with Wearable EEG
- LightReasoner: Can Small Language Models Teach Large Language Models Reasoning?
- Active Confusion Expression in Large Language Models: Leveraging World Models toward Better Social Reasoning
- Executable Analytic Concepts as the Missing Link Between VLM Insight and Precise Manipulation
- Unveiling the Power of Multiple Gossip Steps: A Stability-Based Generalization Analysis in Decentralized Training
- ZeroCard: Cardinality Estimation with Zero Dependence on Target Databases -- No Data, No Query, No Retraining
- Is Architectural Complexity Always the Answer? A Case Study on SwinIR vs. an Efficient CNN
- Fewer Weights, More Problems: A Practical Attack on LLM Pruning
- Leveraging Author-Specific Context for Scientific Figure Caption Generation: 3rd SciCap Challenge
- Learning on the Job: An Experience-Driven Self-Evolving Agent for Long-Horizon Tasks
- Past, Present, and Future of Bug Tracking in the Generative AI Era
- Backdoor Vectors: a Task Arithmetic View on Backdoor Attacks and Defenses
- FastUMI-100K: Advancing Data-driven Robotic Manipulation with a Large-scale UMI-style Dataset
- MRI-derived quantification of hepatic vessel-to-volume ratios in chronic liver disease using a deep learning approach
- Towards Reliable LLM-based Robot Planning via Combined Uncertainty Estimation
- Verifying Graph Neural Networks with Readout is Intractable
- TaoSR-AGRL: Adaptive Guided Reinforcement Learning Framework for E-commerce Search Relevance
- A Survey of Process Reward Models: From Outcome Signals to Process Supervisions for Large Language Models
- FedDTRE: Federated Dialogue Generation Models Powered by Trustworthiness Evaluation
- Attribution-by-design: Ensuring Inference-Time Provenance in Generative Music Systems
- An Adaptive Multi Agent Bitcoin Trading System
- A Novel Ensemble Learning Approach for Enhanced IoT Attack Detection: Redefining Security Paradigms in Connected Systems
- Everything is Plausible: Investigating the Impact of LLM Rationales on Human Notions of Plausibility
- The Price of Thought: A Multilingual Analysis of Reasoning, Performance, and Cost of Negotiation in Large Language Models
- Lossless Vocabulary Reduction for Auto-Regressive Language Models
- Development of Mental Models in Human-AI Collaboration: A Conceptual Framework
- VersionRAG: Version-Aware Retrieval-Augmented Generation for Evolving Documents
- Bayesian Decision Making around Experts
- Interpreting LLM-as-a-Judge Policies via Verifiable Global Explanations
- Approximate Domain Unlearning for Vision-Language Models
- Improving Temporal Understanding Logic Consistency in Video-Language Models via Attention Enhancement
- Think Just Enough: Sequence-Level Entropy as a Confidence Signal for LLM Reasoning
- AI Knowledge Assist: An Automated Approach for the Creation of Knowledge Bases for Conversational AI Agents
- DACIP-RC: Domain Adaptive Continual Instruction Pre-Training via Reading Comprehension on Business Conversations
- Quantum Agents for Algorithmic Discovery
- NavSpace: How Navigation Agents Follow Spatial Intelligence Instructions
- Leveraging Whisper Embeddings for Audio-based Lyrics Matching
- Robust Canonicalization through Bootstrapped Data Re-Alignment
- Sentiment Matters: An Analysis of 200 Human-SAV Interactions
- Memory Retrieval and Consolidation in Large Language Models through Function Tokens
- LLMs Learn to Deceive Unintentionally: Emergent Misalignment in Dishonesty from Misaligned Samples to Biased Human-AI Interactions
- FuelCast: Benchmarking Tabular and Temporal Models for Ship Fuel Consumption
- Expressive Value Learning for Scalable Offline Reinforcement Learning
- The Hidden Bias: A Study on Explicit and Implicit Political Stereotypes in Large Language Models
- Contrastive Decoding for Synthetic Data Generation in Low-Resource Language Modeling
- Opponent Shaping in LLM Agents
- Mix- and MoE-DPO: A Variational Inference Approach to Direct Preference Optimization
- A Distributed Emulation Environment for In-Memory Computing Systems
- Learning Neural Exposure Fields for View Synthesis
- Counterfactual Identifiability via Dynamic Optimal Transport
- Iterated Agent for Symbolic Regression
- Learning What's Missing: Attention Dispersion and EMA Stabilization in Length Generalization
- DeepEN: Personalized Enteral Nutrition for Critically Ill Patients using Deep Reinforcement Learning
- Evaluating Small Vision-Language Models on Distance-Dependent Traffic Perception
- Airy: Reading Robot Intent through Height and Sky
- Detecting Legend Items on Historical Maps Using GPT-4o with In-Context Learning
- FlyLoRA: Boosting Task Decoupling and Parameter Efficiency via Implicit Rank-Wise Mixture-of-Experts
- Single layer tiny Co$^4$ outpaces GPT-2 and GPT-BERT
- Prompts Generalize with Low Data: Non-vacuous Generalization Bounds for Optimizing Prompts with More Informative Priors
- ClauseLens: Clause-Grounded, CVaR-Constrained Reinforcement Learning for Trustworthy Reinsurance Pricing
- xRouter: Training Cost-Aware LLMs Orchestration System via Reinforcement Learning
- Gaze on the Prize: Shaping Visual Attention with Return-Guided Contrastive Learning
- Synthetic Series-Symbol Data Generation for Time Series Foundation Models
- gLSTM: Mitigating Over-Squashing by Increasing Storage Capacity
- Integral Signatures of Activation Functions: A 9-Dimensional Taxonomy and Stability Theory for Deep Learning
- Platform-Agnostic Modular Architecture for Quantum Benchmarking
- DeepPrune: Parallel Scaling without Inter-trace Redundancy
- AI-Driven Radiology Report Generation for Traumatic Brain Injuries
- To Sink or Not to Sink: Visual Information Pathways in Large Vision-Language Models
- CoMAS: Co-Evolving Multi-Agent Systems via Interaction Rewards
- SpatialLadder: Progressive Training for Spatial Reasoning in Vision-Language Models
- Kontinuous Kontext: Continuous Strength Control for Instruction-based Image Editing
- On the optimization dynamics of RLVR: Gradient gap and step size thresholds
- VideoNorms: Benchmarking Cultural Awareness of Video Language Models
- Dream to Recall: Imagination-Guided Experience Retrieval for Memory-Persistent Vision-and-Language Navigation
- SciVideoBench: Benchmarking Scientific Video Reasoning in Large Multimodal Models
- MATRIX: Multimodal Agent Tuning for Robust Tool-Use Reasoning
- NovaFlow: Zero-Shot Manipulation via Actionable Flow from Generated Videos
- ArenaBencher: Automatic Benchmark Evolution via Multi-Model Competitive Evaluation
- BLAZER: Bootstrapping LLM-based Manipulation Agents with Zero-Shot Data Generation
- Advancing Automated Urban Planning: Exploring Algorithmic Approaches with Generative Artificial Intelligence
- LogicMP: A Neuro-symbolic Approach for Encoding First-order Logic Constraints
- Average Controlled and Average Natural Micro Direct Effects in Summary Causal Graphs
- Aligning LLM+PDDL Symbolic Plans with Human Objective Specifications through Evolutionary Algorithm Guidance
- BFS-Prover: Scalable Best-First Tree Search for LLM-based Automatic Theorem Proving
- AutoAgent: A Fully-Automated and Zero-Code Framework for LLM Agents
- Position Paper: Towards Open Complex Human-AI Agents Collaboration Systems for Problem Solving and Knowledge Management
- Advancing AI Research Assistants with Expert-Involved Learning
- Adaptive Frontier Exploration on Graphs with Applications to Network-Based Disease Testing
- Let's Reason Formally: Natural-Formal Hybrid Reasoning Enhances LLM's Math Capability
- Bloated Disclosures: Can ChatGPT Help Investors Process Information?
- Contrastive Difference Predictive Coding
- Ultra-Efficient On-Device Object Detection on AI-Integrated Smart Glasses with TinyissimoYOLO
- Thousands of AI Authors on the Future of AI
- Depression Detection on Social Media with Large Language Models
- Truth-Aware Decoding: A Program-Logic Approach to Factual Language Generation
- L2M-AID: Autonomous Cyber-Physical Defense by Fusing Semantic Reasoning of Large Language Models with Multi-Agent Reinforcement Learning (Preprint)
- Base Models Know How to Reason, Thinking Models Learn When
- Position: AI Will Transform Neuropsychology Through Mental Health Digital Twins for Dynamic Mental Health Care, Especially for ADHD
- ProSEA: Problem Solving via Exploration Agents
- Less is More: Strategic Expert Selection Outperforms Ensemble Complexity in Traffic Forecasting
- TS-Agent: A Time Series Reasoning Agent with Iterative Statistical Insight Gathering
- ExpertAgent: Enhancing Personalized Education through Dynamic Planning and Retrieval-Augmented Long-Chain Reasoning
- Evaluation of LLMs for Process Model Analysis and Optimization
- Optimizing Ethical Risk Reduction for Medical Intelligent Systems with Constraint Programming
- CompassLLM: A Multi-Agent Approach toward Geo-Spatial Reasoning for Popular Path Query
- Measuring and Mitigating Identity Bias in Multi-Agent Debate via Anonymization
- An Evaluation Study of Hybrid Methods for Multilingual PII Detection
- Benchmarking is Broken - Don't Let AI be its Own Judge
- AgentAsk: Multi-Agent Systems Need to Ask
- Traceability and Accountability in Role-Specialized Multi-Agent LLM Pipelines
- A Case for Leveraging Generative AI to Expand and Enhance Training in the Provision of Mental Health Services
- Test-Time Matching: Unlocking Compositional Reasoning in Multimodal Models
- Safely Exploring Novel Actions in Recommender Systems via Deployment-Efficient Policy Learning
- Multimodal Safety Evaluation in Generative Agent Social Simulations
- Control Synthesis of Cyber-Physical Systems for Real-Time Specifications through Causation-Guided Reinforcement Learning
- oMeBench: Towards Robust Benchmarking of LLMs in Organic Mechanism Elucidation and Reasoning
- SurveyG: A Multi-Agent LLM Framework with Hierarchical Citation Graph for Automated Survey Generation
- Haibu Mathematical-Medical Intelligent Agent:Enhancing Large Language Model Reliability in Medical Tasks via Verifiable Reasoning Chains
- From Noisy to Native: LLM-driven Graph Restoration for Test-Time Graph Domain Adaptation
- An approach for systematic decomposition of complex llm tasks
- GCPO: When Contrast Fails, Go Gold
- Strategic Communication under Threat: Learning Information Trade-offs in Pursuit-Evasion Games
- An LLM-Powered Cooperative Framework for Large-Scale Multi-Vehicle Navigation
- FinMR: A Knowledge-Intensive Multimodal Benchmark for Advanced Financial Reasoning
- Augur: Modeling Covariate Causal Associations in Time Series via Large Language Models
- Understanding DeepResearch via Reports
- Towards Meaningful Transparency in Civic AI Systems
- Profit Mirage: Revisiting Information Leakage in LLM-based Financial Agents
- Enabling Personalized Long-term Interactions in LLM-based Agents through Persistent Memory and User Profiles
- Agent-Based Genetic Algorithm for Crypto Trading Strategy Optimization
- TaoSR-SHE: Stepwise Hybrid Examination Reinforcement Learning Framework for E-commerce Search Relevance
- VoiceAgentBench: Are Voice Assistants ready for agentic tasks?
- ReInAgent: A Context-Aware GUI Agent Enabling Human-in-the-Loop Mobile Task Navigation
- Language Models Do Not Embed Numbers Continuously
- PEAR: Phase Entropy Aware Reward for Efficient Reasoning
- AILoRA: Function-Aware Asymmetric Initialization for Low-Rank Adaptation of Large Language Models
- LinguaSim: Interactive Multi-Vehicle Testing Scenario Generation via Natural Language Instruction Based on Large Language Models
- Multi-Condition Conformal Selection
- AutoQual: An LLM Agent for Automated Discovery of Interpretable Features for Review Quality Assessment
- From Ethical Declarations to Provable Independence: An Ontology-Driven Optimal-Transport Framework for Certifiably Fair AI Systems
- Can Risk-taking AI-Assistants suitably represent entities
- Prepared mind, fast response: A temporal decoupling framework for adaptive knowledge orchestration in open-domain dialogue
- R-Horizon: How Far Can Your Large Reasoning Model Really Go in Breadth and Depth?
- Measuring What Matters: The AI Pluralism Index
- The Tournament Tree Method for preference elicitation in Multi-criteria decision-making
- DODO: Causal Structure Learning with Budgeted Interventions
- Selection, Reflection and Self-Refinement: Revisit Reasoning Tasks via a Causal Lens
- Chain-of-Trigger: An Agentic Backdoor that Paradoxically Enhances Agentic Robustness
- Co-TAP: Three-Layer Agent Interaction Protocol Technical Report
- Symmetry-Aware Fully-Amortized Optimization with Scale Equivariant Graph Metanetworks
- First Try Matters: Revisiting the Role of Reflection in Reasoning Models
- Beyond Pass@k: Breadth-Depth Metrics for Reasoning Boundaries
- LLMs Reproduce Human Purchase Intent via Semantic Similarity Elicitation of Likert Ratings
- QAgent: A modular Search Agent with Interactive Query Understanding
- Revisiting Hallucination Detection with Effective Rank-based Uncertainty
- Looking to Learn: Token-wise Dynamic Gating for Low-Resource Vision-Language Modelling
- AutoMLGen: Navigating Fine-Grained Optimization for Coding Agents
- CaRT: Teaching LLM Agents to Know When They Know Enough
- FlowSearch: Advancing deep research with dynamic structured knowledge flow
- Agent Learning via Early Experience
- How to Teach Large Multimodal Models New Skills
- Deep Learning Based Approach to Enhanced Recognition of Emotions and Behavioral Patterns of Autistic Children
- MultiFair: Multimodal Balanced Fairness-Aware Medical Classification with Dual-Level Gradient Modulation
- Local MAP Sampling for Diffusion Models
- Mitigating Surgical Data Imbalance with Dual-Prediction Video Diffusion Model
- Encode, Think, Decode: Scaling test-time reasoning with recursive latent thoughts
- Attention to Order: Transformers Discover Phase Transitions via Learnability
- Quantum Grid Path Planning Using Parallel QAOA Circuits Based on Minimum Energy Principle
- Haystack Engineering: Context Engineering for Heterogeneous and Agentic Long-Context Evaluation
- LASER: An LLM-based ASR Scoring and Evaluation Rubric
- Minimizing the Value-at-Risk of Loan Portfolio via Deep Neural Networks
- MoGU: Mixture-of-Gaussians with Uncertainty-based Gating for Time Series Forecasting
- HEMERA: A Human-Explainable Transformer Model for Estimating Lung Cancer Risk using GWAS Data
- Can Lessons From Human Teams Be Applied to Multi-Agent Systems? The Role of Structure, Diversity, and Interaction Dynamics
- A Denoising Framework for Real-World Ultra-Low Dose Lung CT Images Based on an Image Purification Strategy
- Can Speech LLMs Think while Listening?
- When Thoughts Meet Facts: Reusable Reasoning for Long-Context LMs
- MLLM4TS: Leveraging Vision and Multimodal Language Models for General Time-Series Analysis
- EEG Sleep Stage Classification with Continuous Wavelet Transform and Deep Learning
- OWL: Overcoming Window Length-Dependence in Speculative Decoding for Long-Context Inputs
- TRAVL: A Recipe for Making Video-Language Models Better Judges of Physics Implausibility
- Label Semantics for Robust Hyperspectral Image Classification
- Investigating Thematic Patterns and User Preferences in LLM Interactions using BERTopic
- Multi-Task Pre-Finetuning of Lightweight Transformer Encoders for Text Classification and NER
- Accuracy, Memory Efficiency and Generalization: A Comparative Study on Liquid Neural Networks and Recurrent Neural Networks
- Linguistic Patterns in Pandemic-Related Content: A Comparative Analysis of COVID-19, Constraint, and Monkeypox Datasets
- TGM: a Modular and Efficient Library for Machine Learning on Temporal Graphs
- Vocabulary embeddings organize linguistic structure early in language model training
- DGTEN: A Robust Deep Gaussian based Graph Neural Network for Dynamic Trust Evaluation with Uncertainty-Quantification Support
- Retentive Relevance: Capturing Long-Term User Value in Recommendation Systems
- Banking Done Right: Redefining Retail Banking with Language-Centric AI
- Value Flows
- OBCache: Optimal Brain KV Cache Pruning for Efficient Long-Context LLM Inference
- IKNet: Interpretable Stock Price Prediction via Keyword-Guided Integration of News and Technical Indicators
- TCIP: Threshold-Controlled Iterative Pyramid Network for Deformable Medical Image Registration
- Controllable Video Synthesis via Variational Inference
- Curriculum Learning with Synthetic Data for Enhanced Pulmonary Nodule Detection in Chest Radiographs
- Stress-Testing Model Specs Reveals Character Differences among Language Models
- Rethinking Reasoning: A Survey on Reasoning-based Backdoors in LLMs
- Causality Guided Representation Learning for Cross-Style Hate Speech Detection
- DEAS: DEtached value learning with Action Sequence for Scalable Offline RL
- MeSH: Memory-as-State-Highways for Recursive Transformers
- AppForge: From Assistant to Independent Developer - Are GPTs Ready for Software Development?
- UltraLED: Learning to See Everything in Ultra-High Dynamic Range Scenes
- Parallel Test-Time Scaling for Latent Reasoning Models
- A Unified Multi-Task Learning Framework for Generative Auto-Bidding with Validation-Aligned Optimization
- ToolLibGen: Scalable Automatic Tool Creation and Aggregation for LLM Reasoning
Research Sources: 650 | Generated: 10/11/2025