AI RESEARCH PAPERS & ACADEMIC SOURCES
- AgentIAD: Tool-Augmented Single-Agent for Industrial Anomaly Detection
- JoVA: Unified Multimodal Learning for Joint Video-Audio Generation
- LASER: Layer-wise Scale Alignment for Training-Free Streaming 4D Reconstruction
- I-Scene: 3D Instance Models are Implicit Generalizable Spatial Learners
- Recurrent Video Masked Autoencoders
- Towards Scalable Pre-training of Visual Tokenizers for Generation
- LitePT: Lighter Yet Stronger Point Transformer
- Benchmarking Tesla's Traffic Light and Stop Sign Control: Field Dataset and Behavior Insights
- A Reproducible Workflow for Scraping, Structuring, and Segmenting Legacy Archaeological Artifact Images
- ReGlove: A Soft Pneumatic Glove for Activities of Daily Living Assistance via Wrist-Mounted Vision
- Aion: Towards Hierarchical 4D Scene Graphs with Temporal Flow Dynamics
- Pre-training vision models for the classification of alerts from wide-field time-domain surveys
- AutoMV: An Automatic Multi-Agent System for Music Video Generation
- Navigation Around Unknown Space Objects Using Visible-Thermal Image Fusion
- Resolution-Independent Neural Operators for Multi-Rate Sparse-View CT
- JPEG-Inspired Cloud-Edge Holography
- Hybrid Retrieval-Augmented Generation for Robust Multilingual Document Question Answering
- JointAVBench: A Benchmark for Joint Audio-Visual Reasoning Evaluation
- SLIM-VDB: A Real-Time 3D Probabilistic Semantic Mapping Framework
- Leveraging Compression to Construct Transferable Bitrate Ladders
- Post-Training and Test-Time Scaling of Generative Agent Behavior Models for Interactive Autonomous Driving
- Self-Supervised Ultrasound Representation Learning for Renal Anomaly Prediction in Prenatal Imaging
- RoboTracer: Mastering Spatial Trace with Reasoning in Vision-Language Models for Robotics
- We Can Always Catch You: Detecting Adversarial Patched Objects WITH or WITHOUT Signature
- ExReg: Wide-range Photo Exposure Correction via a Multi-dimensional Regressor with Attention
- An Efficient and Harmonized Framework for Balanced Cross-Domain Feature Integration
- TimeWalker: Personalized Neural Space for Lifelong Head Avatars
- Deep priors for satellite image restoration with accurate uncertainties
- GeoTexDensifier: Geometry-Texture-Aware Densification for High-Quality Photorealistic 3D Gaussian Splatting
- Establishing Reality-Virtuality Interconnections in Urban Digital Twins for Superior Intelligent Road Inspection and Simulation
- VisualCloze: A Universal Image Generation Framework via Visual In-Context Learning
- Counting Hallucinations in Diffusion Models
- Tau Anomaly Detection in PET Imaging via Bilateral-Guided Deterministic Diffusion Model
- From Particles to Fields: Reframing Photon Mapping with Continuous Gaussian Photon Fields
- More Than the Final Answer: Improving Visual Extraction and Logical Consistency in Vision-Language Models
- Advancing Cache-Based Few-Shot Classification via Patch-Driven Relational Gated Graph Attention
- Anatomy Guided Coronary Artery Segmentation from CCTA Using Spatial Frequency Joint Modeling
- From Tokens to Photons: Test-Time Physical Prompting for Vision-Language Models
- StegaVAR: Privacy-Preserving Video Action Recognition via Steganographic Domain Analysis
- Automatic Wire-Harness Color Sequence Detector
- Vision-Enhanced Large Language Models for High-Resolution Image Synthesis and Multimodal Data Interpretation
- Geometry-Aware Scene-Consistent Image Generation
- No Cache Left Idle: Accelerating diffusion model via Extreme-slimming Caching
- Patch-wise Retrieval: A Bag of Practical Techniques for Instance-level Matching
- D3D-VLP: Dynamic 3D Vision-Language-Planning Model for Embodied Grounding and Navigation
- Cross-modal Fundus Image Registration under Large FoV Disparity
- CogDoc: Towards Unified thinking in Documents
- InteracTalker: Prompt-Based Human-Object Interaction with Co-Speech Gesture Generation
- Open-World Deepfake Attribution via Confidence-Aware Asymmetric Learning
- Progressive Conditioned Scale-Shift Recalibration of Self-Attention for Online Test-time Adaptation
- β-CLIP: Text-Conditioned Contrastive Learning for Multi-Granular Vision-Language Alignment
- Spinal Line Detection for Posture Evaluation through Training-free 3D Human Body Reconstruction with 2D Depth Images
- GenieDrive: Towards Physics-Aware Driving World Model with 4D Occupancy Guided Video Generation
- FysicsWorld: A Unified Full-Modality Benchmark for Any-to-Any Understanding, Generation, and Reasoning
- Fast 2DGS: Efficient Image Representation with Deep Gaussian Prior
- L-STEC: Learned Video Compression with Long-term Spatio-Temporal Enhanced Context
- DrivePI: Spatial-aware 4D MLLM for Unified Autonomous Driving Understanding, Perception, Prediction and Planning
- Learning Common and Salient Generative Factors Between Two Image Datasets
- Schrodinger Audio-Visual Editor: Object-Level Audiovisual Removal
- Cross-Level Sensor Fusion with Object Lists via Transformer for 3D Object Detection
- Revisiting 2D Foundation Models for Scalable 3D Medical Image Classification
- Predictive Sample Assignment for Semantically Coherent Out-of-Distribution Detection
- Sharpness-aware Dynamic Anchor Selection for Generalized Category Discovery
- UAGLNet: Uncertainty-Aggregated Global-Local Fusion Network with Cooperative CNN-Transformer for Building Extraction
- SCAdapter: Content-Style Disentanglement for Diffusion Style Transfer
- VLCache: Computing 2% Vision Tokens and Reusing 98% for Vision-Language Inference
- Scaling Up AI-Generated Image Detection via Generator-Aware Prototypes
- Few-Step Distillation for Text-to-Image Generation: A Practical Guide
- Light Field Based 6DoF Tracking of Previously Unobserved Objects
- TWLR: Text-Guided Weakly-Supervised Lesion Localization and Severity Regression for Explainable Diabetic Retinopathy Grading
- JoDiffusion: Jointly Diffusing Image with Pixel-Level Annotations for Semantic Segmentation Promotion
- What Happens Next? Next Scene Prediction with a Unified Video Model
- SneakPeek: Future-Guided Instructional Streaming Video Generation
- Comprehensive Evaluation of Rule-Based, Machine Learning, and Deep Learning in Human Estimation Using Radio Wave Sensing: Accuracy, Spatial Generalization, and Output Granularity Trade-offs
- Bi-Erasing: A Bidirectional Framework for Concept Removal in Diffusion Models
- Towards Test-time Efficient Visual Place Recognition via Asymmetric Query Processing
- Forging a Dynamic Memory: Retrieval-Guided Continual Learning for Generalist Medical Foundation Models
- FID-Net: A Feature-Enhanced Deep Learning Network for Forest Infestation Detection
- LeafTrackNet: A Deep Learning Framework for Robust Leaf Tracking in Top-Down Plant Phenotyping
- StarryGazer: Leveraging Monocular Depth Estimation Models for Domain-Agnostic Single Depth Image Completion
- Seeing the Whole Picture: Distribution-Guided Data-Free Distillation for Semantic Segmentation
- MMDrive: Interactive Scene Understanding Beyond Vision with Multi-representational Fusion
- CoRA: A Collaborative Robust Architecture with Hybrid Fusion for Efficient Perception
- POLAR: A Portrait OLAT Dataset and Generative Framework for Illumination-Aware Face Modeling
- Ego-EXTRA: video-language Egocentric Dataset for EXpert-TRAinee assistance
- STARCaster: Spatio-Temporal AutoRegressive Video Diffusion for Identity- and View-Aware Talking Portraits
- Toward Ambulatory Vision: Learning Visually-Grounded Active View Selection
- CogniEdit: Dense Gradient Flow Optimization for Fine-Grained Image Editing
- Video Reality Test: Can AI-Generated ASMR Videos fool VLMs and Humans?
- CausalCLIP: Causally-Informed Feature Disentanglement and Filtering for Generalizable Detection of Generated Images
- ShowTable: Unlocking Creative Table Visualization with Collaborative Reflection and Refinement
- KlingAvatar 2.0 Technical Report
- Automated User Identification from Facial Thermograms with Siamese Networks
- Unlocking Generalization in Polyp Segmentation with DINO Self-Attention "keys"
- Beyond the Visible: Disocclusion-Aware Editing via Proxy Dynamic Graphs
- Computer vision training dataset generation for robotic environments using Gaussian splatting
- USTM: Unified Spatial and Temporal Modeling for Continuous Sign Language Recognition
- Learning to Generate Cross-Task Unexploitable Examples
- RecTok: Reconstruction Distillation along Rectified Flow
- A Domain-Adapted Lightweight Ensemble for Resource-Efficient Few-Shot Plant Disease Classification
- IMILIA: interpretable multiple instance learning for inflammation prediction in IBD from H&E whole slide images
- Test-Time Modification: Inverse Domain Transformation for Robust Perception
- PoseAnything: Universal Pose-guided Video Generation with Part-aware Temporal Coherence
- Transform Trained Transformer: Accelerating Naive 4K Video Generation Over 10×
- Soul: Breathe Life into Digital Human for High-fidelity Long-term Multimodal Animation
- Seedance 1.5 pro: A Native Audio-Visual Joint Generation Foundation Model
- TARA: Simple and Efficient Time Aware Retrieval Adaptation of MLLMs for Video Understanding
- 3D Human-Human Interaction Anomaly Detection
- MMhops-R1: Multimodal Multi-hop Reasoning
- Lighting in Motion: Spatiotemporal HDR Lighting Estimation
- LongVie 2: Multimodal Controllable Ultra-Long Video World Model
- DBT-DINO: Towards Foundation model based analysis of Digital Breast Tomosynthesis
- SCR2-ST: Combine Single Cell with Spatial Transcriptomics for Efficient Active Sampling via Reinforcement Learning
- MindDrive: A Vision-Language-Action Model for Autonomous Driving via Online Reinforcement Learning
- Charge: A Comprehensive Novel View Synthesis Benchmark and Dataset to Bind Them All
- Grab-3D: Detecting AI-Generated Videos from 3D Geometric Temporal Consistency
- PrahokBART: A Pre-trained Sequence-to-Sequence Model for Khmer Natural Language Generation
- Comparative Analysis of LLM Abliteration Methods: A Cross-Architecture Evaluation
- A stylometric analysis of speaker attribution from speech transcripts
- Towards Effective Model Editing for LLM Personalization
- Beyond surface form: A pipeline for semantic analysis in Alzheimer's Disease detection from spontaneous speech
- VEGAS: Mitigating Hallucinations in Large Vision-Language Models via Vision-Encoder Attention Guided Adaptive Steering
- From Human Intention to Action Prediction: A Comprehensive Benchmark for Intention-driven End-to-End Autonomous Driving
- VideoARM: Agentic Reasoning over Hierarchical Memory for Long-Form Video Understanding
- The Morphemic Origin of Zipf's Law: A Factorized Combinatorial Framework
- Adaptive Detector-Verifier Framework for Zero-Shot Polyp Detection in Open-World Settings
- Reasoning Within the Mind: Dynamic Multimodal Interleaving in Latent Space
- ERA-IT: Aligning Semantic Models with Revealed Economic Preference for Real-Time and Explainable Patent Valuation
- Heart Disease Prediction using Case Based Reasoning (CBR)
- SIGMA: An AI-Empowered Training Stack on Early-Life Hardware
- Fine-tuned LLM-based Code Migration Framework
- Towards Interactive Intelligence for Digital Humans
- DABL: Detecting Semantic Anomalies in Business Processes Using Large Language Models
- Safety Alignment of Large Language Models via Contrasting Safe and Harmful Distributions
- DeBERTa-KC: A Transformer-Based Classifier for Knowledge Construction in Online Learning Discourse
- Temporal-Anchor3DLane: Enhanced 3D Lane Detection with Multi-Task Losses and LSTM Fusion
- Pseudo-Label Refinement for Robust Wheat Head Segmentation via Two-Stage Hybrid Training
- Generalization vs. Specialization: Evaluating Segment Anything Model (SAM3) Zero-Shot Segmentation Against Fine-Tuned YOLO Detectors
- Hot Hém: Sài Gòn Giũa Cái Nóng Hông Còng Bàng -- Saigon in Unequal Heat
- Microscopic Vehicle Trajectory Datasets from UAV-collected Video for Heterogeneous, Area-Based Urban Traffic
- Read or Ignore? A Unified Benchmark for Typographic-Attack Robustness and Text Recognition in Vision-Language Models
- Smartphone monitoring of smiling as a behavioral proxy of well-being in everyday life
- TransBridge: Boost 3D Object Detection by Scene-Level Completion with Transformer Decoder
- Contextual Peano Scan and Fast Image Segmentation Using Hidden and Evidential Markov Chains
- A Comparative Analysis of Semiconductor Wafer Map Defect Detection with Image Transformer
- CARI4D: Category Agnostic 4D Reconstruction of Human-Object Interaction
- Adaptive federated learning for ship detection across diverse satellite imagery sources
- Enhancing deep learning performance on burned area delineation from SPOT-6/7 imagery for emergency management
- RePack: Representation Packing of Vision Foundation Model Features Enhances Diffusion Transformer
- EchoVLM: Measurement-Grounded Multimodal Learning for Echocardiography
- Open Horizons: Evaluating Deep Models in the Wild
- Audio-Visual Camera Pose Estimation with Passive Scene Sounds and In-the-Wild Video
- SMRABooth: Subject and Motion Representation Alignment for Customized Video Generation
- A Multi-Year Urban Streetlight Imagery Dataset for Visual Monitoring and Spatio-Temporal Drift Detection
- A Hybrid Deep Learning Framework for Emotion Recognition in Children with Autism During NAO Robot-Mediated Interaction
- CineLOG: A Training Free Approach for Cinematic Long Video Generation
- Fine-Grained Zero-Shot Learning with Attribute-Centric Representations
- ProImage-Bench: Rubric-Based Evaluation for Professional Image Generation
- Ultra-Low Bitrate Perceptual Image Compression with Shallow Encoder
- Moment and Highlight Detection via MLLM Frame Segmentation
- MetaTPT: Meta Test-time Prompt Tuning for Vision-Language Models
- Feature Aggregation for Efficient Continual Learning of Complex Facial Expressions
- Cognitive-YOLO: LLM-Driven Architecture Synthesis from First Principles of Data for Object Detection
- RealDrag: The First Dragging Benchmark with Real Target Image
- OMUDA: Omni-level Masking for Unsupervised Domain Adaptation in Semantic Segmentation
- MRD: Using Physically Based Differentiable Rendering to Probe Vision Models for 3D Scene Understanding
- WeDetect: Fast Open-Vocabulary Object Detection as Retrieval
- TCLeaf-Net: a transformer-convolution framework with global-local attention for robust in-field lesion-level plant leaf disease detection
- STAGE: Storyboard-Anchored Generation for Cinematic Multi-shot Narrative
- V-Warper: Appearance-Consistent Video Diffusion Personalization via Value Warping
- M4Human: A Large-Scale Multimodal mmWave Radar Benchmark for Human Mesh Reconstruction
- Speedrunning ImageNet Diffusion
- ArtGen: Conditional Generative Modeling of Articulated Objects in Arbitrary Part-Level States
- BokehDepth: Enhancing Monocular Depth Estimation through Bokeh Generation
- Endless World: Real-Time 3D-Aware Long Video Generation
- Bilevel ZOFO: Efficient LLM Fine-Tuning and Meta-Training
- Comparative Analysis of Wave Scattering Numerical Modeling Using the Boundary Element Method and Physics-Informed Neural Networks
- CSAW-M: An Ordinal Classification Dataset for Benchmarking Mammographic Masking of Cancer
- The prediction of the quality of results in Logic Synthesis using Transformer and Graph Neural Networks
- An Anytime Algorithm for Good Arm Identification
- "All of Me": Mining Users' Attributes from their Public Spotify Playlists
- CIC: Circular Image Compression
- Deep-ER: Deep Learning ECCENTRIC Reconstruction for fast high-resolution neurometabolic imaging
- WALINET: A water and lipid identification convolutional Neural Network for nuisance signal removal in 1H MR Spectroscopic Imaging
- On the physics of nested Markov models: a generalized probabilistic theory perspective
- QUOTA: Quantifying Objects with Text-to-Image Models for Any Domain
- Self-test loss functions for learning weak-form operators and gradient flows
- Navigating AI to Unpack Youth Privacy Concerns: An In-Depth Exploration and Systematic Review
- A Physics-Embedded Dual-Learning Imaging Framework for Electrical Impedance Tomography
- Compact Neural Network Algorithm for Electrocardiogram Classification
- Multipole Attention for Efficient Long Context Reasoning
- A PyTorch Framework for Scalable Non-Crossing Quantile Regression
- AQCat25: Unlocking spin-aware, high-fidelity machine learning potentials for heterogeneous catalysis
- Direct Confidence Alignment: Aligning Verbalized Confidence with Internal Confidence In Large Language Models
- Benchmarking Contextual Understanding for In-Car Conversational Systems
- BLASST: Dynamic BLocked Attention Sparsity via Softmax Thresholding
- Market-Bench: Evaluating Large Language Models on Introductory Quantitative Trading and Market Dynamics
- F5-TTS-RO: Extending F5-TTS to Romanian TTS via Lightweight Input Adaptation
- Can GPT replace human raters? Validity and reliability of machine-generated norms for metaphors
- Large language models have learned to use language
- The American Ghost in the Machine: How language models align culturally and the effects of cultural prompting
- NagaNLP: Bootstrapping NLP for Low-Resource Nagamese Creole with Human-in-the-Loop Synthetic Data
- StruProKGR: A Structural and Probabilistic Framework for Sparse Knowledge Graph Reasoning
- Which Pieces Does Unigram Tokenization Really Need?
- LexRel: Benchmarking Legal Relation Extraction for Chinese Civil Cases
- CoDA: A Context-Decoupled Hierarchical Agent with Reinforcement Learning
- NL2Repo-Bench: Towards Long-Horizon Repository Generation Evaluation of Coding Agents
- Curió-Edu 7B: Examining Data Selection Impacts in LLM Continued Pretraining
- Persistent Personas? Role-Playing, Instruction Following, and Safety in Extended Interactions
- What Matters in Evaluating Book-Length Stories? A Systematic Study of Long Story Evaluation
- QwenLong-L1.5: Post-Training Recipe for Long-Context Reasoning and Memory Management
- Authors Should Annotate
- An Open and Reproducible Deep Research Agent for Long-Form Question Answering
- AIR: Post-training Data Selection for Reasoning via Attention Head Influence
- Integrating Causal Reasoning into Automated Fact-Checking
- Large language models are not about language
- Scaling Laws for Code: Every Programming Language Matters
- Advancing Bangla Machine Translation Through Informal Datasets
- CreativeVR: Diffusion-Prior-Guided Approach for Structure and Motion Restoration in Generative and Real Videos
- VOYAGER: A Training Free Approach for Generating Diverse Datasets using LLMs
- BAgger: Backwards Aggregation for Mitigating Drift in Autoregressive Video Diffusion Models
- SPDMark: Selective Parameter Displacement for Robust Video Watermarking
- AI-Augmented Pollen Recognition in Optical and Holographic Microscopy for Veterinary Imaging
- A Novel Patch-Based TDA Approach for Computed Tomography
- Citation-Grounded Code Comprehension: Preventing LLM Hallucination Through Hybrid Retrieval and Graph-Augmented Context
- Modeling Dabrafenib Response Using Multi-Omics Modality Fusion and Protein Network Embeddings Based on Graph Convolutional Networks
- Keep the Lights On, Keep the Lengths in Check: Plug-In Adversarial Detection for Time-Series LLMs in Energy Forecasting
- Journey Before Destination: On the importance of Visual Faithfulness in Slow Thinking
- Learning to Get Up Across Morphologies: Zero-Shot Recovery with a Unified Humanoid Policy
- Hellinger loss function for Generative Adversarial Networks
- Robust Outlier Detection and Low-Latency Concept Drift Adaptation for Data Stream Regression: A Dual-Channel Architecture
- Near-Zero-Overhead Freshness for Recommendation Systems via Inference-Side Model Updates
- GrowTAS: Progressive Expansion from Small to Large Subnets for Efficient ViT Architecture Search
- Extending the application of dynamic Bayesian networks in calculating market risk: Standard and stressed expected shortfall
- Unified Control for Inference-Time Guidance of Denoising Diffusion Models
- Towards a pretrained deep learning estimator of the Linfoot informational correlation
- ElasticVR: Elastic Task Computing in Multi-User Multi-Connectivity Wireless Virtual Reality (VR) Systems
- ViInfographicVQA: A Benchmark for Single and Multi-image Visual Question Answering on Vietnamese Infographics
- Data-driven modelling of autonomous and forced dynamical systems
- Co-Hub Node Based Multiview Graph Learning with Theoretical Guarantees
- Efficient Level-Crossing Probability Calculation for Gaussian Process Modeled Data
- Breaking the Curse of Dimensionality: On the Stability of Modern Vector Retrieval
- Understanding Overparametrization in Survival Models through Double-Descent
- Generative Spatiotemporal Data Augmentation
- Animus3D: Text-driven 3D Animation via Motion Score Distillation
- HyperEdit: Unlocking Instruction-based Text Editing in LLMs via Hypernetworks
- Supervised Contrastive Frame Aggregation for Video Representation Learning
- Iterative Sampling Methods for Sinkhorn Distributionally Robust Optimization
- Mind the Jumps: A Scalable Robust Local Gaussian Process for Multidimensional Response Surfaces with Discontinuities
- Scalable Quantum Error Mitigation with Neighbor-Informed Learning
- ceLLMate: Sandboxing Browser AI Agents
- CoLSE: A Lightweight and Robust Hybrid Learned Model for Single-Table Cardinality Estimation using Joint CDF
- Modeling Authorial Style in Urdu Novels Using Character Interaction Graphs and Graph Neural Networks
- Robust Variational Bayes by Min-Max Median Aggregation
- Efficient Vision-Language Reasoning via Adaptive Token Pruning
- Practical Hybrid Quantum Language Models with Observable Readout on Real Hardware
- Self-Motivated Growing Neural Network for Adaptive Architecture via Local Structural Plasticity
- Limits To (Machine) Learning
- Transport Reversible Jump Markov Chain Monte Carlo with proposals generated by Variational Inference with Normalizing Flows
- Flow-matching Operators for Residual-Augmented Probabilistic Learning of Partial Differential Equations
- An End-to-End Approach for Microgrid Probabilistic Forecasting and Robust Operation via Decision-focused Learning
- HaShiFlex: A High-Throughput Hardened Shifter DNN Accelerator with Fine-Tuning Flexibility
- KANELÉ: Kolmogorov-Arnold Networks for Efficient LUT-based Evaluation
- Qonvolution: Towards Learning High-Frequency Signals with Queried Convolution
- PAC-Bayes Bounds for Multivariate Linear Regression and Linear Autoencoders
- Evaluating Singular Value Thresholds for DNN Weight Matrices based on Random Matrix Theory
- Continuous Edit Distance, Geodesics and Barycenters of Time-varying Persistence Diagrams
- VoroLight: Learning Quality Volumetric Voronoi Meshes from General Inputs
- General OOD Detection via Model-aware and Subspace-aware Variable Priority
- Comprehensive Deployment-Oriented Assessment for Cross-Environment Generalization in Deep Learning-Based mmWave Radar Sensing
- Motus: A Unified Latent Action World Model
- Progressive Refinement of E-commerce Search Ranking Based on Short-Term Activities of the Buyer
- DiRe: Diversity-promoting Regularization for Dataset Condensation
- PvP: Data-Efficient Humanoid Robot Learning with Proprioceptive-Privileged Contrastive Representations
- ADHint: Adaptive Hints with Difficulty Priors for Reinforcement Learning
- Towards Practical Large-scale Dynamical Heterogeneous Graph Embedding: Cold-start Resilient Recommendation
- Stopping Rules for Stochastic Gradient Descent via Anytime-Valid Confidence Sequences
- Weight Space Correlation Analysis: Quantifying Feature Utilization in Deep Learning Models
- Iterative Tuning of Nonlinear Model Predictive Control for Robotic Manufacturing Tasks
- MicroPhaseNO: Adapting an Earthquake-Trained Phase Neural Operator for Microseismic Phase Picking
- Rethinking Physics-Informed Regression Beyond Training Loops and Bespoke Architectures
- Better LMO-based Momentum Methods with Second-Order Information
- AutoTool: Dynamic Tool Selection and Integration for Agentic Reasoning
- Fast Policy Learning for 6-DOF Position Control of Underwater Vehicles
- rNCA: Self-Repairing Segmentation Masks
- MineTheGap: Automatic Mining of Biases in Text-to-Image Models
- Real-Time AI-Driven Milling Digital Twin Towards Extreme Low-Latency
- From Zipf's Law to Neural Scaling through Heaps' Law and Hilberg's Hypothesis
- A Deep Learning Model of Mental Rotation Informed by Interactive VR Experiments
- Enhancing lithological interpretation from petrophysical well log of IODP expedition 390/393 using machine learning
- Actively Learning Joint Contours of Multiple Computer Experiments
- Adaptive Sampling for Hydrodynamic Stability
- Pancakes: Consistent Multi-Protocol Image Segmentation Across Biomedical Domains
- A Nonparametric Statistics Approach to Feature Selection in Deep Neural Networks with Theoretical Guarantees
- Textual Gradients are a Flawed Metaphor for Automatic Prompt Optimization
- Do-Undo: Generating and Reversing Physical Actions in Vision-Language Models
- Temporal Tokenization Strategies for Event Sequence Modeling with Large Language Models
- Universality of high-dimensional scaling limits of stochastic gradient descent
- SEDULity: A Proof-of-Learning Framework for Distributed and Secure Blockchains with Efficient Useful Work
- Adaptive Risk Mitigation in Demand Learning
- Vertical Semi-Federated Learning for Efficient Online Advertising
- Certifying Robustness of Graph Convolutional Networks for Node Perturbation with Polyhedra Abstract Interpretation
- Dynamic Fraud Detection: Integrating Reinforcement Learning into Graph Neural Networks
- The Implicit Bias of Structured State Space Models Can Be Poisoned With Clean Labels
- Defending Collaborative Filtering Recommenders via Adversarial Robustness Based Edge Reweighting
- Learning Dynamics in Memristor-Based Equilibrium Propagation
- Knowledge-Guided Masked Autoencoder with Linear Spectral Mixing and Spectral-Angle-Aware Reconstruction
- Optimized Architectures for Kolmogorov-Arnold Networks
- Sparse Concept Anchoring for Interpretable and Controllable Neural Representations
- GoMS: Graph of Molecule Substructure Network for Molecule Property Prediction
- AI-Driven Early Warning Systems for Student Success: Discovering Static Feature Dominance in Temporal Prediction Models
- Policy Optimization for Dynamic Heart Transplant Allocation
- Empirical Mode Decomposition and Graph Transformation of the MSCI World Index: A Multiscale Topological Analysis for Graph Neural Network Modeling
- Effective Fine-Tuning with Eigenvector Centrality Based Pruning
- Optimal Mistake Bounds for Transductive Online Learning
- On the Accuracy of Newton Step and Influence Function Data Attributions
- Differentiable Energy-Based Regularization in GANs: A Simulator-Based Exploration of VQE-Inspired Auxiliary Losses
- Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics
- Causal inference and model explainability tools for retail
- Spectral Sentinel: Scalable Byzantine-Robust Decentralized Federated Learning via Sketched Random Matrix Theory on Blockchain
- Torch Geometric Pool: the Pytorch library for pooling in Graph Neural Networks
- On Approaches to Building Surrogate ODE Models for Diffusion Bridges
- Reassessing the Role of Supervised Fine-Tuning: An Empirical Study in VLM Reasoning
- Multi-Trajectory Physics-Informed Neural Networks for HJB Equations with Hard-Zero Terminal Inventory: Optimal Execution on Synthetic & SPY Data
- Solving a Machine Learning Regression Problem Based on the Theory of Random Functions
- SPARK: Igniting Communication-Efficient Decentralized Learning via Stage-wise Projected NTK and Accelerated Regularization
- Resting Neurons, Active Insights: Improving Input Sparsification for Large Language Models
- OLR-WAA: Adaptive and Drift-Resilient Online Regression with Dynamic Weighted Averaging
- Credit Risk Estimation with Non-Financial Features: Evidence from a Synthetic Istanbul Dataset
- TRACER: Transfer Learning based Real-time Adaptation for Clinical Evolving Risk
- Optimal Resource Allocation for ML Model Training and Deployment under Concept Drift
- GradID: Adversarial Detection via Intrinsic Dimensionality of Gradients
- Improving Recursive Transformers with Mixture of LoRAs
- Unsupervised learning of multiscale switching dynamical system models from multimodal neural data
- Distillation of Discrete Diffusion by Exact Conditional Distribution Matching
- Wait, Wait, Wait... Why Do Reasoning Models Loop?
- Probability Estimation for Predicted-Occupancy Grids in Vehicle Safety Applications Based on Machine Learning
- Predicted-occupancy grids for vehicle safety applications based on autoencoders and the Random Forest algorithm
- Next-generation reservoir computing validated by classification task
- Machine Learning Architectures for the Estimation of Predicted Occupancy Grids in Road Traffic
- LLM-based Personalized Portfolio Recommender: Integrating Large Language Models and Reinforcement Learning for Intelligent Investment Strategy Optimization
- SeVeDo: A Heterogeneous Transformer Accelerator for Low-Bit Inference via Hierarchical Group Quantization and SVD-Guided Mixed Precision
- Understanding When Graph Convolutional Networks Help: A Diagnostic Study on Label Scarcity and Structural Properties
- Application of Deep Learning in Biological Data Compression
- CoDeQ: End-to-End Joint Model Compression with Dead-Zone Quantizer for High-Sparsity and Low-Precision Networks
- Deep Learning-Driven Inversion Framework for Shear Modulus Estimation in Magnetic Resonance Elastography (DIME)
- Alada: Alternating Adaptation of Momentum Method for Memory-Efficient Matrix Optimization
- Understanding Structured Financial Data with LLMs: A Case Study on Fraud Detection
- Deep Q-Learning-Based Intelligent Scheduling for ETL Optimization in Heterogeneous Data Environments
- Multi-fidelity aerodynamic data fusion by autoencoder transfer learning
- LikeBench: Evaluating Subjective Likability in LLMs for Personalization
- Quanvolutional Neural Networks for Spectrum Peak-Finding
- Enhancing Node-Level Graph Domain Adaptation by Alleviating Local Dependency
- Noise-Resilient Quantum Aggregation on NISQ for Federated ADAS Learning
- Evaluating Adversarial Attacks on Federated Learning for Temperature Forecasting
- ModSSC: A Modular Framework for Semi-Supervised Classification on Heterogeneous Data
- Learning to Retrieve with Weakened Labels: Robust Training under Label Noise
- BézierFlow: Bézier Stochastic Interpolant Schedulers for Few-Step Generation
- KD-PINN: Knowledge-Distilled PINNs for ultra-low-latency real-time neural PDE solvers
- FROC: A Unified Framework with Risk-Optimized Control for Machine Unlearning in LLMs
- Link-Aware Energy-Frugal Continual Learning for Fault Detection in IoT Networks
- On the Effectiveness of Membership Inference in Targeted Data Extraction from Large Language Models
- Dual-Phase Federated Deep Unlearning via Weight-Aware Rollback and Reconstruction
- Multiclass Graph-Based Large Margin Classifiers: Unified Approach for Support Vectors and Neural Networks
- XNNTab -- Interpretable Neural Networks for Tabular Data using Sparse Autoencoders
- DP-EMAR: A Differentially Private Framework for Autonomous Model Weight Repair in Federated IoT Systems
- Element-wise Modulation of Random Matrices for Efficient Neural Layers
- On-Device Continual Learning for Unsupervised Visual Anomaly Detection in Dynamic Manufacturing
- Learning under Distributional Drift: Reproducibility as an Intrinsic Statistical Resource
- Async Control: Stress-testing Asynchronous Control Measures for LLM Agents
- Image Diffusion Preview with Consistency Solver
- Scalable Formal Verification via Autoencoder Latent Space Abstraction
- LightTopoGAT: Enhancing Graph Attention Networks with Topological Features for Efficient Graph Classification
- StutterFuse: Mitigating Modality Collapse in Stuttering Detection with Jaccard-Weighted Metric Learning and Gated Fusion
- A Scientific Reasoning Model for Organic Synthesis Procedure Generation
- Directional Textual Inversion for Personalized Text-to-Image Generation
- Reinforcement Learning for Latent-Space Thinking in LLMs
- Love First, Know Later: Persona-Based Romantic Compatibility Through LLM Text World Engines
- Evolving Deep Learning Optimizers
- The Art of Storytelling in Authoritarian Regimes: Crafting State Narratives on Chinese Social Media
- mmWEAVER: Environment-Specific mmWave Signal Synthesis from a Photo and Activity Description
- CLARGA: Multimodal Graph Representation Learning over Arbitrary Sets of Modalities
- MPath: Multimodal Pathology Report Generation from Whole Slide Images
- Interval Fisher's Discriminant Analysis and Visualisation
- Policy Gradient Algorithms for Age-of-Information Cost Minimization
- Adversarial Attacks Against Deep Learning-Based Radio Frequency Fingerprint Identification
- Exploring Spatial-Temporal Representation via Star Graph for mmWave Radar-based Human Activity Recognition
- Human-computer interactions predict mental health
- On the Design of One-step Diffusion via Shortcutting Flow Paths
- Hybrid twinning using PBDW and DeepONet for the effective state estimation and prediction on partially known systems
- D-STEER - Preference Alignment Techniques Learn to Behave, not to Believe -- Beneath the Surface, DPO as Steering Vector Perturbation in Activation Space
- Large Language Models as Generalist Policies for Network Optimization
- Amortized Causal Discovery with Prior-Fitted Networks
- Meta-Continual Mobility Forecasting for Proactive Handover Prediction
- Exploring Topological Bias in Heterogeneous Graph Neural Networks
- Tiny Recursive Models on ARC-AGI-1: Inductive Biases, Identity Conditioning, and Test-Time Compute
- Phase transitions reveal hierarchical structure in deep neural networks
- Neural Chameleons: Language Models Can Learn to Hide Their Thoughts from Unseen Activation Monitors
- Learning to Extract Context for Context-Aware LLM Inference
- EnviroLLM: Resource Tracking and Optimization for Local AI
- DFedReweighting: A Unified Framework for Objective-Oriented Reweighting in Decentralized Federated Learning
- Goal Reaching with Eikonal-Constrained Hierarchical Quasimetric Reinforcement Learning
- Physics-informed neural networks to solve inverse problems in unbounded domains
- SigTime: Learning and Visually Explaining Time Series Signatures
- CLOAK: Contrastive Guidance for Latent Diffusion-Based Data Obfuscation
- GraphPerf-RT: A Graph-Driven Performance Model for Hardware-Aware Scheduling of OpenMP Codes
- Neural CDEs as Correctors for Learned Time Series Models
- High-Dimensional Tensor Discriminant Analysis: Low-Rank Discriminant Structure, Representation Synergy, and Theoretical Guarantees
- BOOST: BOttleneck-Optimized Scalable Training Framework for Low-Rank Large Language Models
- On the Approximation Power of SiLU Networks: Exponential Rates and Depth Efficiency
- HydroDiffusion: Diffusion-Based Probabilistic Streamflow Forecasting with a State Space Backbone
- MolGuidance: Advanced Guidance Strategies for Conditional Molecular Generation with Flow Matching
- EEG-DLite: Dataset Distillation for Efficient Large EEG Model Training
- Optimized Learned Count-Min Sketch
- Balancing Accuracy and Speed: A Multi-Fidelity Ensemble Kalman Filter with a Machine Learning Surrogate Model
- TwinFormer: A Dual-Level Transformer for Long-Sequence Time-Series Forecasting
- Eventually LIL Regret: Almost Sure $\ln\ln T$ Regret for a sub-Gaussian Mixture on Unbounded Data
- Uncertainty Quantification for Machine Learning: One Size Does Not Fit All
- Synthetic Swarm Mosquito Dataset for Acoustic Classification: A Proof of Concept
- The Data Efficiency Frontier of Financial Foundation Models: Scaling Laws from Continued Pretraining
- Anchoring Values in Temporal and Group Dimensions for Flow Matching Model Alignment
- DeepVekua: Geometric-Spectral Representation Learning for Physics-Informed Fields
- Can Graphs Improve Tabular Foundation Models?
- UniVCD: A New Method for Unsupervised Change Detection in the Open-Vocabulary Era
- Sequence of Expert: Boosting Imitation Planners for Autonomous Driving through Temporal Alternation
- OXE-AugE: A Large-Scale Robot Augmentation of OXE for Scaling Cross-Embodiment Policy Learning
- Harmonizing Generalization and Specialization: Uncertainty-Informed Collaborative Learning for Semi-supervised Medical Image Segmentation
- TraPO: A Semi-Supervised Reinforcement Learning Framework for Boosting LLM Reasoning
- Diffusion-Based Restoration for Multi-Modal 3D Object Detection in Adverse Weather
- Uncovering the Role of Initial Saliency in U-Shaped Attention Bias: Scaling Initial Token Weight for Enhanced Long-Text Processing
- From Overfitting to Reliability: Introducing the Hierarchical Approximate Bayesian Neural Network
- DePT3R: Joint Dense Point Tracking and 3D Reconstruction of Dynamic Scenes in a Single Forward Pass
- Intrinsic Image Fusion for Multi-View 3D Material Reconstruction
- A Semantically Enhanced Generative Foundation Model Improves Pathological Image Synthesis
- SACn: Soft Actor-Critic with n-step Returns
- Carrot, stick, or both? Price incentives for sustainable food choice in competitive environments
- PolySet: Restoring the Statistical Ensemble Nature of Polymers for Machine Learning
- WAY: Estimation of Vessel Destination in Worldwide AIS Trajectory
- Efficient Adaptive Rejection Sampling for Accelerating Speculative Decoding in Large Language Models
- CORE: Contrastive Masked Feature Reconstruction on Graphs
- LINA: Learning INterventions Adaptively for Physical Alignment and Generalization in Diffusion Models
- Intrinsic-Motivation Multi-Robot Social Formation Navigation with Coordinated Exploration
- MiniLingua: A Small Open-Source LLM for European Languages
- No One Left Behind: How to Exploit the Incomplete and Skewed Multi-Label Data for Conversion Rate Prediction
- ALIGN-FL: Architecture-independent Learning through Invariant Generative component sharing in Federated Learning
- Face Identity Unlearning for Retrieval via Embedding Dispersion
- Security and Detectability Analysis of Unicode Text Watermarking Methods Against Large Language Models
- FIN-bench-v2: A Unified and Robust Benchmark Suite for Evaluating Finnish Large Language Models
- Control of a Twin Rotor using Twin Delayed Deep Deterministic Policy Gradient (TD3)
- Detecting Emotion Drift in Mental Health Text Using Pre-Trained Transformers
- End2Reg: Learning Task-Specific Segmentation for Markerless Registration in Spine Surgery
- From User Interface to Agent Interface: Efficiency Optimization of UI Representations for LLM Agents
- SSAS: Cross-subject EEG-based Emotion Recognition through Source Selection with Adversarial Strategy
- Non-Resolution Reasoning: A Framework for Preserving Semantic Ambiguity in Language Models
- SkipCat: Rank-Maximized Low-Rank Compression of Large Language Models via Shared Projection and Block Skipping
- Behavior-Aware and Generalizable Defense Against Black-Box Adversarial Attacks for ML-Based IDS
- Verifying Rumors via Stance-Aware Structural Modeling
- Memory in the Age of AI Agents
- Superposition as Lossy Compression: Measure with Sparse Autoencoders and Connect to Adversarial Vulnerability
- DP-CSGP: Differentially Private Stochastic Gradient Push with Compressed Communication
- ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding
- DA-SSL: self-supervised domain adaptor to leverage foundational models in TURBT histopathology slides
- Nemotron-Cascade: Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models
- From Code to Field: Evaluating the Robustness of Convolutional Neural Networks for Disease Diagnosis in Mango Leaves
- World Models Can Leverage Human Videos for Dexterous Manipulation
- Large-Language Memorization During the Classification of United States Supreme Court Cases
- Embedding-Based Rankings of Educational Resources based on Learning Outcome Alignment: Benchmarking, Expert Validation, and Learner Performance
- Feedforward 3D Editing via Text-Steerable Image-to-3D
- DiffusionBrowser: Interactive Diffusion Previews via Multi-Branch Decoders
- AI Copilots for Reproducibility in Science: A Case Study
- Think, Speak, Decide: Language-Augmented Multi-Agent Reinforcement Learning for Economic Decision-Making
- Enhancing Interpretability and Interactivity in Robot Manipulation: A Neurosymbolic Approach
- PADS: Plug-and-Play 3D Human Pose Analysis via Diffusion Generative Modeling
- A Comprehensive Survey on Self-Supervised Learning for Recommendation
- Fast Wrong-way Cycling Detection in CCTV Videos: Sparse Sampling is All You Need
- Efficient Neural Common Neighbor for Temporal Graph Link Prediction
- Dy-mer: An Explainable DNA Sequence Representation Scheme using Dictionary Learning
- Fully Bayesian Differential Gaussian Processes through Stochastic Differential Equations
- No Screening is More Efficient with Multiple Objects
- Layer-aware TDNN: Speaker Recognition Using Multi-Layer Features from Pre-Trained Models
- MAISI: Medical AI for Synthetic Imaging
- KNN-MMD: Cross Domain Wireless Sensing via Local Distribution Alignment
- Training Versatile Coding Agents in Synthetic Environments
- Comparison of different segmentation algorithms on brain volume and fractal dimension in infant brain MRIs
- Semantic Distance Measurement based on Multi-Kernel Gaussian Processes
- Adversarially Probing Cross-Family Sound Symbolism in 27 Languages
- Stochastic Volatility Modelling with LSTM Networks: A Hybrid Approach for S&P 500 Index Volatility Forecasting
- Accurate de novo sequencing of the modified proteome with OmniNovo
- GRC-Net: Gram Residual Co-attention Net for epilepsy prediction
- V-Rex: Real-Time Streaming Video LLM Acceleration via Dynamic KV Cache Retrieval
- Fractional Differential Equation Physics-Informed Neural Network and Its Application in Battery State Estimation
- UniMark: Artificial Intelligence Generated Content Identification Toolkit
- Dynamic Homophily with Imperfect Recall: Modeling Resilience in Adversarial Networks
- SCIR: A Self-Correcting Iterative Refinement Framework for Enhanced Information Extraction Based on Schema
- A Graph Attention Network-Based Framework for Reconstructing Missing LiDAR Beams
- Rough Sets for Explainability of Spectral Graph Clustering
- Cross-Modal Representational Knowledge Distillation for Enhanced Spike-Informed LFP Modeling
- Dynamical modeling of nonlinear latent factors in multiscale neural activity with real-time inference
- Exploring the Design Space of Transition Matching
- AI-Driven Real-Time Kick Classification in Olympic Taekwondo Using Sensor Fusion
- Mage: Cracking Elliptic Curve Cryptography with Cross-Axis Transformers
- Explainable AI as a Double-Edged Sword in Dermatology: The Impact on Clinicians versus The Public
- Explainable Artificial Intelligence for Economic Time Series: A Comprehensive Review and a Systematic Taxonomy of Methods and Concepts
- Can You Keep a Secret? Exploring AI for Care Coordination in Cognitive Decline
- Noise-robust Contrastive Learning for Critical Transition Detection in Dynamical Systems
- Diverse LLMs vs. Vulnerabilities: Who Detects and Fixes Them Better?
- Skillful Subseasonal-to-Seasonal Forecasting of Extreme Events with a Multi-Sphere Coupled Probabilistic Model
- StreamingAssistant: Efficient Visual Token Pruning for Accelerating Online Video Understanding
- Coupled Variational Reinforcement Learning for Language Model General Reasoning
- Detecting Prompt Injection Attacks Against Application Using Classifiers
- Content-Aware Ad Banner Layout Generation with Two-Stage Chain-of-Thought in Vision Language Models
- Human-Inspired Learning for Large Language Models via Obvious Record and Maximum-Entropy Method Discovery
- Understanding Syllogistic Reasoning in LLMs from Formal and Natural Language Perspectives
- ORIBA: Exploring LLM-Driven Role-Play Chatbot as a Creativity Support Tool for Original Character Artists
- DiG: Differential Grounding for Enhancing Fine-Grained Perception in Multimodal Large Language Model
- Anatomy-Guided Representation Learning Using a Transformer-Based Network for Thyroid Nodule Segmentation in Ultrasound Images
- PerNodeDrop: A Method Balancing Specialized Subnets and Regularization in Deep Neural Networks
- DynaGen: Unifying Temporal Knowledge Graph Reasoning with Dynamic Subgraphs and Generative Regularization
- Scone: Bridging Composition and Distinction in Subject-Driven Image Generation via Unified Understanding-Generation Modeling
- Fine-Tuning Causal LLMs for Text Classification: Embedding-Based vs. Instruction-Based Approaches
- Quantum Implicit Neural Representations for 3D Scene Reconstruction and Novel View Synthesis
- Theoretical Foundations of Prompt Engineering: From Heuristics to Expressivity
- Co-Exploration and Co-Exploitation via Shared Structure in Multi-Task Bandits
- Robust Motion Generation using Part-level Reliable Data from Videos
- Intelligent Scientific Literature Explorer using Machine Learning (ISLE)
- Federated Learning with Feedback Alignment
- CoRe3D: Collaborative Reasoning as a Foundation for 3D Intelligence
- Adaptive Edge-Cloud Inference for Speech-to-Action Systems Using ASR and Large Language Models (ASTA)
- Designing The Drive: Enhancing User Experience through Adaptive Interfaces in Autonomous Vehicles
- State over Tokens: Characterizing the Role of Reasoning Tokens
- OLC-WA: Drift Aware Tuning-Free Online Classification with Weighted Average
- Unveiling Statistical Significance of Online Regression over Multiple Datasets
- Beyond Task Completion: An Assessment Framework for Evaluating Agentic AI Systems
- Liquid Reasoning Transformers: A Sudoku-Based Prototype for Chess-Scale Algorithmic Tasks
- A Disproof of Large Language Model Consciousness: The Necessity of Continual Learning for Consciousness
- From Small to Large: Generalization Bounds for Transformers on Variable-Size Inputs
- OPAL: Operator-Programmed Algorithms for Landscape-Aware Black-Box Optimization
- Does Tone Change the Answer? Evaluating Prompt Politeness Effects on Modern LLMs: GPT, Gemini, LLaMA
- Decoding Human and AI Persuasion in National College Debate: Analyzing Prepared Arguments Through Aristotle's Rhetorical Principles
- Hindsight is 20/20: Building Agent Memory that Retains, Recalls, and Reflects
- On the continuity of flows
- Lemon: A Unified and Scalable 3D Multimodal Model for Universal Spatial Understanding
- Adapting Multimodal Foundation Models for Few-Shot Learning: A Comprehensive Study on Contrastive Captioners
- Network Level Evaluation of Hangup Susceptibility of HRGCs using Deep Learning and Sensing Techniques: A Goal Towards Safer Future
- PRIVEE: Privacy-Preserving Vertical Federated Learning Against Feature Inference Attacks
- SAGA: Open-World Mobile Manipulation via Structured Affordance Grounding
- Selective Conformal Risk Control
- Information-Consistent Language Model Recommendations through Group Relative Policy Optimization
- Counting Clues: A Lightweight Probabilistic Baseline Can Match an LLM
- Optimal Labeler Assignment and Sampling for Active Learning in the Presence of Imperfect Labels
- SignRAG: A Retrieval-Augmented System for Scalable Zero-Shot Road Sign Recognition
- Meta-GPT: Decoding the Metasurface Genome with Generative Artificial Intelligence
- CTIGuardian: A Few-Shot Framework for Mitigating Privacy Leakage in Fine-Tuned LLMs
- Cisco Integrated AI Security and Safety Framework Report
- MADTempo: An Interactive System for Multi-Event Temporal Video Retrieval with Query Augmentation
- Investigating Data Pruning for Pretraining Biological Foundation Models at Scale
- Unified Interactive Multimodal Moment Retrieval via Cascaded Embedding-Reranking and Temporal-Aware Score Fusion
- Content Adaptive based Motion Alignment Framework for Learned Video Compression
- Building from Scratch: A Multi-Agent Framework with Human-in-the-Loop for Multilingual Legal Terminology Mapping
- Tackling Snow-Induced Challenges: Safe Autonomous Lane-Keeping with Robust Reinforcement Learning
- Calibrating Uncertainty for Zero-Shot Adversarial CLIP
- Scaling Bidirectional Spans and Span Violations in Attention Mechanism
- GTR-Turbo: Merged Checkpoint is Secretly a Free Teacher for Agentic VLM Training
- LLM Rationalis? Measuring Bargaining Capabilities of AI Negotiators
- A Simple and Effective Framework for Symmetric Consistent Indexing in Large-Scale Dense Retrieval
- Reflective Preference Optimization (RPO): Enhancing On-Policy Alignment via Hint-Guided Reflection
- MedInsightBench: Evaluating Medical Analytics Agents Through Multi-Step Insight Discovery in Multimodal Medical Data
- Error-Driven Prompt Optimization for Arithmetic Reasoning
- Behavior and Representation in Large Language Models for Combinatorial Optimization: From Feature Extraction to Algorithm Selection
- Differentiable Evolutionary Reinforcement Learning
- neuralFOMO: Can LLMs Handle Being Second Best? Measuring Envy-Like Preferences in Multi-Agent Settings
- Defending the Hierarchical Result Models of Precedential Constraint
- MedCEG: Reinforcing Verifiable Medical Reasoning with Critical Evidence Graph
- A Multitask VAE for Time Series Preprocessing and Prediction of Blood Glucose Level
- Enhancing Urban Visual Place Recognition for Crowdsourced Flood Imagery via LLM-Guided Attention
- Totalitarian Technics: The Hidden Cost of AI Scribes in Healthcare
- The Ontological Dissonance Hypothesis: AI-Triggered Delusional Ideation as Folie a Deux Technologique
- Assessing Greenspace Attractiveness with ChatGPT, Claude, and Gemini: Do AI Models Reflect Human Perceptions?
- Active Inference with Reusable State-Dependent Value Profiles
- CR3G: Causal Reasoning for Patient-Centric Explanations in Radiology Report Generation
- Performance and Efficiency of Climate In-Situ Data Reconstruction: Why Optimized IDW Outperforms kriging and Implicit Neural Representation
- Soft Decision Tree classifier: explainable and extendable PyTorch implementation
- Semantic Nutrition Estimation: Predicting Food Healthfulness from Text Descriptions
- Vision Foundry: A System for Training Foundational Vision AI Models
- Spiking Manifesto
- Airport Passenger Flow Forecasting via Deformable Temporal-Spectral Transformer Approach
- KH-FUNSD: A Hierarchical and Fine-Grained Layout Analysis Dataset for Low-Resource Khmer Business Document
- KV Cache Recycling to Expand Usable Context Capacity in Low Parameter LLMs
- Explainable AI for Smart Greenhouse Control: Interpretability of Temporal Fusion Transformer in the Internet of Robotic Things
- Rep Smarter, Not Harder: AI Hypertrophy Coaching with Wearable Sensors and Edge Neural Networks
- Achieving Approximate Symmetry Is Exponentially Easier than Exact Symmetry
- GCoDE: Efficient Device-Edge Co-Inference for GNNs via Architecture-Mapping Co-Search
- TopicProphet: Prophesies on Temporal Topic Trends and Stocks
- Adaptive Path Integral Diffusion: AdaPID
- Generative Stochastic Optimal Transport: Guided Harmonic Path-Integral Diffusion
- An Operator-Consistent Graph Neural Network for Learning Diffusion Dynamics on Irregular Meshes
- Hierarchical Task Offloading and Trajectory Optimization in Low-Altitude Intelligent Networks Via Auction and Diffusion-based MARL
- Expert Assessment: The Systemic Environmental Risks of Artificial Intelligence
- Explainable Adversarial-Robust Vision-Language-Action Model for Robotic Manipulation
- On the Dangers of Bootstrapping Generation for Continual Learning and Beyond
- Industrial AI Robustness Card: Evaluating and Monitoring Time Series Models
- Using Socio-economic Indicators, Smart Transit Systems, and Urban Simulator to Accelerate ZEV Adoption and Reduce VMT
- Automated Plant Disease and Pest Detection System Using Hybrid Lightweight CNN-MobileViT Models for Diagnosis of Indigenous Crops
- WAM-Diff: A Masked Diffusion VLA Framework with MoE and Online Reinforcement Learning for Autonomous Driving
- It's About Time: The Temporal and Modal Dynamics of Copilot Usage
- Understanding Structural Representation in Foundation Models for Polymers
- An Experience Report on a Pedagogically Controlled, Curriculum-Constrained AI Tutor for SE Education
- Aesthetic Alignment Risks Assimilation: How Image Generation and Reward Models Reinforce Beauty Bias and Ideological "Censorship"
- Advancing Autonomous Driving System Testing: Demands, Challenges, and Future Directions
- Should AI Become an Intergenerational Civil Right?
- Beyond Automation: Rethinking Work, Creativity, and Governance in the Age of Generative AI
- A fine-grained look at causal effects in causal spaces
- Towards Accessible Physical AI: LoRA-Based Fine-Tuning of VLA Models for Real-World Robot Control
- Vibe Coding in Practice: Flow, Technical Debt, and Guidelines for Sustainable Use
- FloraForge: LLM-Assisted Procedural Generation of Editable and Analysis-Ready 3D Plant Geometric Models For Agricultural Applications
- Gene regulatory network inference algorithm based on spectral signed directed graph convolution
- MONET -- Virtual Cell Painting of Brightfield Images and Time Lapses Using Reference Consistent Diffusion
- Evolutionary Reinforcement Learning based AI tutor for Socratic Interdisciplinary Instruction
- Mapping AI Risk Mitigations: Evidence Scan and Preliminary AI Risk Mitigation Taxonomy
- The Agentic Regulator: Risks for AI in Finance and a Proposed Agent-based Framework for Governance
- Unveiling User Perceptions in the Generative AI Era: A Sentiment-Driven Evaluation of AI Educational Apps' Role in Digital Transformation of e-Teaching
- DynaPURLS: Dynamic Refinement of Part-aware Representations for Skeleton-based Zero-Shot Action Recognition
- How AI Agents Follow the Herd of AI? Network Effects, History, and Machine Optimism
- A Review of Learning-Based Motion Planning: Toward a Data-Driven Optimal Control Approach
- Data-Driven Global Sensitivity Analysis for Engineering Design Based on Individual Conditional Expectations
- Designing The Internet of Agents: A Framework for Trustworthy, Transparent, and Collaborative Human-Agent Interaction (HAX)
- Semantic search for 100M+ galaxy images using AI-generated captions
- Evidence-Driven Decision Support for AI Model Selection in Research Software Engineering
- V-REX: Benchmarking Exploratory Visual Reasoning via Chain-of-Questions
- Hold Onto That Thought: Assessing KV Cache Compression On Reasoning
- Semantic-Drive: Democratizing Long-Tail Data Curation via Open-Vocabulary Grounding and Neuro-Symbolic VLM Consensus
- AI as a Teaching Partner: Early Lessons from Classroom Codesign with Secondary Teachers
- Instruction-Tuning Open-Weight Language Models for BPMN Model Generation
- The Instability of Safety: How Random Seeds and Temperature Expose Inconsistent LLM Refusal Behavior
- Rethinking Jailbreak Detection of Large Vision Language Models with Representational Contrastive Scoring
- Congestion Reduction in EV Charger Placement Using Traffic Equilibrium Models
- A neuro-symbolic framework for accountability in public-sector AI
- MixtureKit: A General Framework for Composing, Training, and Visualizing Mixture-of-Experts Models
- A Benchmark Dataset for Spatially Aligned Road Damage Assessment in Small Uncrewed Aerial Systems Disaster Imagery
- BaRISTA: Brain Scale Informed Spatiotemporal Representation of Human Intracranial Neural Activity
- MeltwaterBench: Deep learning for spatiotemporal downscaling of surface meltwater
- Extending the Context of Pretrained LLMs by Dropping Their Positional Embeddings
- Diffusion Language Model Inference with Monte Carlo Tree Search
- Thermal RGB Fusion for Micro-UAV Wildfire Perimeter Tracking with Minimal Comms
- Epistemoverse: Toward an AI-Driven Knowledge Metaverse for Intellectual Heritage Preservation
- ALERT Open Dataset and Input-Size-Agnostic Vision Transformer for Driver Activity Recognition using IR-UWB
- Not All Transparency Is Equal: Source Presentation Effects on Attention, Interaction, and Persuasion in Conversational Search
- Measuring What Matters: Scenario-Driven Evaluation for Trajectory Predictors in Autonomous Driving
- A Monad-Based Clause Architecture for Artificial Age Score (AAS) in Large Language Models
- Solving Parallel Machine Scheduling With Precedences and Cumulative Resource Constraints With Calendars
- Mirror Mode in Fire Emblem: Beating Players at their own Game with Imitation and Reinforcement Learning
- Structured Personalization: Modeling Constraints as Matroids for Data-Minimal LLM Agents
- Causal Strengths and Leaky Beliefs: Interpreting LLM Reasoning via Noisy-OR Causal Bayes Nets
- Robustness of Probabilistic Models to Low-Quality Data: A Multi-Perspective Analysis
- CXL-SpecKV: A Disaggregated FPGA Speculative KV-Cache for Datacenter LLM Serving
- AGAPI-Agents: An Open-Access Agentic AI Platform for Accelerated Materials Design on AtomGPT.org
- Hypergame Rationalisability: Solving Agent Misalignment In Strategic Play
- Log Anomaly Detection with Large Language Models via Knowledge-Enriched Fusion
- Context-Aware Agentic Power Resources Optimisation in EV using Smart2ChargeApp
- The Forecast Critic: Leveraging Large Language Models for Poor Forecast Identification
- Reliable Policy Iteration: Performance Robustness Across Architecture and Environment Perturbations
- Rethinking Label Consistency of In-Context Learning: An Implicit Transductive Label Propagation Perspective
- Floorplan2Guide: LLM-Guided Floorplan Parsing for BLV Indoor Navigation
- TA-KAND: Two-stage Attention Triple Enhancement and U-KAN based Diffusion For Few-shot Knowledge Graph Completion
- A Geometric Theory of Cognition
- A Multi-Axial Mindset for Ontology Design Lessons from Wikidata's Polyhierarchical Structure
- Quantum-Aware Generative AI for Materials Discovery: A Framework for Robust Exploration Beyond DFT Biases
- Entropy Collapse: A Universal Failure Mode of Intelligent Systems
- Feeling the Strength but Not the Source: Partial Introspection in LLMs
- Understanding Critical Thinking in Generative Artificial Intelligence Use: Development, Validation, and Correlates of the Critical Thinking in AI Use Scale
- AI Transparency Atlas: Framework, Scoring, and Real-Time Model Card Evaluation Pipeline
- MetaHGNIE: Meta-Path Induced Hypergraph Contrastive Learning in Heterogeneous Knowledge Graphs
- SafeGen: Embedding Ethical Safeguards in Text-to-Image Generation
- KidsArtBench: Multi-Dimensional Children's Art Evaluation with Attribute-Aware MLLMs
- World Models Unlock Optimal Foraging Strategies in Reinforcement Learning Agents
- Large Language Newsvendor: Decision Biases and Cognitive Mechanisms
- AgentSHAP: Interpreting LLM Agent Tool Importance with Monte Carlo Shapley Value Estimation
- Modular and Multi-Path-Aware Offline Benchmarking for Mobile GUI Agents
- Value-Aware Multiagent Systems
- Memoria: A Scalable Agentic Memory Framework for Personalized Conversational AI
- WebOperator: Action-Aware Tree Search for Autonomous Agents in Web Environment
- Synergizing Code Coverage and Gameplay Intent: Coverage-Aware Game Playtesting with LLM-Guided Reinforcement Learning
- Personalized QoE Prediction: A Demographic-Augmented Machine Learning Framework for 5G Video Streaming Networks
- Causal Counterfactuals Reconsidered
- Fault-Tolerant Sandboxing for AI Coding Agents: A Transactional Approach to Safe Autonomous Execution
- Forgetful but Faithful: A Cognitive Memory Architecture and Benchmark for Privacy-Aware Generative Agents
- Satisfiability Modulo Theory Meets Inductive Logic Programming
- Towards Open Standards for Systemic Complexity in Digital Forensics
- M-GRPO: Stabilizing Self-Supervised Reinforcement Learning for Large Language Models with Momentum-Anchored Policy Optimization
- Socratic Students: Teaching Language Models to Learn by Asking Questions
- Towards Unified Co-Speech Gesture Generation via Hierarchical Implicit Periodicity Learning
- Can AI Understand What We Cannot Say? Measuring Multilevel Alignment Through Abortion Stigma Across Cognitive, Interpersonal, and Structural Levels
- MAC: A Multi-Agent Framework for Interactive User Clarification in Multi-turn Conversations
- SpeakRL: Synergizing Reasoning, Speaking, and Acting in Language Models with Reinforcement Learning
- Finch: Benchmarking Finance & Accounting across Spreadsheet-Centric Enterprise Workflows
Research Sources: 695 | Generated: 12/16/2025
