AI RESEARCH PAPERS & ACADEMIC SOURCES
- SoLA-Vision: Fine-grained Layer-wise Linear Softmax Hybrid Attention
- Democratizing planetary-scale analysis: An ultra-lightweight Earth embedding database for accurate and flexible global land monitoring
- ATATA: One Algorithm to Align Them All
- Bio-inspired fine-tuning for selective transfer learning in image classification
- Image-Text Knowledge Modeling for Unsupervised Multi-Scenario Person Re-Identification
- Language-Agnostic Visual Embeddings for Cross-Script Handwriting Retrieval
- FTDMamba: Frequency-Assisted Temporal Dilation Mamba for Unmanned Aerial Vehicle Video Anomaly Detection
- Efficient On-Board Processing of Oblique UAV Video for Rapid Flood Extent Mapping
- SAMannot: A Memory-Efficient, Local, Open-source Framework for Interactive Video Instance Segmentation based on SAM2
- Context-Aware Semantic Segmentation via Stage-Wise Attention
- Enhancing Vision Language Models with Logic Reasoning for Situational Awareness
- Assessing Building Heat Resilience Using UAV and Street-View Imagery with Coupled Global Context Vision Transformer
- Heterogeneous Uncertainty-Guided Composed Image Retrieval with Fine-Grained Probabilistic Learning
- SUG-Occ: An Explicit Semantics and Uncertainty Guided Sparse Learning Framework for Real-Time 3D Occupancy Prediction
- SME-YOLO: A Real-Time Detector for Tiny Defect Detection on PCB Surfaces
- Generative Scenario Rollouts for End-to-End Autonomous Driving
- ReScene4D: Temporally Consistent Semantic Instance Segmentation of Evolving Indoor 3D Scenes
- UniX: Unifying Autoregression and Diffusion for Chest X-Ray Understanding and Generation
- Differentiating through binarized topology changes: Second-order subpixel-smoothed projection
- KOCOBrain: Kuramoto-Guided Graph Network for Uncovering Structure-Function Coupling in Adolescent Prenatal Drug Exposure
- Convolutions Need Registers Too: HVS-Inspired Dynamic Attention for Video Quality Assessment
- Visual question answering-based image-finding generation for pulmonary nodules on chest CT from structured annotations
- Generation of Chest CT pulmonary Nodule Images by Latent Diffusion Models using the LIDC-IDRI Dataset
- Simple Models, Rich Representations: Visual Decoding from Primate Intracortical Neural Signals
- VidLeaks: Membership Inference Attacks Against Text-to-Video Models
- ProSGNeRF: Progressive Dynamic Neural Scene Graph with Frequency Modulated Foundation Model in Urban Scenes
- Controllable Video Generation: A Survey
- BYOL: Bring Your Own Language Into LLMs
- A Concise Agent is Less Expert: Revealing Side Effects of Using Style Features on Conversational Agents
- EncodeRec: An Embedding Backbone for Recommendation Systems
- DialDefer: A Framework for Detecting and Mitigating LLM Dialogic Deference
- Neural Induction of Finite-State Transducers
- Massively Multilingual Joint Segmentation and Glossing
- ZPD Detector: Data Selection via Capability-Difficulty Alignment for Large Language Models
- Redefining Machine Simultaneous Interpretation: From Incremental Translation to Human-Like Strategies
- NAACL: Noise-AwAre Verbal Confidence Calibration for LLMs in RAG Systems
- From Interpretability to Performance: Optimizing Retrieval Heads for Long-Context Language Models
- Budget-Aware Anytime Reasoning with LLM-Synthesized Preference Data
- Integrity Shield A System for Ethical AI Use & Authorship Transparency in Assessments
- The Growing Gains and Pains of Iterative Web Corpora Crawling: Insights from South Slavic CLASSLA-web 2.0 Corpora
- DOREMI: Optimizing Long Tail Predictions in Document-Level Relation Extraction
- T$^\star$: Progressive Block Scaling for MDM Through Trajectory Aware RL
- MultiCaption: Detecting disinformation using multilingual visual claims
- Language of Thought Shapes Output Diversity in Large Language Models
- One LLM to Train Them All: Multi-Task Learning Framework for Fact-Checking
- Membership Inference on LLMs in the Wild
- F-Actor: Controllable Conversational Behaviour in Full-Duplex Models
- Idea First, Code Later: Disentangling Problem Solving from Code Generation in Evaluating LLMs for Competitive Programming
- Neural Chain-of-Thought Search: Searching the Optimal Reasoning Path to Enhance Large Language Models
- Reward Modeling for Scientific Writing Evaluation
- The unreasonable effectiveness of pattern matching
- Predict the Retrieval! Test time adaptation for Retrieval Augmented Generation
- CTest-Metric: A Unified Framework to Assess Clinical Validity of Metrics for CT Report Generation
- How Long Is a Piece of String? A Brief Empirical Analysis of Tokenizers
- AJAR: Adaptive Jailbreak Architecture for Red-teaming
- SonicBench: Dissecting the Physical Perception Bottleneck in Large Audio Language Models
- FlashLabs Chroma 1.0: A Real-Time End-to-End Spoken Dialogue Model with Personalized Voice Cloning
- Isotropy-Optimized Contrastive Learning for Semantic Course Recommendation
- Future Optical Flow Prediction Improves Robot Control & Video Generation
- ICONIC-444: A 3.1-Million-Image Dataset for OOD Detection Research
- A Unified 3D Object Perception Framework for Real-Time Outside-In Multi-Camera Systems
- One Model, Many Behaviors: Training-Induced Effects on Out-of-Distribution Detection
- Effects of Different Attention Mechanisms Applied on 3D Models in Video Classification
- FrankenMotion: Part-level Human Motion Generation and Composition
- Classification of Chest XRay Diseases through image processing and analysis techniques
- MMedExpert-R1: Strengthening Multimodal Medical Reasoning via Domain-Specific Adaptation and Clinical Guideline Reinforcement
- M3DDM+: An improved video outpainting by a modified masking strategy
- PhysRVG: Physics-Aware Unified Reinforcement Learning for Video Generative Models
- CoDance: An Unbind-Rebind Paradigm for Robust Multi-Subject Animation
- Graph Smoothing for Enhanced Local Geometry Learning in Point Cloud Analysis
- Operator learning on domain boundary through combining fundamental solution-based artificial data and boundary integral techniques
- Latent Dynamics Graph Convolutional Networks for model order reduction of parameterized time-dependent PDEs
- Sample-Near-Optimal Agnostic Boosting with Improved Running Time
- Metabolomic Biomarker Discovery for ADHD Diagnosis Using Interpretable Machine Learning
- FORESTLLM: Large Language Models Make Random Forest Great on Few-shot Tabular Learning
- Unlocking the Potentials of Retrieval-Augmented Generation for Diffusion Language Models
- Offline Reinforcement-Learning-Based Power Control for Application-Agnostic Energy Efficiency
- Latent Space Inference via Paired Autoencoders
- Factored Value Functions for Graph-Based Multi-Agent Reinforcement Learning
- Forcing and Diagnosing Failure Modes of Fourier Neural Operators Across Diverse PDE Families
- Inter-patient ECG Arrhythmia Classification with LGNs and LUTNs
- When Are Two Scores Better Than One? Investigating Ensembles of Diffusion Models
- Low-Rank Key Value Attention
- Extractive summarization on a CMOS Ising machine
- QUPID: A Partitioned Quantum Neural Network for Anomaly Detection in Smart Grid
- SSC-UNet: UNet with Self-Supervised Contrastive Learning for Phonocardiography Noise Reduction
- UBiGTLoc: A Unified BiLSTM-Graph Transformer Localization Framework for IoT Sensor Networks
- Sensor Placement for Urban Traffic Interpolation: A Data-Driven Evaluation to Inform Policy
- Mass Distribution versus Density Distribution in the Context of Clustering
- Physically constrained unfolded multi-dimensional OMP for large MIMO systems
- LLMs for Game Theory: Entropy-Guided In-Context Learning and Adaptive CoT Reasoning
- Reasoning Models Generate Societies of Thought
- Learning collision operators from plasma phase space data using differentiable simulators
- A PAC-Bayesian Analysis of Channel-Induced Degradation in Edge Inference
- Depression Detection Based on Electroencephalography Using a Hybrid Deep Neural Network CNN-GRU and MRMR Feature Selection
- Memorize Early, Then Query: Inlier-Memorization-Guided Active Outlier Detection
- Exact Constraint Enforcement in Physics-Informed Extreme Learning Machines using Null-Space Projection Framework
- CoG: Controllable Graph Reasoning via Relational Blueprints and Failure-Aware Refinement over Knowledge Graphs
- Split-and-Conquer: Distributed Factor Modeling for High-Dimensional Matrix-Variate Time Series
- KANHedge: Efficient Hedging of High-Dimensional Options Using Kolmogorov-Arnold Network-Based BSDE Solver
- Comprehensive Robust Dynamic Mode Decomposition from Mode Extraction to Dimensional Reduction
- Model-free policy gradient for discrete-time mean-field control
- How DDAIR you? Disambiguated Data Augmentation for Intent Recognition
- Reasoning in Trees: Improving Retrieval-Augmented Generation for Multi-Hop Question Answering
- Effects of Introducing Synaptic Scaling on Spiking Neural Network Learning
- Scalable Music Cover Retrieval Using Lyrics-Aligned Audio Embeddings
- Information Theoretic Perspective on Representation Learning
- Beer-Lambert Autoencoder for Unsupervised Stain Representation Learning and Deconvolution in Multi-immunohistochemical Brightfield Histology Images
- New Adaptive Mechanism for Large Neighborhood Search using Dual Actor-Critic
- Zero-Shot Detection of Elastic Transient Morphology Across Physical Systems
- Statistical Robustness of Interval CVaR Based Regression Models under Perturbation and Contamination
- PubMed-OCR: PMC Open Access OCR Annotations
- Near-Optimal Decentralized Stochastic Nonconvex Optimization with Heavy-Tailed Noise
- IMS: Intelligent Hardware Monitoring System for Secure SoCs
- Learning Semantic-Geometric Task Graph-Representations from Human Demonstrations
- A Probabilistic Approach to Trajectory-Based Optimal Experimental Design
- On the Probability of First Success in Differential Evolution: Hazard Identities and Tail Bounds
- ShapeR: Robust Conditional 3D Shape Generation from Casual Captures
- ThinkEval: Practical Evaluation of Knowledge Leakage in LLM Editing using Thought-based Knowledge Graphs
- UCB-type Algorithm for Budget-Constrained Expert Learning
- Detecting Toxic Flow
- High-Dimensional Tail Index Regression
- A Natural Primal-Dual Hybrid Gradient Method for Adversarial Neural Network Training on Solving Partial Differential Equations
- Conditional Distribution Compression via the Kernel Conditional Mean Embedding
- Feature Propagation on Knowledge Graphs using Cellular Sheaves
- Theorem Prover as a Judge for Synthetic Data Generation
- Utilizing Class Separation Distance for the Evaluation of Corruption Robustness of Machine Learning Classifiers
- A Simple Unified Uncertainty-Guided Framework for Offline-to-Online Reinforcement Learning
- Balanced Edge Pruning for Graph Anomaly Detection with Noisy Labels
- Policy alone is probably not the solution: A large-scale experiment on how developers struggle to design meaningful end-user explanations
- Generalizable Domain Adaptation for Sim-and-Real Policy Co-Training
- Vendor-Aware Industrial Agents: RAG-Enhanced LLMs for Secure On-Premise PLC Code Generation
- Analytic Bijections for Smooth and Interpretable Normalizing Flows
- Towards Tensor Network Models for Low-Latency Jet Tagging on FPGAs
- Mugi: Value Level Parallelism For Efficient LLMs
- AI-Guided Human-In-the-Loop Inverse Design of High Performance Engineering Structures
- Beyond Accuracy: A Stability-Aware Metric for Multi-Horizon Forecasting
- Unit-Consistent (UC) Adjoint for GSD and Backprop in Deep Learning Applications
- Action Shapley: A Training Data Selection Metric for World Model in Reinforcement Learning
- Realistic Curriculum Reinforcement Learning for Autonomous and Sustainable Marine Vessel Navigation
- FAConvLSTM: Factorized-Attention ConvLSTM for Efficient Feature Extraction in Multivariate Climate Data
- HOSL: Hybrid-Order Split Learning for Memory-Constrained Edge Training
- Multivariate LSTM-Based Forecasting for Renewable Energy: Enhancing Climate Change Mitigation
- Transient learning dynamics drive escape from sharp valleys in Stochastic Gradient Descent
- Toward Adaptive Grid Resilience: A Gradient-Free Meta-RL Framework for Critical Load Restoration
- Reasoning Distillation for Lightweight Automated Program Repair
- Constant Metric Scaling in Riemannian Computation
- Backdoor Attacks on Multi-modal Contrastive Learning
- Matching High-Dimensional Geometric Quantiles for Test-Time Adaptation of Transformers and Convolutional Networks Alike
- AVP-Pro: An Adaptive Multi-Modal Fusion and Contrastive Learning Approach for Comprehensive Two-Stage Antiviral Peptide Identification
- Self-Augmented Mixture-of-Experts for QoS Prediction
- OpFML: Pipeline for ML-based Operational Forecasting
- Spurious Rewards Paradox: Mechanistically Understanding How RLVR Activates Memorization Shortcuts in LLMs
- Soft Bayesian Context Tree Models for Real-Valued Time Series
- Differentially Private Subspace Fine-Tuning for Large Language Models
- Optimized Algorithms for Text Clustering with LLM-Generated Constraints
- Shape-morphing programming of soft materials on complex geometries via neural operator
- FSL-BDP: Federated Survival Learning with Bayesian Differential Privacy for Credit Risk Modeling
- Assesing the Viability of Unsupervised Learning with Autoencoders for Predictive Maintenance in Helicopter Engines
- Theoretically and Practically Efficient Resistance Distance Computation on Large Graphs
- GMM-COMET: Continual Source-Free Universal Domain Adaptation via a Mean Teacher and Gaussian Mixture Model-Based Pseudo-Labeling
- LSTM VS. Feed-Forward Autoencoders for Unsupervised Fault Detection in Hydraulic Pumps
- TimeMar: Multi-Scale Autoregressive Modeling for Unconditional Time Series Generation
- Exploring LLM Features in Predictive Process Monitoring for Small-Scale Event-Logs
- Health Facility Location in Ethiopia: Leveraging LLMs to Integrate Expert Knowledge into Algorithmic Planning
- BoxMind: Closed-loop AI strategy optimization for elite boxing validated in the 2024 Olympics
- Generative AI Purpose-built for Social and Mental Health: A Real-World Pilot
- EvidFuse: Writing-Time Evidence Learning for Consistent Text-Chart Data Reporting
- DSA-Tokenizer: Disentangled Semantic-Acoustic Tokenization via Flow Matching-based Hierarchical Fusion
- Millimeter-Wave Gesture Recognition in ISAC: Does Reducing Sensing Airtime Hamper Accuracy?
- Neuro-Symbolic Activation Discovery: Transferring Mathematical Structures from Physics to Ecology for Parameter-Efficient Neural Networks
- Line-based Event Preprocessing: Towards Low-Energy Neuromorphic Computer Vision
- AnyECG: Evolved ECG Foundation Model for Holistic Health Profiling
- Unifying Speech Recognition, Synthesis and Conversion with Autoregressive Transformers
- LogicLens: Leveraging Semantic Code Graph to explore Multi Repository large systems
- Unified Optimization of Source Weights and Transfer Quantities in Multi-Source Transfer Learning: An Asymptotic Framework
- Digital Metabolism: Decoupling Logic from Facts via Regenerative Unlearning -- Towards a Pure Neural Logic Core
- Towards Reliable ML Feature Engineering via Planning in Constrained-Topology of LLM Agents
- Approximately Optimal Global Planning for Contact-Rich SE(2) Manipulation on a Graph of Reachable Sets
- Can Vision-Language Models Understand Construction Workers? An Exploratory Study
- Medical SAM3: A Foundation Model for Universal Prompt-Driven Medical Image Segmentation
- Self-learned representation-guided latent diffusion model for breast cancer classification in deep ultraviolet whole surface images
- RobuMTL: Enhancing Multi-Task Learning Robustness Against Weather Conditions
- Selecting Language Models for Social Science: Start Small, Start Open, and Validate
- Sparse Data Tree Canopy Segmentation: Fine-Tuning Leading Pretrained Models on Only 150 Images
- PatientVLM Meets DocVLM: Pre-Consultation Dialogue Between Vision-Language Models for Efficient Diagnosis
- Multi-Stage Patient Role-Playing Framework for Realistic Clinical Interactions
- Beyond Max Tokens: Stealthy Resource Amplification via Tool Calling Chains in LLM Agents
- Steering Language Models Before They Speak: Logit-Level Interventions
- When Personalization Misleads: Understanding and Mitigating Hallucinations in Personalized LLMs
- Contextual Distributionally Robust Optimization with Causal and Continuous Structure: An Interpretable and Tractable Approach
- Finding the Translation Switch: Discovering and Exploiting the Task-Initiation Features in LLMs
- Combating Spurious Correlations in Graph Interpretability via Self-Reflection
- IDDR-NGP: Incorporating Detectors for Distractor Removal with Instant Neural Radiance Field
- Your One-Stop Solution for AI-Generated Video Detection
- Spectral Characterization and Mitigation of Sequential Knowledge Editing Collapse
- Predicting Biased Human Decision-Making with Large Language Models in Conversational Settings
- H-AIM: Orchestrating LLMs, PDDL, and Behavior Trees for Hierarchical Multi-Robot Planning
- Fairness in Healthcare Processes: A Quantitative Analysis of Decision Making in Triage
- Bridging Cognitive Neuroscience and Graph Intelligence: Hippocampus-Inspired Multi-View Hypergraph Learning for Web Finance Fraud
- A3D: Adaptive Affordance Assembly with Dual-Arm Manipulation
- ABC-Bench: Benchmarking Agentic Backend Coding in Real-World Development
- Visual Marker Search for Autonomous Drone Landing in Diverse Urban Environments
- Efficient Multilingual Name Type Classification Using Convolutional Networks
- Vision-as-Inverse-Graphics Agent via Interleaved Multimodal Reasoning
- Learn Before Represent: Bridging Generative and Contrastive Learning for Domain-Specific LLM Embeddings
- Context-aware Graph Causality Inference for Few-Shot Molecular Property Prediction
- Learning Quadrupedal Locomotion for a Heavy Hydraulic Robot Using an Actuator Model
- Deep GraphRAG: A Balanced Approach to Hierarchical Retrieval and Adaptive Integration
- Cross-Modal Attention Network with Dual Graph Learning in Multimodal Recommendation
- Clustering High-dimensional Data: Balancing Abstraction and Representation Tutorial at AAAI 2026
- Artificial Intelligence and the US Economy: An Accounting Perspective on Investment and Production
- SD-RAG: A Prompt-Injection-Resilient Framework for Selective Disclosure in Retrieval-Augmented Generation
- FAQ: Mitigating Quantization Error via Regenerating Calibration Data with Family-Aware Quantization
- Epistemic Control and the Normativity of Machine Learning-Based Science
- LoRA as Oracle
- SDFLoRA: Selective Dual-Module LoRA for Federated Fine-tuning with Heterogeneous Clients
- FactCorrector: A Graph-Inspired Approach to Long-Form Factuality Correction of Large Language Models
- Knowledge is Not Enough: Injecting RL Skills for Continual Adaptation
- X-Distill: Cross-Architecture Vision Distillation for Visuomotor Learning
- From SERPs to Sound: How Search Engine Result Pages and AI-generated Podcasts Interact to Influence User Attitudes on Controversial Topics
- How Much Would a Clinician Edit This Draft? Evaluating LLM Alignment for Patient Message Response Drafting
- FEATHer: Fourier-Efficient Adaptive Temporal Hierarchy Forecaster for Time-Series Forecasting
- Think-Clip-Sample: Slow-Fast Frame Selection for Video Understanding
- Institutional AI: Governing LLM Collusion in Multi-Agent Cournot Markets via Public Governance Graphs
- Evaluating LLM Behavior in Hiring: Implicit Weights, Fairness Across Groups, and Alignment with Human Preferences
- Wetland mapping from sparse annotations with satellite image time series and temporal-aware segment anything model
- Topology-Guaranteed Image Segmentation: Enforcing Connectivity, Genus, and Width Constraints
- The Great March 100: 100 Detail-oriented Tasks for Evaluating Embodied AI Agents
- Relational Linearity is a Predictor of Hallucinations
- GenDA: Generative Data Assimilation on Complex Urban Areas via Classifier-Free Diffusion Guidance
- Hierarchical Orthogonal Residual Spread for Precise Massive Editing in Large Language Models
- Map2Thought: Explicit 3D Spatial Reasoning via Metric Cognitive Maps
- PRISM-CAFO: Prior-conditioned Remote-sensing Infrastructure Segmentation and Mapping for CAFOs
- Interactive Narrative Analytics: Bridging Computational Narrative Extraction and Human Sensemaking
- MHA2MLA-VLM: Enabling DeepSeek's Economical Multi-Head Latent Attention across Vision-Language Models
- The Poisoned Apple Effect: Strategic Manipulation of Mediated Markets via Technology Expansion of AI Agents
- MetaboNet: The Largest Publicly Available Consolidated Dataset for Type 1 Diabetes Management
- Building Production-Ready Probes For Gemini
- Do explanations generalize across large reasoning models?
- Japanese AI Agent System on Human Papillomavirus Vaccination: System Design
- Do You Trust Me? Cognitive-Affective Signatures of Trustworthiness in Large Language Models
- Building AI Agents to Improve Job Referral Requests to Strangers
- ORBITFLOW: SLO-Aware Long-Context LLM Serving with Fine-Grained KV Cache Reconfiguration
- CTHA: Constrained Temporal Hierarchical Architecture for Stable Multi-Agent LLM Systems
- Explore with Long-term Memory: A Benchmark and Multimodal LLM-based Reinforcement Learning Framework for Embodied Exploration
- Optimisation of complex product innovation processes based on trend models with three-valued logic
- ARC Prize 2025: Technical Report
- What Matters in Data Curation for Multimodal Reasoning? Insights from the DCVLR Challenge
- AdaMARP: An Adaptive Multi-Agent Interaction Framework for General Immersive Role-Playing
- Efficient Protein Optimization via Structure-aware Hamiltonian Dynamics
- BAPO: Boundary-Aware Policy Optimization for Reliable Agentic Search
- AgencyBench: Benchmarking the Frontiers of Autonomous Agents in 1M-Token Real-World Contexts
- MiCA: A Mobility-Informed Causal Adapter for Lightweight Epidemic Forecasting
- ReCreate: Reasoning and Creating Domain Agents Driven by Experience
- Do We Always Need Query-Level Workflows? Rethinking Agentic Workflow Generation for Multi-Agent Systems
- TANDEM: Temporal-Aware Neural Detection for Multimodal Hate Speech
- Policy-Based Deep Reinforcement Learning Hyperheuristics for Job-Shop Scheduling Problems
- Beyond Model Scaling: Test-Time Intervention for Efficient Deep Reasoning
- XChoice: Explainable Evaluation of AI-Human Alignment in LLM-based Constrained Choice Decision Making
- AstroReason-Bench: Evaluating Unified Agentic Planning across Heterogeneous Space Planning Problems
- Hyperparameter Optimization of Constraint Programming Solvers
Research Sources: 262 | Generated: 1/19/2026
