AI RESEARCH PAPERS & ACADEMIC SOURCES
- A Collaborative Content Moderation Framework for Toxicity Detection based on Conformalized Estimates of Annotation Disagreement
- Retrieval-Augmented Machine Translation with Unstructured Knowledge
- DPImageBench: A Unified Benchmark for Differentially Private Image Synthesis
- DDaTR: Dynamic Difference-aware Temporal Residual Network for Longitudinal Radiology Report Generation
- TrueGL: A Truthful, Reliable, and Unified Engine for Grounded Learning in Full-Stack Search
- Interpretable Mnemonic Generation for Kanji Learning via Expectation-Maximization
- Dually Hierarchical Drift Adaptation for Online Configuration Performance Learning
- Bringing Attention to CAD: Boundary Representation Learning via Transformer
- Visual Imitation Enables Contextual Humanoid Control
- Explicit Residual-Based Scalable Image Coding for Humans and Machines
- mmFlux: Crowd Flow Analytics with Commodity mmWave MIMO Radar
- Discovering Heterogeneous Treatment Effects in Regression Discontinuity Designs
- Mixed membership estimation for categorical data with weighted responses
- ARGS: Advanced Regularization on Aligning Gaussians over the Surface
- The Rosario Dataset v2: Multimodal Dataset for Agricultural Robotics
- From Drone Imagery to Livability Mapping: AI-powered Environment Perception in Rural China
- A Low-Cost Real-Time Framework for Industrial Action Recognition Using Foundation Models
- JambaTalk: Speech-Driven 3D Talking Head Generation Based on Hybrid Transformer-Mamba Model
- Maximising Kidney Glomeruli Segmentation using Minimal Labels via Self-Supervision
- CHaRM: Conditioned Heatmap Regression Methodology for Accurate and Fast Dental Landmark Localization
- Mixed Signals: A Diverse Point Cloud Dataset for Heterogeneous LiDAR V2X Collaboration
- Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation
- PicoPose: Progressive Pixel-to-Pixel Correspondence Learning for Novel Object Pose Estimation
- Computer-Aided Design of Personalized Occlusal Positioning Splints Using Multimodal 3D Data
- Saliency-Guided Training for Fingerprint Presentation Attack Detection
- InterpIoU: Rethinking Bounding Box Regression with Interpolation-Based IoU Optimization
- Gaussian is All You Need: A Unified Framework for Solving Inverse Problems via Diffusion Posterior Sampling
- Scale-GS: Efficient Scalable Gaussian Splatting via Redundancy-filtering Training on Streaming Content
- One More Glance with Sharp Eyes: Rethinking Lightweight Captioning as a Practical Visual Specialist
- Federated Fine-tuning of SAM-Med3D for MRI-based Dementia Classification
- Multi-Method Ensemble for Out-of-Distribution Detection
- Adversarial Patch Attack for Ship Detection via Localized Augmentation
- Maybe you don't need a U-Net: convolutional feature upsampling for materials micrograph segmentation
- HCCM: Hierarchical Cross-Granularity Contrastive and Matching Learning for Natural Language-Guided Drones
- ECHO: Ego-Centric modeling of Human-Object interactions
- How Well Do Vision-Language Models Understand Cities? A Comparative Study on Spatial Reasoning from Street-View Images
- Temporal Flow Matching for Learning Spatio-Temporal Trajectories in 4D Longitudinal Medical Imaging
- Integrating Pathology and CT Imaging for Personalized Recurrence Risk Prediction in Renal Cancer
- Unfolding Framework with Complex-Valued Deformable Attention for High-Quality Computer-Generated Hologram Generation
- Towards Interactive Lesion Segmentation in Whole-Body PET/CT with Promptable Models
- Mapping like a Skeptic: Probabilistic BEV Projection for Online HD Mapping
- FLORA: Efficient Synthetic Data Generation for Object Detection in Low-Data Regimes via finetuning Flux LoRA
- Learning from Silence and Noise for Visual Sound Source Localization
- UItron: Foundational GUI Agent with Advanced Perception and Planning
- What Can We Learn from Harry Potter? An Exploratory Study of Visual Representation Learning from Atypical Videos
- A Multi-Stage Fine-Tuning and Ensembling Strategy for Pancreatic Tumor Segmentation in Diagnostic and Therapeutic MRI
- VoCap: Video Object Captioning and Segmentation from Any Prompt
- DriveQA: Passing the Driving Knowledge Test
- ScanMove: Motion Prediction and Transfer for Unregistered Body Meshes
- Mini Autonomous Car Driving based on 3D Convolutional Neural Networks
- Inducing Programmatic Skills for Agentic Tasks
- Testing Conviction: An Argumentative Framework for Measuring LLM Political Stability
- 2COOOL: 2nd Workshop on the Challenge Of Out-Of-Label Hazards in Autonomous Driving
- Q-Align: Alleviating Attention Leakage in Zero-Shot Appearance Transfer via Query-Query Alignment
- ERTACache: Error Rectification and Timesteps Adjustment for Efficient Diffusion
- Video-LLMs with Temporal Visual Screening
- ROBUST-MIPS: A Combined Skeletal Pose and Instance Segmentation Dataset for Laparoscopic Surgical Instruments
- GENNAV: Polygon Mask Generation for Generalized Referring Navigable Regions
- SYNBUILD-3D: A large, multi-modal, and semantically rich synthetic dataset of 3D building models at Level of Detail 4
- Radially Distorted Homographies, Revisited
- GCAV: A Global Concept Activation Vector Framework for Cross-Layer Consistency in Interpretability
- Lightweight MRI-Based Automated Segmentation of Pancreatic Cancer with Auto3DSeg
- Reverse Imaging for Wide-spectrum Generalization of Cardiac MRI Segmentation
- PHD: Personalized 3D Human Body Fitting with Point Diffusion
- Efficient Diffusion-Based 3D Human Pose Estimation with Hierarchical Temporal Pruning
- Print2Volume: Generating Synthetic OCT-based 3D Fingerprint Volume from 2D Fingerprint Image
- GLENDA: Gynecologic Laparoscopy Endometriosis Dataset
- Identifying Surgical Instruments in Laparoscopy Using Deep Learning Instance Segmentation
- Unsupervised Incremental Learning Using Confidence-Based Pseudo-Labels
- Trees as Gaussians: Large-Scale Individual Tree Mapping
- Mapping Toxic Comments Across Demographics: A Dataset from German Public Broadcasting
- Granite Embedding R2 Models
- How Does Cognitive Bias Affect Large Language Models? A Case Study on the Anchoring Effect in Price Negotiation Simulations
- Can Multimodal LLMs Solve the Basic Perception Problems of Percept-V?
- Do Self-Supervised Speech Models Exhibit the Critical Period Effects in Language Acquisition?
- Automatic Reviewers Fail to Detect Faulty Reasoning in Research Papers: A New Counterfactual Evaluation Framework
- Discovering Semantic Subdimensions through Disentangled Conceptual Representations
- Beyond the Surface: Probing the Ideological Depth of Large Language Models
- Personality Matters: User Traits Predict LLM Preferences in Multi-Turn Collaborative Tasks
- Is this chart lying to me? Automating the detection of misleading visualizations
- Not All Parameters Are Created Equal: Smart Isolation Boosts Fine-Tuning Performance
- Designing Smarter Conversational Agents for Kids: Lessons from Cognitive Work and Means-Ends Analyses
- CrossTL: A Universal Programming Language Translator with Unified Intermediate Representation
- From Canonical to Complex: Benchmarking LLM Capabilities in Undergraduate Thermodynamics
- Morae: Proactively Pausing UI Agents for User Choices
- E2LLM: Encoder Elongated Large Language Models for Long-Context Understanding and Reasoning
- Blind Spot Navigation in Large Language Model Reasoning with Thought Space Explorer
- Strategic resource allocation in memory encoding: An efficiency principle shaping language processing
- Guaranteed Nonconvex Factorization Approach for Tensor Train Recovery
- Revealing Fine-Grained Values and Opinions in Large Language Models
- BrainGPT: Unleashing the Potential of EEG Generalist Foundation Model by Autoregressive Pre-training
- Control of Rayleigh-Bénard Convection: Effectiveness of Reinforcement Learning in the Turbulent Regime
- From stability of Langevin diffusion to convergence of proximal MCMC for non-log-concave sampling
- L3Cube-MahaEmotions: A Marathi Emotion Recognition Dataset with Synthetic Annotations using CoTR prompting and Large Language Models
- Interpretation of Deep Learning Model in Embryo Selection for In Vitro Fertilization (IVF) Treatment
- SatDINO: A Deep Dive into Self-Supervised Pretraining for Remote Sensing
- Standardized Multi-Layer Tissue Maps for Enhanced Artificial Intelligence Integration and Search in Large-Scale Whole Slide Image Archives
- Adaptive generative moment matching networks for improved learning of dependence structures
- Machine Intelligence on the Edge: Interpretable Cardiac Pattern Localisation Using Reinforcement Learning
- Surface Stability Modeling with Universal Machine Learning Interatomic Potentials: A Comprehensive Cleavage Energy Benchmarking Study
- A Soft Inducement Framework for Incentive-Aided Steering of No-Regret Players
- Domain Generalization in-the-Wild: Disentangling Classification from Domain-Aware Representations
- Finite-Time Analysis of Three-Timescale Constrained Actor-Critic and Constrained Natural Actor-Critic Algorithms
- Two-Timescale Critic-Actor for Average Reward MDPs with Function Approximation
- Refusal Tokens: A Simple Way to Calibrate Refusals in Large Language Models
- Federated Diffusion Modeling with Differential Privacy for Tabular Data Synthesis
- SpecPipe: Accelerating Pipeline Parallelism-based LLM Inference with Speculative Decoding
- On the Adversarial Robustness of Spiking Neural Networks Trained by Local Learning
- Merging and Disentangling Views in Visual Reinforcement Learning for Robotic Manipulation
- BiTrajDiff: Bidirectional Trajectory Generation with Diffusion Models for Offline Reinforcement Learning
- Rethinking Layer-wise Model Merging through Chain of Merges
- Beyond expected value: geometric mean optimization for long-term policy performance in reinforcement learning
- Failure Prediction Is a Better Performance Proxy for Early-Exit Networks Than Calibration
- Spiking Decision Transformers: Local Plasticity, Phase-Coding, and Dendritic Routing for Low-Power Sequence Control
- Accept or Deny? Evaluating LLM Fairness and Performance in Loan Approval across Table-to-Text Serialization Approaches
- Summarize-Exemplify-Reflect: Data-driven Insight Distillation Empowers LLMs for Few-shot Tabular Classification
- OASIS: Harnessing Diffusion Adversarial Network for Ocean Salinity Imputation using Sparse Drifter Trajectories
- Convergence of Stochastic Gradient Methods for Wide Two-Layer Physics-Informed Neural Networks
- UniMLR: Modeling Implicit Class Significance for Multi-Label Ranking
- QR-LoRA: QR-Based Low-Rank Adaptation for Efficient Fine-Tuning of Large Language Models
- Achieving Hilbert-Schmidt Independence Under Rényi Differential Privacy for Fair and Private Data Generation
- ImmunoAI: Accelerated Antibody Discovery Using Gradient-Boosted Machine Learning with Thermodynamic-Hydrodynamic Descriptors and 3D Geometric Interface Topology
- Advanced Deep Learning Techniques for Classifying Dental Conditions Using Panoramic X-Ray Images
- Synthetic CVs To Build and Test Fairness-Aware Hiring Tools
- Population-Scale Network Embeddings Expose Educational Divides in Network Structure Related to Right-Wing Populist Voting
- Weighted Support Points from Random Measures: An Interpretable Alternative for Generative Modeling
- Faster Inference of Cell Complexes from Flows via Matrix Factorization
- BASE-Q: Bias and Asymmetric Scaling Enhanced Rotational Quantization for Large Language Models
- Single Domain Generalization for Multimodal Cross-Cancer Prognosis via Dirac Rebalancer and Distribution Entanglement
- Adaptive LLM Routing under Budget Constraints
- Model-Task Alignment Drives Distinct RL Outcomes
- RelP: Faithful and Efficient Circuit Discovery via Relevance Patching
- CALM: A Framework for Continuous, Adaptive, and LLM-Mediated Anomaly Detection in Time-Series Streams
- Detecting Domain Shifts in Myoelectric Activations: Challenges and Opportunities in Stream Learning
- Improving Fisher Information Estimation and Efficiency for LoRA-based LLM Unlearning
- AI Simulation by Digital Twins: Systematic Survey, Reference Framework, and Mapping to a Standardized Architecture
- QHackBench: Benchmarking Large Language Models for Quantum Code Generation Using PennyLane Hackathon Challenges
- Large Intestine 3D Shape Refinement Using Point Diffusion Models for Digital Phantom Generation
- COBRA-PPM: A Causal Bayesian Reasoning Architecture Using Probabilistic Programming for Robot Manipulation Under Uncertainty
- Guiding a diffusion model using sliding windows
- ROSE: A Reward-Oriented Data Selection Framework for LLM Task-Specific Instruction Tuning
- Toxicity Begets Toxicity: Unraveling Conversational Chains in Political Podcasts
- LLM Test Generation via Iterative Hybrid Program Analysis
- FROG: Fair Removal on Graphs
- Decentralized Domain Generalization with Style Sharing: Formal Model and Convergence Analysis
- DeepTrans: Deep Reasoning Translation via Reinforcement Learning
- SAGA: A Security Architecture for Governing AI Agentic Systems
- MAC-Tuning: LLM Multi-Compositional Problem Reasoning with Enhanced Knowledge Boundary Awareness
- Towards Embodiment Scaling Laws in Robot Locomotion
- FedSEA-LLaMA: A Secure, Efficient and Adaptive Federated Splitting Framework for Large Language Models
- Beyond Frequency: The Role of Redundancy in Large Language Model Memorization
- Complete Gaussian Splats from a Single Image with Denoising Diffusion Models
- What Data is Really Necessary? A Feasibility Study of Inference Data Minimization for Recommender Systems
- EZ-Sort: Efficient Pairwise Comparison via Zero-Shot CLIP-Based Pre-Ordering and Human-in-the-Loop Sorting
- Limitations of Physics-Informed Neural Networks: a Study on Smart Grid Surrogation
- Middo: Model-Informed Dynamic Data Optimization for Enhanced LLM Fine-Tuning via Closed-Loop Learning
- Harnessing IoT and Generative AI for Weather-Adaptive Learning in Climate Resilience Education
- Entropy-Based Non-Invasive Reliability Monitoring of Convolutional Neural Networks
- OptMark: Robust Multi-bit Diffusion Watermarking via Inference Time Optimization
- Developer Insights into Designing AI-Based Computer Perception Tools
- Neural Network Acceleration on MPSoC board: Integrating SLAC's SNL, Rogue Software and Auto-SNL
- Benchmarking GPT-5 in Radiation Oncology: Measurable Gains, but Persistent Need for Expert Oversight
- MoE-Health: A Mixture of Experts Framework for Robust Multimodal Healthcare Prediction
- DynaMark: A Reinforcement Learning Framework for Dynamic Watermarking in Industrial Machine Tool Controllers
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning
- Transforming Wearable Data into Personal Health Insights using Large Language Model Agents
- A Financial Brain Scan of the LLM
- Efficient Code Embeddings from Code Generation Models
- BLUEX Revisited: Enhancing Benchmark Coverage with Automatic Captioning
- MyGO: Memory Yielding Generative Offline-consolidation for Lifelong Learning Systems
- Stage-Diff: Stage-wise Long-Term Time Series Generation Based on Diffusion Models
- Stairway to Fairness: Connecting Group and Individual Fairness
- DLGAN: Time Series Synthesis Based on Dual-Layer Generative Adversarial Networks
- Adaptive Heavy-Tailed Stochastic Gradient Descent
- EconAgentic in DePIN Markets: A Large Language Model Approach to the Sharing Economy of Decentralized Physical Infrastructure
- Challenges and Applications of Large Language Models: A Comparison of GPT and DeepSeek family of models
- RoboInspector: Unveiling the Unreliability of Policy Code for LLM-enabled Robotic Manipulation
- Iterative Inference in a Chess-Playing Neural Network
- zkLoRA: Fine-Tuning Large Language Models with Verifiable Security via Zero-Knowledge Proofs
- Med-RewardBench: Benchmarking Reward Models and Judges for Medical Multimodal Large Language Models
- The Complexity Trap: Simple Observation Masking Is as Efficient as LLM Summarization for Agent Context Management
- MedShift: Implicit Conditional Transport for X-Ray Domain Adaptation
- Diffusion-based Multi-modal Synergy Interest Network for Click-through Rate Prediction
- Igniting Creative Writing in Small Language Models: LLM-as-a-Judge versus Multi-Agent Refined Rewards
- Beyond Prediction: Reinforcement Learning as the Defining Leap in Healthcare AI
- Spatiotemporal EEG-Based Emotion Recognition Using SAM Ratings from Serious Games with Hybrid Deep Learning
- Dynamic Low-rank Approximation of Full-Matrix Preconditioner for Training Generalized Linear Models
- Learning to Generate Unit Test via Adversarial Reinforcement Learning
- An Explainable, Attention-Enhanced, Bidirectional Long Short-Term Memory Neural Network for Joint 48-Hour Forecasting of Temperature, Irradiance, and Relative Humidity
- Automating the Deep Space Network Data Systems; A Case Study in Adaptive Anomaly Detection through Agentic AI
- EmbodiedOneVision: Interleaved Vision-Text-Action Pretraining for General Robot Control
- R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning
- HiddenObject: Modality-Agnostic Fusion for Multimodal Hidden Object Detection
- A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers
- WaveLLDM: Design and Development of a Lightweight Latent Diffusion Model for Speech Enhancement and Restoration
- Quantifying Label-Induced Bias in Large Language Model Self- and Cross-Evaluations
- Deep Residual Echo State Networks: exploring residual orthogonal connections in untrained Recurrent Neural Networks
- BED-LLM: Intelligent Information Gathering with LLMs and Bayesian Experimental Design
- Improving Aviation Safety Analysis: Automated HFACS Classification Using Reinforcement Learning with Group Relative Policy Optimization
- Enhancing Robustness of Autoregressive Language Models against Orthographic Attacks via Pixel-based Approach
- Generalizable Object Re-Identification via Visual In-Context Prompting
- Quantum Machine Learning for Optimizing Entanglement Distribution in Quantum Sensor Circuits
- Reinforcement Learning for Optimizing Large Qubit Array based Quantum Sensor Circuits
- Breaking the Cold-Start Barrier: Reinforcement Learning with Double and Dueling DQNs
- Fuzzy, Symbolic, and Contextual: Enhancing LLM Instruction via Cognitive Scaffolding
- Addressing accuracy and hallucination of LLMs in Alzheimer's disease research through knowledge graphs
- MultiFluxAI Enhancing Platform Engineering with Advanced Agent-Orchestrated Retrieval Systems
- Think in Games: Learning to Reason in Games via Reinforcement Learning with Large Language Models
- AI Compute Architecture and Evolution Trends
- MMSearch-Plus: A Simple Yet Challenging Benchmark for Multimodal Browsing Agents
- HealthProcessAI: A Technical Framework and Proof-of-Concept for LLM-Enhanced Healthcare Process Mining
- Integrating Large Language Models with Network Optimization for Interactive and Explainable Supply Chain Planning: A Real-World Case Study
- Leveraging Imperfection with MEDLEY A Multi-Model Approach Harnessing Bias in Medical AI
- Orientability of Causal Relations in Time Series using Summary Causal Graphs and Faithful Distributions
- Tree-Guided Diffusion Planner
- Automated Clinical Problem Detection from SOAP Notes using a Collaborative Multi-Agent LLM Architecture
- QuadKAN: KAN-Enhanced Quadruped Motion Control via End-to-End Reinforcement Learning
- Pep2Prob Benchmark: Predicting Fragment Ion Probability for MS²-based Proteomics
- Model-Driven Quantum Code Generation Using Large Language Models and Retrieval-Augmented Generation
- TrInk: Ink Generation with Transformer Network
- Safe-Control: A Safety Patch for Mitigating Unsafe Content in Text-to-Image Generation Models
Research Sources: 221 | Generated: 9/1/2025