AI RESEARCH PAPERS & ACADEMIC SOURCES
- Efficient stereo matching on embedded GPUs with zero-means cross correlation
- Polygon Intersection-over-Union Loss for Viewpoint-Agnostic Monocular 3D Vehicle Detection
- Surface-Based Visibility-Guided Uncertainty for Continuous Active 3D Neural Reconstruction
- OP-Align: Object-level and Part-level Alignment for Self-supervised Category-level Articulated Object Pose Estimation
- Multimodal Markup Document Models for Graphic Design Completion
- Learning Geodesics of Geometric Shape Deformations From Images
- Turbo-GS: Accelerating 3D Gaussian Fitting for High-Quality Radiance Fields
- A dynamic memory assignment strategy for dilation-based ICP algorithm on embedded GPUs
- Reflection Removal through Efficient Adaptation of Diffusion Transformers
- Self-Supervised Learning for Transparent Object Depth Completion Using Depth from Non-Transparent Objects
- Generative Neural Video Compression via Video Diffusion Prior
- RAMEN: Resolution-Adjustable Multimodal Encoder for Earth Observation
- Semantic-Guided Two-Stage GAN for Face Inpainting with Hybrid Perceptual Encoding
- Joint 3D Geometry Reconstruction and Motion Generation for 4D Synthesis from a Single Image
- 4DLangVGGT: 4D Language-Visual Geometry Grounded Transformer
- BulletTime: Decoupled Control of Time and Camera Pose for Video Generation
- Object Reconstruction under Occlusion with Generative Priors and Contact-induced Constraints
- Deep Forcing: Training-Free Long Video Generation with Deep Sink and Participative Compression
- Visual Reasoning Tracer: Object-Level Grounded Reasoning Benchmark
- SA-IQA: Redefining Image Quality Assessment for Spatial Aesthetics with Multi-Dimensional Rewards
- EvoIR: Towards All-in-One Image Restoration via Evolutionary Frequency Modulation
- ShadowDraw: From Any Object to Shadow-Drawing Compositional Art
- ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning
- Splannequin: Freezing Monocular Mannequin-Challenge Footage with Dual-Detection Splatting
- Light-X: Generative 4D Video Rendering with Camera and Illumination Control
- The changing surface of the world's roads
- Efficient Spatially-Variant Convolution via Differentiable Sparse Kernel Complex
- Hardware-aware Neural Architecture Search of Early Exiting Networks on Edge Accelerators
- Shared Multi-modal Embedding Space for Face-Voice Association
- From Generated Human Videos to Physically Plausible Robot Trajectories
- Towards Cross-View Point Correspondence in Vision-Language Models
- OmniScaleSR: Unleashing Scale-Controlled Diffusion Prior for Faithful and Realistic Arbitrary-Scale Image Super-Resolution
- Measuring the Unspoken: A Disentanglement Model and Benchmark for Psychological Analysis in the Wild
- E3AD: An Emotion-Aware Vision-Language-Action Model for Human-Centric End-to-End Autonomous Driving
- MT-Depth: Multi-task Instance feature analysis for the Depth Completion
- Order Matters: 3D Shape Generation from Sequential VR Sketches
- PaCo-RL: Advancing Reinforcement Learning for Consistent Image Generation with Pairwise Reward Modeling
- LaFiTe: A Generative Latent Field for 3D Native Texturing
- EMMA: Efficient Multimodal Understanding, Generation, and Editing with a Unified Architecture
- RobustSplat++: Decoupling Densification, Dynamics, and Illumination for In-the-Wild 3DGS
- LatentFM: A Latent Flow Matching Approach for Generative Medical Image Segmentation
- FreeGen: Feed-Forward Reconstruction-Generation Co-Training for Free-Viewpoint Driving Scene Synthesis
- A Sanity Check for Multi-In-Domain Face Forgery Detection in the Real World
- Autoregressive Image Generation Needs Only a Few Lines of Cached Tokens
- Contact-Aware Refinement of Human Pose Pseudo-Ground Truth via Bioimpedance Sensing
- SP-Det: Self-Prompted Dual-Text Fusion for Generalized Multi-Label Lesion Detection
- SDG-Track: A Heterogeneous Observer-Follower Framework for High-Resolution UAV Tracking on Embedded Platforms
- You Only Train Once (YOTO): A Retraining-Free Object Detection Framework
- Equivariant Symmetry-Aware Head Pose Estimation for Fetal MRI
- ReflexFlow: Rethinking Learning Objective for Exposure Bias Alleviation in Flow Matching
- Semantics Lead the Way: Harmonizing Semantic and Texture Modeling with Asynchronous Latent Diffusion
- Virtually Unrolling the Herculaneum Papyri by Diffeomorphic Spiral Fitting
- LiteVGGT: Boosting Vanilla VGGT via Geometry-aware Cached Token Merging
- Towards Adaptive Fusion of Multimodal Deep Networks for Human Action Recognition
- FASTer: Toward Efficient Autoregressive Vision Language Action Modeling via neural Action Tokenization
- GeoPE:A Unified Geometric Positional Embedding for Structured Tensors
- Balanced Few-Shot Episodic Learning for Accurate Retinal Disease Diagnosis
- Stable Single-Pixel Contrastive Learning for Semantic and Geometric Tasks
- Back to Basics: Motion Representation Matters for Human Motion Generation Using Diffusion Model
- UltraImage: Rethinking Resolution Extrapolation in Image Diffusion Transformers
- DuGI-MAE: Improving Infrared Mask Autoencoders via Dual-Domain Guidance
- EgoLCD: Egocentric Video Generation with Long Context Diffusion
- VideoSSM: Autoregressive Long Video Generation with Hybrid State-Space Memory
- Boundary-Aware Test-Time Adaptation for Zero-Shot Medical Image Segmentation
- WiFi-based Cross-Domain Gesture Recognition Using Attention Mechanism
- Identity Clue Refinement and Enhancement for Visible-Infrared Person Re-Identification
- Auto3R: Automated 3D Reconstruction and Scanning via Data-driven Uncertainty Quantification
- PhyVLLM: Physics-Guided Video Language Model with Motion-Appearance Disentanglement
- Refa\c{c}ade: Editing Object with Given Reference Texture
- Detection of Intoxicated Individuals from Facial Video Sequences via a Recurrent Fusion Model
- X-Humanoid: Robotize Human Videos to Generate Humanoid Videos at Scale
- VideoMem: Enhancing Ultra-Long Video Understanding via Adaptive Memory Management
- Gaussian Entropy Fields: Driving Adaptive Sparsity in 3D Gaussian Optimization
- Counterfeit Answers: Adversarial Forgery against OCR-Free Document Visual Question Answering
- COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence
- Dataset creation for supervised deep learning-based analysis of microscopic images - review of important considerations and recommendations
- Prompt2Craft: Generating Functional Craft Assemblies with LLMs
- TARDis: Time Attenuated Representation Disentanglement for Incomplete Multi-Modal Tumor Segmentation and Classification
- Infrared UAV Target Tracking with Dynamic Feature Refinement and Global Contextual Attention Knowledge Distillation
- SAM3-I: Segment Anything with Instructions
- Malicious Image Analysis via Vision-Language Segmentation Fusion: Detection, Element, and Location in One-shot
- Denoise to Track: Harnessing Video Diffusion Priors for Robust Correspondence
- I2I-Bench: A Comprehensive Benchmark Suite for Image-to-Image Editing Models
- Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length
- Reward Forcing: Efficient Streaming Video Generation with Rewarded Distribution Matching Distillation
- UniLight: A Unified Representation for Lighting
- Learning Single-Image Super-Resolution in the JPEG Compressed Domain
- Gamma-from-Mono: Road-Relative, Metric, Self-Supervised Monocular Geometry for Vehicular Applications
- How (Mis)calibrated is Your Federated CLIP and What To Do About It?
- Real-time Cricket Sorting By Sex
- Mind-to-Face: Neural-Driven Photorealistic Avatar Synthesis via EEG Decoding
- DisentangleFormer: Spatial-Channel Decoupling for Multi-Channel Vision
- SyncTrack4D: Cross-Video Motion Alignment and Video Synchronization for Multi-Video 4D Gaussian Splatting
- A Retrieval-Augmented Generation Approach to Extracting Algorithmic Logic from Neural Networks
- Open Set Face Forgery Detection via Dual-Level Evidence Collection
- MAFNet:Multi-frequency Adaptive Fusion Network for Real-time Stereo Matching
- FMA-Net++: Motion- and Exposure-Aware Real-World Joint Video Super-Resolution and Deblurring
- Fourier-Attentive Representation Learning: A Fourier-Guided Framework for Few-Shot Generalization in Vision-Language Models
- Performance Evaluation of Transfer Learning Based Medical Image Classification Techniques for Disease Detection
- Dual-Stream Spectral Decoupling Distillation for Remote Sensing Object Detection
- UTrice: Unifying Primitives in Differentiable Ray Tracing and Rasterization via Triangles for Particle-Based 3D Scenes
- Explainable Parkinsons Disease Gait Recognition Using Multimodal RGB-D Fusion and Large Language Models
- Self-Paced and Self-Corrective Masked Prediction for Movie Trailer Generation
- MindDrive: An All-in-One Framework Bridging World Models and Vision-Language Model for End-to-End Autonomous Driving
- StreamEQA: Towards Streaming Video Understanding for Embodied Scenarios
- GuidNoise: Single-Pair Guided Diffusion for Generalized Noise Synthesis
- dVLM-AD: Enhance Diffusion Vision-Language-Model for Driving via Controllable Reasoning
- UniTS: Unified Time Series Generative Model for Remote Sensing
- DeRA: Decoupled Representation Alignment for Video Tokenization
- Not All Birds Look The Same: Identity-Preserving Generation For Birds
- Controllable Long-term Motion Generation with Extended Joint Targets
- Shift-Window Meets Dual Attention: A Multi-Model Architecture for Specular Highlight Removal
- Dual-branch Prompting for Multimodal Machine Translation
- Beyond Flicker: Detecting Kinematic Inconsistencies for Generalizable Deepfake Video Detection
- OnSight Pathology: A real-time platform-agnostic computational pathology companion for histopathology
- Look Around and Pay Attention: Multi-camera Point Tracking Reimagined with Transformers
- Generalized Event Partonomy Inference with Structured Hierarchical Predictive Learning
- MoReGen: Multi-Agent Motion-Reasoning Engine for Code-based Text-to-Video Synthesis
- ReasonX: MLLM-Guided Intrinsic Image Decomposition
- 6 Fingers, 1 Kidney: Natural Adversarial Medical Images Reveal Critical Weaknesses of Vision-Language Models
- MVRoom: Controllable 3D Indoor Scene Generation with Multi-View Diffusion Models
- Geschlechts\"ubergreifende Maskulina im Sprachgebrauch Eine korpusbasierte Untersuchung zu lexemspezifischen Unterschieden
- OsmT: Bridging OpenStreetMap Queries and Natural Language with Open-source Tag-aware Language Models
- SignRoundV2: Closing the Performance Gap in Extremely Low-Bit Post-Training Quantization for LLMs
- Model Whisper: Steering Vectors Unlock Large Language Models' Potential in Test-time
- EtCon: Edit-then-Consolidate for Reliable Knowledge Editing
- Challenging the Abilities of Large Language Models in Italian: a Community Initiative
- AdiBhashaa: A Community-Curated Benchmark for Machine Translation into Indian Tribal Languages
- DaLA: Danish Linguistic Acceptability Evaluation Guided by Real World Errors
- DAMASHA: Detecting AI in Mixed Adversarial Texts via Segmentation with Human-interpretable Attribution
- Mitigating Catastrophic Forgetting in Target Language Adaptation of LLMs via Source-Shielded Updates
- SEAL: Self-Evolving Agentic Learning for Conversational Question Answering over Knowledge Graphs
- LLMs Know More Than Words: A Genre Study with Syntax, Metaphor & Phonetics
- Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction
- Factuality and Transparency Are All RAG Needs! Self-Explaining Contrastive Evidence Re-ranking
- Human-Centred Evaluation of Text-to-Image Generation Models for Self-expression of Mental Distress: A Dataset Based on GPT-4o
- Retrieval-Augmented Few-Shot Prompting Versus Fine-Tuning for Code Vulnerability Detection
- Towards Contextual Sensitive Data Detection
- Can machines perform a qualitative data analysis? Reading the debate with Alan Turing
- Balancing Safety and Helpfulness in Healthcare AI Assistants through Iterative Preference Alignment
- Text-Only Training for Image Captioning with Retrieval Augmentation and Modality Gap Correction
- Limit cycles for speech
- Topology Matters: Measuring Memory Leakage in Multi-Agent LLMs
- Towards Ethical Multi-Agent Systems of Large Language Models: A Mechanistic Interpretability Perspective
- Are LLMs Truly Multilingual? Exploring Zero-Shot Multilingual Capability of LLMs for Information Retrieval: An Italian Healthcare Use Case
- The AI Consumer Index (ACE)
- Algorithmic Thinking Theory
- One-shot acceleration of transient PDE solvers via online-learned preconditioners
- On GRPO Collapse in Search-R1: The Lazy Likelihood-Displacement Death Spiral
- SQuARE: Structured Query & Adaptive Retrieval Engine For Tabular Formats
- DAComp: Benchmarking Data Agents across the Full Data Intelligence Lifecycle
- ClusterFusion: Hybrid Clustering with Embedding Guidance and LLM Adaptation
- LangSAT: A Novel Framework Combining NLP and Reinforcement Learning for SAT Solving
- MASE: Interpretable NLP Models via Model-Agnostic Saliency Estimation
- RapidUn: Influence-Driven Parameter Reweighting for Efficient Large Language Model Unlearning
- MSME: A Multi-Stage Multi-Expert Framework for Zero-Shot Stance Detection
- UW-BioNLP at ChemoTimelines 2025: Thinking, Fine-Tuning, and Dictionary-Enhanced LLM Systems for Chemotherapy Timeline Extraction
- EvoEdit: Lifelong Free-Text Knowledge Editing through Latent Perturbation Augmentation and Knowledge-driven Parameter Fusion
- AdmTree: Compressing Lengthy Context with Adaptive Semantic Trees
- ADAPT: Learning Task Mixtures for Budget-Constrained Instruction Tuning
- LexGenius: An Expert-Level Benchmark for Large Language Models in Legal General Intelligence
- ArterialNet: Reconstructing Arterial Blood Pressure Waveform with Wearable Pulsatile Signals, a Cohort-Aware Approach
- Convolutional Monge Mapping between EEG Datasets to Support Independent Component Labeling
- Beyond I-Con: Exploring New Dimension of Distance Measures in Representation Learning
- Towards an end-to-end artificial intelligence driven global weather forecasting system
- Contract-Driven QoE Auditing for Speech and Singing Services: From MOS Regression to Service Graphs
- Tokenizing Buildings: A Transformer for Layout Synthesis
- Series of quasi-uniform scatterings with fast search, root systems and neural network classifications
- STELLA: Guiding Large Language Models for Time Series Forecasting with Semantic Abstractions
- Shorting Dynamics and Structured Kernel Regularization
- Environment-Aware Channel Inference via Cross-Modal Flow: From Multimodal Sensing to Wireless Channels
- Rethinking the Use of Vision Transformers for AI-Generated Image Detection
- Learning Causality for Longitudinal Data
- Aligned but Stereotypical? The Hidden Influence of System Prompts on Social Bias in LVLM-Based Text-to-Image Models
- Towards a unified framework for guided diffusion models
- Evolutionary Architecture Search through Grammar-Based Sequence Alignment
- HTR-ConvText: Leveraging Convolution and Textual Information for Handwritten Text Recognition
- Model-Free Assessment of Simulator Fidelity via Quantile Curves
- Arbitrage: Efficient Reasoning via Advantage-Aware Speculation
- QKAN-LSTM: Quantum-inspired Kolmogorov-Arnold Long Short-term Memory
- Meta-Learning for Quantum Optimization via Quantum Sequence Model
- Control Consistency Losses for Diffusion Bridges
- Foundations of Diffusion Models in General State Spaces: A Self-Contained Introduction
- Structured Document Translation via Format Reinforcement Learning
- Semantic Soft Bootstrapping: Long Context Reasoning in LLMs without Reinforcement Learning
- NeuralRemaster: Phase-Preserving Diffusion for Structure-Aligned Generation
- DraCo: Draft as CoT for Text-to-Image Preview and Rare Concept Generation
- Safe Online Bid Optimization with Return on Investment and Budget Constraints
- ImageNot: A contrast with ImageNet preserves model rankings
- FusionBench: A Unified Library and Comprehensive Benchmark for Deep Model Fusion
- NITRO-D: Native Integer-only Training of Deep Convolutional Neural Networks
- Convergence Analysis for Deep Sparse Coding via Convolutional Neural Networks
- Educational Cone Model in Embedding Vector Spaces
- Computational Linguistics Meets Libyan Dialect: A Study on Dialect Identification
- Polynomiogram: An Integrated Framework for Root Visualization and Generative Art
- The Geometry of Benchmarks: A New Path Toward AGI
- Inference-time Stochastic Refinement of GRU-Normalizing Flow for Real-time Video Motion Transfer
- Plug-and-Play Image Restoration with Flow Matching: A Continuous Viewpoint
- Bayes-DIC Net: Estimating Digital Image Correlation Uncertainty with Bayesian Neural Networks
- One Detector Fits All: Robust and Adaptive Detection of Malicious Packages from PyPI to Enterprises
- Mitigating Object and Action Hallucinations in Multimodal LLMs via Self-Augmented Contrastive Alignment
- AutoGuard: A Self-Healing Proactive Security Layer for DevSecOps Pipelines Using Reinforcement Learning
- Constructive Approximation under Carleman's Condition, with Applications to Smoothed Analysis
- Informative missingness and its implications in semi-supervised learning
- Sarcasm Detection on Reddit Using Classical Machine Learning and Feature Engineering
- Predicting Time-Dependent Flow Over Complex Geometries Using Operator Networks
- NORi: An ML-Augmented Ocean Boundary Layer Parameterization
- Mathematical Framing for Different Agent Strategies
- When Robots Should Say "I Don't Know": Benchmarking Abstention in Embodied Question Answering
- SEASON: Mitigating Temporal Hallucination in Video Large Language Models via Self-Diagnostic Contrastive Decoding
- Semi Centralized Training Decentralized Execution Architecture for Multi Agent Deep Reinforcement Learning in Traffic Signal Control
- Fermionic neural Gibbs states
- Recurrent Neural Networks with Linear Structures for Electricity Price Forecasting
- Provable FDR Control for Deep Feature Selection: Deep MLPs and Beyond
- Continuous-time reinforcement learning for optimal switching over multiple regimes
- Sequential Enumeration in Large Language Models
- Complementary Characterization of Agent-Based Models via Computational Mechanics and Diffusion Models
- Pick-to-Learn for Systems and Control: Data-driven Synthesis with State-of-the-art Safety Guarantees
- TimesNet-Gen: Deep Learning-based Site Specific Strong Motion Generation
- TRINITY: An Evolved LLM Coordinator
- Towards Continuous-Time Approximations for Stochastic Gradient Descent without Replacement
- A Tutorial on Regression Analysis: From Linear Models to Deep Learning -- Lecture Notes on Artificial Intelligence
- RLHFSpec: Breaking the Efficiency Bottleneck in RLHF Training via Adaptive Drafting
- MemLoRA: Distilling Expert Adapters for On-Device Memory Systems
- A result relating convex n-widths to covering numbers with some applications to neural networks
- Multi-Agent Reinforcement Learning for Intraday Operating Rooms Scheduling under Uncertainty
- CARL: Critical Action Focused Reinforcement Learning for Multi-Step Agent
- Amortized Inference of Multi-Modal Posteriors using Likelihood-Weighted Normalizing Flows
- Realizable Abstractions: Near-Optimal Hierarchical Reinforcement Learning
- Efficient Generative Transformer Operators For Million-Point PDEs
- Dual-Path Region-Guided Attention Network for Ground Reaction Force and Moment Regression
- SuperActivators: Only the Tail of the Distribution Contains Reliable Concept Signals
- Multi-LLM Collaboration for Medication Recommendation
- Hybrid Quantum-Classical Autoencoders for Unsupervised Network Intrusion Detection
- David vs. Goliath: Can Small Models Win Big with Agentic AI in Hardware Design?
- OMTRA: A Multi-Task Generative Model for Structure-Based Drug Design
- Gradient Descent with Provably Tuned Learning-rate Schedules
- The Geometry of Intelligence: Deterministic Functional Topology as a Foundation for Real-World Perception
- TV2TV: A Unified Framework for Interleaved Language and Video Generation
- Deep infant brain segmentation from multi-contrast MRI
- Value Gradient Guidance for Flow Matching Alignment
- The Universal Weight Subspace Hypothesis
- Rethinking AI Evaluation in Education: The TEACH-AI Framework and Benchmark for Generative AI Assistants
- AI-Enabled grading with near-domain data for scaling feedback with human-level accuracy
- Patient Safety Risks from AI Scribes: Signals from End-User Feedback
- Measuring Agents in Production
- Enhancing next token prediction based pre-training for jet foundation models
- The Initialization Determines Whether In-Context Learning Is Gradient Descent
- Bootstrapped Mixed Rewards for RL Post-Training: Injecting Canonical Action Order
- GRASP: GRouped Activation Shared Parameterization for Parameter-Efficient Fine-Tuning and Robust Inference of Transformers
- When do spectral gradient updates help in deep learning?
- Evaluating Long-Context Reasoning in LLM-Based WebAgents
- RNNs perform task computations by dynamically warping neural representations
- Data-regularized Reinforcement Learning for Diffusion Models at Scale
- RGE-GCN: Recursive Gene Elimination with Graph Convolutional Networks for RNA-seq based Early Cancer Detection
- Long-Horizon Model-Based Offline Reinforcement Learning Without Conservatism
- Distance Is All You Need: Radial Dispersion for Uncertainty Estimation in Large Language Models
- SmartAlert: Implementing Machine Learning-Driven Clinical Decision Support for Inpatient Lab Utilization Reduction
- STeP-Diff: Spatio-Temporal Physics-Informed Diffusion Models for Mobile Fine-Grained Pollution Forecasting
- Learning to Orchestrate Agents in Natural Language with the Conductor
- Feature Engineering vs. Deep Learning for Automated Coin Grading: A Comparative Study on Saint-Gaudens Double Eagles
- GraphBench: Next-generation graph learning benchmarking
- Context-Aware Mixture-of-Experts Inference on CXL-Enabled GPU-NDP Systems
- Prototype-Based Semantic Consistency Alignment for Domain Adaptive Retrieval
- Explainable Graph Representation Learning via Graph Pattern Analysis
- On the Limits of Test-Time Compute: Sequential Reward Filtering for Better Inference
- Diffusion Fine-Tuning via Reparameterized Policy Gradient of the Soft Q-Function
- LeMat-GenBench: A Unified Evaluation Framework for Crystal Generative Models
- Reliable Statistical Guarantees for Conformal Predictors with Small Datasets
- Temp-SCONE: A Novel Out-of-Distribution Detection and Domain Generalization Framework for Wild Data with Temporal Shift
- Exploiting \texttt{ftrace}'s \texttt{function\_graph} Tracer Features for Machine Learning: A Case Study on Encryption Detection
- QoSDiff: An Implicit Topological Embedding Learning Framework Leveraging Denoising Diffusion and Adversarial Attention for Robust QoS Prediction
- Natural Language Actor-Critic: Scalable Off-Policy Learning in Language Space
- Score Matching for Estimating Finite Point Processes
- Rethinking Decoupled Knowledge Distillation: A Predictive Distribution Perspective
- Federated Learning for Anomaly Detection in Maritime Movement Data
- Contract-Governed Training for Earth Observation: Observed Service Agreement Graphs and Coverage-Accuracy Trade-offs
- ASCIIBench: Evaluating Language-Model-Based Understanding of Visually-Oriented Text
- Decoding Large Language Diffusion Models with Foreseeing Movement
- MechDetect: Detecting Data-Dependent Errors
- Mitigating the Curse of Detail: Scaling Arguments for Feature Learning and Sample Complexity
- BEP: A Binary Error Propagation Algorithm for Binary Neural Networks Training
- Network of Theseus (like the ship)
- ActVAE: Modelling human activity schedules with a deep conditional generative approach
- Fine-Tuning ChemBERTa for Predicting Inhibitory Activity Against TDP1 Using Deep Learning
- Studying Various Activation Functions and Non-IID Data for Machine Learning Model Robustness
- Sponsored Questions and How to Auction Them
- TARA Test-by-Adaptive-Ranks for Quantum Anomaly Detection with Conformal Prediction Guarantees
- Large Language Models for Limited Noisy Data: A Gravitational Wave Identification Study
- Polarization by Design: How Elites Could Shape Mass Preferences as AI Reduces Persuasion Costs
- Mobile-Agent-RAG: Driving Smart Multi-Agent Coordination with Contextual Knowledge Empowerment for Long-Horizon Mobile Automation
- Large Language Model-Based Agents for Software Engineering: A Survey
- A Survey on Recommendation Unlearning: Fundamentals, Taxonomy, Evaluation, and Open Questions
- Public Sentiment Analysis of Traffic Management Policies in Knoxville: A Social Media Driven Study
- The BEAT-CF Causal Model: A model for guiding the design of trials and observational analyses of cystic fibrosis exacerbations
- Lost in Modality: Evaluating the Effectiveness of Text-Based Membership Inference Attacks on Large Multimodal Models
- Thucy: An LLM-based Multi-Agent System for Claim Verification across Relational Databases
- BookRAG: A Hierarchical Structure-aware Index-based Approach for Retrieval-Augmented Generation on Complex Documents
- World Models for Autonomous Navigation of Terrestrial Robots from LIDAR Observations
- AsymPuzl: An Asymmetric Puzzle for multi-agent cooperation
- Cell-cell communication inference and analysis: biological mechanisms, computational approaches, and future opportunities
- A Learning-based Control Methodology for Transitioning VTOL UAVs
- State Space Models for Bioacoustics: A comparative Evaluation with Transformers
- KVNAND: Efficient On-Device Large Language Model Inference Using DRAM-Free In-Flash Computing
- Matrix Editing Meets Fair Clustering: Parameterized Algorithms and Complexity
- Context-Aware Hierarchical Learning: A Two-Step Paradigm towards Safer LLMs
- AI/ML in 3GPP 5G Advanced - Services and Architecture
- Bayesian Optimization for Automatic Tuning of Torque-Level Nonlinear Model Predictive Control
- MPCFormer: A physics-informed data-driven approach for explainable socially-aware autonomous driving
- Hierarchical Vision Language Action Model Using Success and Failure Demonstrations
- Beyond the Black Box: A Cognitive Architecture for Explainable and Aligned AI
- When Do Symbolic Solvers Enhance Reasoning in Large Language Models?
- Prior preferences in active inference agents: soft, hard, and goal shaping
- Evaluating Generalization Capabilities of LLM-Based Agents in Mixed-Motive Scenarios Using Concordia
- Multimodal Reinforcement Learning with Agentic Verifier for AI Agents
- Multi-Agent Reinforcement Learning with Communication-Constrained Priors
- PARC: An Autonomous Self-Reflective Coding Agent for Robust Execution of Long-Horizon Tasks
- Reason-Plan-ReAct: A Reasoner-Planner Supervising a ReAct Executor for Complex Enterprise Tasks
- DeepRule: An Integrated Framework for Automated Business Rule Generation via Deep Predictive Modeling and Hybrid Search Optimization
- MemVerse: Multimodal Memory for Lifelong Learning Agents
- RoCo: Role-Based LLMs Collaboration for Automatic Heuristic Design
- Omni-AutoThink: Adaptive Multimodal Reasoning via Reinforcement Learning
- A Hierarchical Tree-based approach for creating Configurable and Static Deep Research Agent (Static-DRA)
- Autonomous Agents and Policy Compliance: A Framework for Reasoning About Penalties
- Benchmark for Planning and Control with Large Language Model Agents: Blocksworld with Model Context Protocol
- AI-Driven Document Redaction in UK Public Authorities: Implementation Gaps, Regulatory Challenges, and the Human Oversight Imperative
- Quantifying the Potential to Escape Filter Bubbles: A Behavior-Aware Measure via Contrastive Simulation
- Echoes of AI Harms: A Human-LLM Synergistic Framework for Bias-Driven Harm Anticipation
- Economies of Open Intelligence: Tracing Power & Participation in the Model Ecosystem
- Will Power Return to the Clouds? From Divine Authority to GenAI Authority
- Irresponsible AI: big tech's influence on AI research and associated impacts
- AtomDisc: An Atom-level Tokenizer that Boosts Molecular LLMs and Reveals Structure--Property Associations
- Beyond Code Pairs: Dialogue-Based Data Generation for LLM Code Translation
- When Harmful Content Gets Camouflaged: Unveiling Perception Failure of LVLMs with CamHarmTI
- Community Quality and Influence Maximization: An Empirical Study
- Ensemble Privacy Defense for Knowledge-Intensive LLMs against Membership Inference Attacks
Research Sources: 336 | Generated: 12/5/2025
