AI RESEARCH PAPERS & ACADEMIC SOURCES
- AU-Harness: An Open-Source Toolkit for Holistic Evaluation of Audio LLMs
- Symmetry-Guided Multi-Agent Inverse Reinforcement Learning
- Towards Reliable Medical Image Segmentation by Modeling Evidential Calibrated Uncertainty
- UnsafeBench: Benchmarking Image Safety Classifiers on Real-World and AI-Generated Images
- SiLVR: Scalable Lidar-Visual Radiance Field Reconstruction with Uncertainty Quantification
- Shaken, Not Stirred: A Novel Dataset for Visual Understanding of Glasses in Human-Robot Bartending Tasks
- ABS-Mamba: SAM2-Driven Bidirectional Spiral Mamba Network for Medical Image Translation
- Spec2VolCAMU-Net: A Spectrogram-to-Volume Model for EEG-to-fMRI Reconstruction based on Multi-directional Time-Frequency Convolutional Attention Encoder and Vision-Mamba U-Net
- C3VDv2 -- Colonoscopy 3D video dataset with enhanced realism
- SV-DRR: High-Fidelity Novel View X-Ray Synthesis Using Diffusion Model
- Dynamic Structural Recovery Parameters Enhance Prediction of Visual Outcomes After Macular Hole Surgery
- In-Loop Filtering Using Learned Look-Up Tables for Video Coding
- Dexplore: Scalable Neural Control for Dexterous Manipulation from Reference-Scoped Exploration
- Total Disentanglement of Font Images into Style and Character Class Features
- Automatic infant 2D pose estimation from videos: comparing seven deep neural network methods
- Attention-Guided Multi-scale Interaction Network for Face Super-Resolution
- The Oxford Spires Dataset: Benchmarking Large-Scale LiDAR-Visual Localisation, Reconstruction and Radiance Field Methods
- ForestSplats: Deformable transient field for Gaussian Splatting in the Wild
- GAPrompt: Geometry-Aware Point Cloud Prompt for 3D Vision Model
- AdvReal: Physical Adversarial Patch Generation Framework for Security Evaluation of Object Detection Systems
- TESSER: Transfer-Enhancing Adversarial Attacks from Vision Transformers via Spectral and Semantic Regularization
- An Improved U-Net Model for Offline handwriting signature denoising
- JAX-IK: Real-Time Inverse Kinematics for Generating Multi-Constrained Movements of Virtual Human Characters
- GEMINUS: Dual-aware Global and Scene-Adaptive Mixture-of-Experts for End-to-End Autonomous Driving
- A Fully Automatic Framework for Intracranial Pressure Grading: Integrating Keyframe Identification, ONSD Measurement and Clinical Data
- Unsupervised Integrated-Circuit Defect Segmentation via Image-Intrinsic Normality
- Decoupling Clinical and Class-Agnostic Features for Reliable Few-Shot Adaptation under Shift
- FS-Diff: Semantic guidance and clarity-aware simultaneous multimodal image fusion and super-resolution
- FlexiD-Fuse: Flexible number of inputs multi-modal medical image fusion based on diffusion model
- Improving Human Motion Plausibility with Body Momentum
- Region-Wise Correspondence Prediction between Manga Line Art Images
- Generative Diffusion Contrastive Network for Multi-View Clustering
- DualTrack: Sensorless 3D Ultrasound needs Local and Global Context
- InterAct: Advancing Large-Scale Versatile 3D Human-Object Interaction Generation
- PeftCD: Leveraging Vision Foundation Models with Parameter-Efficient Fine-Tuning for Remote Sensing Change Detection
- Visual Grounding from Event Cameras
- Kling-Avatar: Grounding Multimodal Instructions for Cascaded Long-Duration Avatar Animation Synthesis
- Measuring Epistemic Humility in Multimodal Large Language Models
- Can Understanding and Generation Truly Benefit Together -- or Just Coexist?
- Geometric Neural Distance Fields for Learning Human Motion Priors
- Locality in Image Diffusion Models Emerges from Data Statistics
- SpatialVID: A Large-Scale Video Dataset with Spatial Annotations
- DeepTV: A neural network approach for total variation minimization
- CameraVDP: Perceptual Display Assessment with Uncertainty Estimation via Camera and Visual Difference Prediction
- Ultrafast Deep Learning-Based Scatter Estimation in Cone-Beam Computed Tomography
- Zero-shot Hierarchical Plant Segmentation via Foundation Segmentation Models and Text-to-image Attention
- Gradient-Attention Guided Dual-Masking Synergetic Framework for Robust Text-based Person Retrieval
- ALL-PET: A Low-resource and Low-shot PET Foundation Model in the Projection Domain
- Noise-Robust Topology Estimation of 2D Image Data via Neural Networks and Persistent Homology
- RT-DETR++ for UAV Object Detection
- CWSSNet: Hyperspectral Image Classification Enhanced by Wavelet Domain Convolution
- Bridging the Gap Between Ideal and Real-world Evaluation: Benchmarking AI-Generated Image Detection in Challenging Scenarios
- VQualA 2025 Challenge on Visual Quality Comparison for Large Multimodal Models: Methods and Results
- MGTraj: Multi-Granularity Goal-Guided Human Trajectory Prediction with Recursive Refinement Network
- Medverse: A Universal Model for Full-Resolution 3D Medical Image Segmentation, Transformation and Enhancement
- Towards Better Dental AI: A Multimodal Benchmark and Instruction Dataset for Panoramic X-ray Analysis
- DATE: Dynamic Absolute Time Enhancement for Long Video Understanding
- Unified Start, Personalized End: Progressive Pruning for Efficient 3D Medical Image Segmentation
- Visual Programmability: A Guide for Code-as-Thought in Chart Understanding
- Learning Object-Centric Representations in SAR Images with Multi-Level Feature Fusion
- You Share Beliefs, I Adapt: Progressive Heterogeneous Collaborative Perception
- Image Recognition with Vision and Language Embeddings of VLMs
- Fine-Grained Customized Fashion Design with Image-into-Prompt benchmark and dataset from LMM
- Texture-aware Intrinsic Image Decomposition with Model- and Learning-based Priors
- Plug-and-play Diffusion Models for Image Compressive Sensing with Data Consistency Projection
- Diffusion-Based Action Recognition Generalizes to Untrained Domains
- SFD-Mamba2Net: Structure-Guided Frequency-Enhanced Dual-Stream Mamba2 Network for Coronary Artery Segmentation
- Live(r) Die: Predicting Survival in Colorectal Liver Metastasis
- Discovering Divergent Representations between Text-to-Image Models
- An U-Net-Based Deep Neural Network for Cloud Shadow and Sun-Glint Correction of Unmanned Aerial System (UAS) Imagery
- CoSwin: Convolution Enhanced Hierarchical Shifted Window Attention For Small-Scale Vision
- iMatcher: Improve matching in point cloud registration via local-to-global geometric consistency learning
- UltrON: Ultrasound Occupancy Networks
- E-MLNet: Enhanced Mutual Learning for Universal Domain Adaptation with Sample-Specific Weighting
- VoxelFormer: Parameter-Efficient Multi-Subject Visual Decoding from fMRI
- Integrating Anatomical Priors into a Causal Diffusion Model
- Enhancing 3D Medical Image Understanding with Pretraining Aided by 2D Multimodal Large Language Models
- Improvement of Human-Object Interaction Action Recognition Using Scene Information and Multi-Task Learning Approach
- IRDFusion: Iterative Relation-Map Difference guided Feature Fusion for Multispectral Object Detection
- S-BEVLoc: BEV-based Self-supervised Framework for Large-scale LiDAR Global Localization
- FPI-Det: a face-phone Interaction Dataset for phone-use detection and understanding
- Prompting the Market? A Large-Scale Meta-Analysis of GenAI in Finance NLP (2022-2025)
- LAVA: Language Model Assisted Verbal Autopsy for Cause-of-Death Determination
- Bridging the Capability Gap: Joint Alignment Tuning for Harmonizing LLM-based Multi-Agent Systems
- All for One: LLMs Solve Mental Math at the Last Token With Information Transferred From Other Tokens
- Generative Engine Optimization: How to Dominate AI Search
- COCO-Urdu: A Large-Scale Urdu Image-Caption Dataset with Multimodal Quality Estimation
- DiFlow-TTS: Discrete Flow Matching with Factorized Speech Tokens for Low-Latency Zero-Shot Text-To-Speech
- FLUX-Reason-6M & PRISM-Bench: A Million-Scale Text-to-Image Reasoning Dataset and Comprehensive Benchmark
- ASTPrompter: Preference-Aligned Automated Language Model Red-Teaming to Generate Low-Perplexity Unsafe Prompts
- Thinking with Many Minds: Using Large Language Models for Multi-Perspective Problem-Solving
- CondAmbigQA: A Benchmark and Dataset for Conditional Ambiguous Question Answering
- Are Generative Models Underconfident? Better Quality Estimation with Boosted Model Probability
- Culturally-Nuanced Story Generation for Reasoning in Low-Resource Languages: The Case of Javanese and Sundanese
- Uncertainty Quantification in Retrieval Augmented Question Answering
- CritiQ: Mining Data Quality Criteria from Human Preferences
- AdaptMI: Adaptive Skill-based In-context Math Instruction for Small Language Models
- A Novel Data Augmentation Approach for Automatic Speaking Assessment on Opinion Expressions
- The NTNU System at the S&I Challenge 2025 SLA Open Track
- ReceiptSense: Beyond Traditional OCR -- A Dataset for Receipt Understanding
- Noise or Nuance: An Investigation Into Useful Information and Filtering For LLM Driven AKBC
- Automated Evidence Extraction and Scoring for Corporate Climate Policy Engagement: A Multilingual RAG Approach
- Documents Are People and Words Are Items: A Psychometric Approach to Textual Data with Contextual Embeddings
- BRoverbs -- Measuring how much LLMs understand Portuguese proverbs
- MR-UIE: Multi-Perspective Reasoning with Reinforcement Learning for Universal Information Extraction
- TigerCoder: A Novel Suite of LLMs for Code Generation in Bangla
- Compass-v3: Scaling Domain-Specific LLMs for Multilingual E-Commerce in Southeast Asia
- LITcoder: A General-Purpose Library for Building and Comparing Encoding Models
- GmSLM: Generative Marmoset Spoken Language Modeling
- CCF: A Context Compression Framework for Efficient Long-Sequence Language Modeling
- Reading Between the Lines: Classifying Resume Seniority with Large Language Models
- Agentic LLMs for Question Answering over Tabular Data
- From scratch to silver: Creating trustworthy training data for patent-SDG classification using Large Language Models
- MetaRAG: Metamorphic Testing for Hallucination Detection in RAG Systems
- Modelling Analogies and Analogical Reasoning: Connecting Cognitive Science Theory and NLP Research
- Hierarchical Bracketing Encodings Work for Dependency Graphs
- GrACE: A Generative Approach to Better Confidence Elicitation in Large Language Models
- Mitigating Language Barriers in Education: Developing Multilingual Digital Learning Materials with Machine Translation
- SimMark: A Robust Sentence-Level Similarity-Based Watermarking Algorithm for Large Language Models
- Scalable Evaluation of Online Facilitation Strategies via Synthetic Simulation of Discussions
- Harmonia: A Multi-Agent Reinforcement Learning Approach to Data Placement and Migration in Hybrid Storage Systems
- Quantum-Assisted Machine Learning Models for Enhanced Weather Prediction
- ACE: A Security Architecture for LLM-Integrated App Systems
- Imagine, Verify, Execute: Memory-guided Agentic Exploration with Vision-Language Models
- Self-Optimizing Machine Learning Potential Assisted Automated Workflow for Highly Efficient Complex Systems Material Design
- Inferring entropy production in many-body systems using nonequilibrium MaxEnt
- Modular Jump Gaussian Processes
- Efficient Optimization Accelerator Framework for Multistate Ising Problems
- A User-Centric, Privacy-Preserving, and Verifiable Ecosystem for Personal Data Management and Utilization
- Bridging Simplicity and Sophistication using GLinear: A Novel Architecture for Enhanced Time Series Prediction
- Investigating Energy Efficiency and Performance Trade-offs in LLM Inference Across Tasks and DVFS Settings
- Near-Optimal Sample Complexity in Reward-Free Kernel-Based Reinforcement Learning
- Revisiting Non-Acyclic GFlowNets in Discrete Environments
- Adaptive kernel predictors from feature-learning infinite limits of neural networks
- MOLLM: Multi-Objective Large Language Model for Molecular Design -- Optimizing with Experts
- A Vector-Quantized Foundation Model for Patient Behavior Monitoring
- Variance-Aware Noisy Training: Hardening DNNs against Unstable Analog Computations
- Convergence Analysis of Asynchronous Federated Learning with Gradient Compression for Non-Convex Optimization
- Temporal Query Network for Efficient Multivariate Time Series Forecasting
- Towards Robust Influence Functions with Flat Validation Minima
- Development and Comparative Evaluation of Three Artificial Intelligence Models (NLP, LLM, JEPA) for Predicting Triage in Emergency Departments: A 7-Month Retrospective Proof-of-Concept
- DivMerge: A divergence-based model merging method for multi-tasking
- Iterative Methods for Full-Scale Gaussian Process Approximations for Large Spatial Data
- Sigma Flows for Image and Data Labeling and Learning Structured Prediction
- Examining Different Research Communities: Authorship Network
- Average Causal Effect Estimation in DAGs with Hidden Variables: Beyond Back-Door and Front-Door Criteria
- Extended Neural Contractive Dynamical Systems: On Multiple Tasks and Riemannian Safety Regions
- Physics consistent machine learning framework for inverse modeling with applications to ICF capsule implosions
- Capability-Aware Shared Hypernetworks for Flexible Heterogeneous Multi-Robot Coordination
- Model-Agnostic Open-Set Air-to-Air Visual Object Detection for Reliable UAV Perception
- Exploring Pre-training Across Domains for Few-Shot Surgical Skill Assessment
- Low-degree lower bounds via almost orthonormal bases
- Expressive Power of Deep Networks on Manifolds: Simultaneous Approximation
- Representation-Aware Distributionally Robust Optimization: A Knowledge Transfer Framework
- Semantic Concentration for Self-Supervised Dense Representations Learning
- Database Views as Explanations for Relational Deep Learning
- DeMeVa at LeWiDi-2025: Modeling Perspectives with In-Context Learning and Label Distribution Learning
- Finite Scalar Quantization Enables Redundant and Transmission-Robust Neural Audio Compression at Low Bit-rates
- What Does Normal Even Mean? Evaluating Benign Traffic in Intrusion Detection Datasets
- Personality-Enhanced Social Recommendations in SAMI: Exploring the Role of Personality Detection in Matchmaking
- Steering MoE LLMs via Expert (De)Activation
- On the Relationship Between Adversarial Robustness and Decision Region in Deep Neural Networks
- Geometry and Stability of Supervised Learning Problems
- Attribution Regularization for Multimodal Paradigms
- AdaWaveNet: Adaptive Wavelet Network for Time Series Analysis
- Unveiling Multiple Descents in Unsupervised Autoencoders
- Understanding Large Language Models in Your Pockets: Performance Study on COTS Mobile Devices
- Tensor-Based Foundations of Ordinary Least Squares and Neural Network Regression Models
- Communication Compression for Distributed Learning without Control Variates
- AquaCast: Urban Water Dynamics Forecasting with Precipitation-Informed Multi-Input Transformer
- AEGIS: An Agent for Extraction and Geographic Identification in Scholarly Proceedings
- CountTRuCoLa: Rule Confidence Learning for Temporal Knowledge Graph Forecasting
- Balancing Utility and Privacy: Dynamically Private SGD with Random Projection
- PIPES: A Meta-dataset of Machine Learning Pipelines
- Cough Classification using Few-Shot Learning
- ProDiGy: Proximity- and Dissimilarity-Based Byzantine-Robust Federated Learning
- Conditioning on PDE Parameters to Generalise Deep Learning Emulation of Stochastic and Chaotic Dynamics
- ReBaNO: Reduced Basis Neural Operator Mitigating Generalization Gaps and Achieving Discretization Invariance
- Functional Groups are All you Need for Chemically Interpretable Molecular Property Prediction
- A Masked Representation Learning to Model Cardiac Functions Using Multiple Physiological Signals
- Decentralising LLM Alignment: A Case for Context, Pluralism, and Participation
- WarpPINN-fibers: improved cardiac strain estimation from cine-MR with physics-informed neural networks
- Deploying AI for Signal Processing education: Selected challenges and intriguing opportunities
- Convexity of Optimization Curves: Local Sharp Thresholds, Robustness Impossibility, and New Counterexamples
- Physics-informed waveform inversion using pretrained wavefield neural operators
- Generative quantum advantage for classical and quantum problems
- The Role of Community Detection Methods in Performance Variations of Graph Mining Tasks
- Scalable extensions to given-data Sobol' index estimators
- CryptGNN: Enabling Secure Inference for Graph Neural Networks
- Global Optimization of Stochastic Black-Box Functions with Arbitrary Noise Distributions using Wilson Score Kernel Density Estimation
- Value bounds and Convergence Analysis for Averages of LRP attributions
- Green Federated Learning via Carbon-Aware Client and Time Slot Scheduling
- Active Learning and Explainable AI for Multi-Objective Optimization of Spin Coated Polymers
- Fast attention mechanisms: a tale of parallelism
- Deep Context-Conditioned Anomaly Detection for Tabular Data
- "A 6 or a 9?": Ensemble Learning Through the Multiplicity of Performant Models and Explanations
- An entropy formula for the Deep Linear Network
- Sensitivity-LoRA: Low-Load Sensitivity-Based Fine-Tuning for Large Language Models
- Learning What Matters: Causal Time Series Modeling for Arctic Sea Ice Prediction
- Continuous-Time Value Iteration for Multi-Agent Reinforcement Learning
- Peering Partner Recommendation for ISPs using Machine Learning
- Quantum Machine Learning, Quantitative Trading, Reinforcement Learning, Deep Learning
- Clip Your Sequences Fairly: Enforcing Length Fairness for Sequence-Level RL
- Breaking the Statistical Similarity Trap in Extreme Convection Detection
- Identifying Key Features for Establishing Sustainable Agro-Tourism Centre: A Data Driven Approach
- Constructing a Question-Answering Simulator through the Distillation of LLMs
- Unsupervised Multi-Attention Meta Transformer for Rotating Machinery Fault Diagnosis
- Harnessing Uncertainty: Entropy-Modulated Policy Gradients for Long-Horizon LLM Agents
- Data Driven Discovery of Emergent Dynamics in Reaction Diffusion Systems from Sparse and Noisy Observations
- Kriging prior Regression: A Case for Kriging-Based Spatial Features with TabPFN in Soil Mapping
- Fused Lasso Improves Accuracy of Co-occurrence Network Inference in Grouped Samples
- Composable Score-based Graph Diffusion Model for Multi-Conditional Molecular Generation
- LiDAR-BIND-T: Improved and Temporally Consistent Sensor Modality Translation and Fusion for Robotic Applications
- TinyDef-DETR: A DETR-based Framework for Defect Detection in Transmission Lines from UAV Imagery
- On Synthesis of Timed Regular Expressions
- Beyond the Pre-Service Horizon: Infusing In-Service Behavior for Improved Financial Risk Forecasting
- Focusing by Contrastive Attention: Enhancing VLMs' Visual Reasoning
- Demo: Healthcare Agent Orchestrator (HAO) for Patient Summarization in Molecular Tumor Boards
- Corruption-Tolerant Asynchronous Q-Learning with Near-Optimal Rates
- Group Distributionally Robust Machine Learning under Group Level Distributional Uncertainty
- FoundationalECGNet: A Lightweight Foundational Model for ECG-based Multitask Cardiac Analysis
- Towards Adaptive Memory-Based Optimization for Enhanced Retrieval-Augmented Generation
- Critical Challenges and Guidelines in Evaluating Synthetic Tabular Data: A Systematic Review
- Early Exit and Multi Stage Knowledge Distillation in VLMs for Video Summarization
- Combating Falsification of Speech Videos with Live Optical Signatures (Extended Version)
- MM-Prompt: Cross-Modal Prompt Tuning for Continual Visual Question Answering
- Diffusion Graph Neural Networks for Robustness in Olfaction Sensors and Datasets
- Crack Path Prediction with Operator Learning using Discrete Particle System data Generation
- Task Matters: Knowledge Requirements Shape LLM Responses to Context-Memory Conflict
- Persistent Homology of Topic Networks for the Prediction of Reader Curiosity
- Uncertainty Estimation by Human Perception versus Neural Models
- Uncertainty-aware Diffusion and Reinforcement Learning for Joint Plane Localization and Anomaly Diagnosis in 3D Ultrasound
- Can Large Language Models Understand As Well As Apply Patent Regulations to Pass a Hands-On Patent Attorney Test?
- TreeGPT: Pure TreeFFN Encoder-Decoder Architecture for Structured Reasoning Without Attention Mechanisms
- CogGuide: Human-Like Guidance for Zero-Shot Omni-Modal Reasoning
- Inconsistency Handling in Prioritized Databases with Universal Constraints: Complexity Analysis and Links with Active Integrity Constraints
- Deep Reinforcement Learning for Inventory Networks: Toward Reliable Policy Optimization
- A minimal coalition logic
- Algorithmic Collusion by Large Language Models
- Semantic Augmentation in Images using Language
- Discovering physical laws with parallel symbolic enumeration
- Rethinking Disentanglement under Dependent Factors of Variation
- DeepVoting: Learning and Fine-Tuning Voting Rules with Canonical Embeddings
- RED: Unleashing Token-Level Rewards from Holistic Feedback via Reward Redistribution
- MERaLiON-SpeechEncoder: Towards a Speech Foundation Model for Singapore and Beyond
- Knowledge-Guided Biomarker Identification for Label-Free Single-Cell RNA-Seq Data: A Reinforcement Learning Perspective
- EgoAgent: A Joint Predictive Agent Model in Egocentric Worlds
- V-HOP: Visuo-Haptic 6D Object Pose Tracking
- MIND: Towards Immersive Psychological Healing with Multi-agent Inner Dialogue
- VeriSafe Agent: Safeguarding Mobile GUI Agent via Logic-based Action Verification
- Byzantine-Robust Federated Learning Using Generative Adversarial Networks
- SWI: Speaking with Intent in Large Language Models
- Entropy-Gated Branching for Efficient Test-Time Reasoning
- KROMA: Ontology Matching with Knowledge Retrieval and Large Language Models
- Robix: A Unified Model for Robot Interaction, Reasoning and Planning
- Improving Video Diffusion Transformer Training by Multi-Feature Fusion and Alignment from Self-Supervised Vision Encoders
- An improved educational competition optimizer with multi-covariance learning operators for global optimization problems
- Invisible Attributes, Visible Biases: Exploring Demographic Shortcuts in MRI-based Alzheimer's Disease Classification
- Fluent but Unfeeling: The Emotional Blind Spots of Language Models
- ObjectReact: Learning Object-Relative Control for Visual Navigation
- Graph Alignment via Dual-Pass Spectral Encoding and Latent Space Communication
- Mechanistic Learning with Guided Diffusion Models to Predict Spatio-Temporal Brain Tumor Growth
- LoCoBench: A Benchmark for Long-Context Large Language Models in Complex Software Engineering
- Explaining Concept Drift through the Evolution of Group Counterfactuals
- Retrieval-Augmented Generation for Reliable Interpretation of Radio Regulations
- Feasibility-Guided Fair Adaptive Offline Reinforcement Learning for Medicaid Care Management
- SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning
- CDE: Curiosity-Driven Exploration for Efficient Reinforcement Learning in Large Language Models
- ButterflyQuant: Ultra-low-bit LLM Quantization through Learnable Orthogonal Butterfly Transforms
- Enhancing Few-Shot Transfer Learning with Optimized Multi-Task Prompt Tuning through Modular Prompt Composition
- Simulating Human-like Daily Activities with Desire-driven Autonomy
- LLMs for sensory-motor control: Combining in-context and iterative learning
- Optimizing Length Compression in Large Reasoning Models
- Vejde: A Framework for Inductive Deep Reinforcement Learning Based on Factor Graph Color Refinement
- Virtual staining for 3D X-ray histology of bone implants
- CoAtNeXt: An Attention-Enhanced ConvNeXtV2-Transformer Hybrid Model for Gastric Tissue Classification
- Adaptive Knowledge Distillation using a Device-Aware Teacher for Low-Complexity Acoustic Scene Classification
- Modality-Agnostic Input Channels Enable Segmentation of Brain lesions in Multimodal MRI with Sequences Unavailable During Training
- Can Multimodal LLMs See Materials Clearly? A Multimodal Benchmark on Materials Characterization
- OmniEVA: Embodied Versatile Planner via Task-Adaptive 3D-Grounded and Embodiment-aware Reasoning
- MoSE: Unveiling Structural Patterns in Graphs via Mixture of Subgraph Experts
- Classification of Driver Behaviour Using External Observation Techniques for Autonomous Vehicles
- Robust Non-Linear Correlations via Polynomial Regression
- MetaLLMix: An XAI Aided LLM-Meta-learning Based Approach for Hyper-parameters Optimization
- LLMs Don't Know Their Own Decision Boundaries: The Unreliability of Self-Generated Counterfactual Explanations
- We're Still Doing It (All) Wrong: Recommender Systems, Fifteen Years Later
- ENSI: Efficient Non-Interactive Secure Inference for Large Language Models
- Resource-Efficient Glioma Segmentation on Sub-Saharan MRI
- Prompt Pirates Need a Map: Stealing Seeds helps Stealing Prompts
- OpenFake: An Open Dataset and Platform Toward Large-Scale Deepfake Detection
- Incorporating AI Incident Reporting into Telecommunications Law and Policy: Insights from India
- Explainable AI for Accelerated Microstructure Imaging: A SHAP-Guided Protocol on the Connectome 2.0 scanner
- Towards Explainable Job Title Matching: Leveraging Semantic Textual Relatedness and Knowledge Graphs
- A modified RIME algorithm with covariance learning and diversity enhancement for numerical optimization
- KoopMotion: Learning Almost Divergence Free Koopman Flow Fields for Motion Planning
- SQAP-VLA: A Synergistic Quantization-Aware Pruning Framework for High-Performance Vision-Language-Action Models
- Towards Confidential and Efficient LLM Inference with Dual Privacy Protection
- DP-FedLoRA: Privacy-Enhanced Federated Fine-Tuning for On-Device Large Language Models
- Character-Level Perturbations Disrupt LLM Watermarks
- Automated Classification of Tutors' Dialogue Acts Using Generative AI: A Case Study Using the CIMA Corpus
- ViRanker: A BGE-M3 & Blockwise Parallel Transformer Cross-Encoder for Vietnamese Reranking
- Objectness Similarity: Capturing Object-Level Fidelity in 3D Scene Evaluation
- Video Understanding by Design: How Datasets Shape Architectures and Insights
- OCELOT 2023: Cell Detection from Cell-Tissue Interaction Challenge
- HISPASpoof: A New Dataset For Spanish Speech Forensics
- A Knowledge Noise Mitigation Framework for Knowledge-based Visual Question Answering
- Target-oriented Multimodal Sentiment Classification with Counterfactual-enhanced Debiasing
- Adaptive Pareto-Optimal Token Merging for Edge Transformer Models in Semantic Communication
- EchoX: Towards Mitigating Acoustic-Semantic Gap via Echo Training for Speech-to-Speech LLMs
- Dark-ISP: Enhancing RAW Image Processing for Low-Light Object Detection
- Probing Pre-trained Language Models on Code Changes: Insights from ReDef, a High-Confidence Just-in-Time Defect Prediction Dataset
- On Integrating Large Language Models and Scenario-Based Programming for Improving Software Reliability
- Efficient Trie-based Biasing using K-step Prediction for Rare Word Recognition
- Improving Synthetic Data Training for Contextual Biasing Models with a Keyword-Aware Cost Function
- Bona fide Cross Testing Reveals Weak Spot in Audio Deepfake Detection Systems
- Incentivizing Safer Actions in Policy Optimization for Constrained Reinforcement Learning
- Deep opacity and AI: A threat to XAI and to privacy protection mechanisms
- Uncertainty Estimation using Variance-Gated Distributions
- Safe and Certifiable AI Systems: Concepts, Challenges, and Lessons Learned
- A vibe coding learning design to enhance EFL students' talking to, through, and about AI
- Multi Robot Coordination in Highly Dynamic Environments: Tackling Asymmetric Obstacles and Limited Communication
- Investigating Student Interaction Patterns with Large Language Model-Powered Course Assistants in Computer Science Courses
- Benchmarking Energy Efficiency of Large Language Models Using vLLM
- Recurrence Meets Transformers for Universal Multimodal Retrieval
- PromptGuard: An Orchestrated Prompting Framework for Principled Synthetic Text Generation for Vulnerable Populations using LLMs with Enhanced Safety, Fairness, and Controllability
- Instance-Optimal Matrix Multiplicative Weight Update and Its Quantum Applications
- Similarity-based Outlier Detection for Noisy Object Re-Identification Using Beta Mixtures
- Implicit Neural Representations of Intramyocardial Motion and Strain
- Open-sci-ref-0.01: open and reproducible reference baselines for language model and dataset comparison
- Can Vision-Language Models Solve Visual Math Equations?
- Personalized Sleep Prediction via Deep Adaptive Spatiotemporal Modeling and Sparse Data
- Envy-Free but Still Unfair: Envy-Freeness Up To One Item (EF-1) in Personalized Recommendation
- Stated Preference for Interaction and Continued Engagement (SPICE): Evaluating an LLM's Willingness to Re-engage in Conversation
- MoWE: A Mixture of Weather Experts
- A Scoping Review of Machine Learning Applications in Power System Protection and Disturbance Management
- Improving LLM Safety and Helpfulness using SFT and DPO: A Study on OPT-350M
- STRIDE: Scalable and Interpretable XAI via Subset-Free Functional Decomposition
- Instructional Prompt Optimization for Few-Shot LLM-Based Recommendations on Cold-Start Users
- Understanding Economic Tradeoffs Between Human and AI Agents in Bargaining Games
- Anti-Money Laundering Machine Learning Pipelines; A Technical Analysis on Identifying High-risk Bank Clients with Supervised Learning
- Mind Meets Space: Rethinking Agentic Spatial Intelligence from a Neuroscience-inspired Perspective
- ProgD: Progressive Multi-scale Decoding with Dynamic Graphs for Joint Multi-agent Motion Forecasting
- Enabling Regulatory Multi-Agent Collaboration: Architecture, Challenges, and Solutions
- Jupiter: Enhancing LLM Data Analysis Capabilities via Notebook and Inference-Time Value-Guided Search
- Fusing Knowledge and Language: A Comparative Study of Knowledge Graph-Based Question Answering with LLMs
- Tree-OPO: Off-policy Monte Carlo Tree-Guided Advantage Optimization for Multistep Reasoning
- LightAgent: Production-level Open-source Agentic AI Framework
- Explaining Tournament Solutions with Minimal Supports
- Measuring Implicit Spatial Coordination in Teams: Effects on Collective Intelligence and Performance
- Towards Adaptive ML Benchmarks: Web-Agent-Driven Construction, Domain Expansion, and Metric Optimization
- Curriculum-Based Multi-Tier Semantic Exploration via Deep Reinforcement Learning
- TORSO: Template-Oriented Reasoning Towards General Tasks
- Inteligencia Artificial jurídica y el desafío de la veracidad: análisis de alucinaciones, optimización de RAG y principios para una integración responsable
- SEDM: Scalable Self-Evolving Distributed Memory for Agents
- Compositional Concept Generalization with Variational Quantum Circuits
- Boosting Embodied AI Agents through Perception-Generation Disaggregation and Asynchronous Pipeline Execution
- The Illusion of Diminishing Returns: Measuring Long Horizon Execution in LLMs
- PerFairX: Is There a Balance Between Fairness and Personality in Large Language Model Recommendations?
- An Interval Type-2 Version of Bayes Theorem Derived from Interval Probability Range Estimates Provided by Subject Matter Experts
- Automated Unity Game Template Generation from GDDs via NLP and Multi-Modal LLMs
- Global Constraint LLM Agents for Text-to-Model Translation
- ForTIFAI: Fending Off Recursive Training Induced Failure for AI Models
- Uncertainty Awareness and Trust in Explainable AI - On Trust Calibration using Local and Global Explanations
Research Sources: 363 | Generated: 9/12/2025