AI RESEARCH PAPERS & ACADEMIC SOURCES
- Signal-Based Malware Classification Using 1D CNNs
- Missing Fine Details in Images: Last Seen in High Frequencies
- Don't Splat your Gaussians: Volumetric Ray-Traced Primitives for Modeling and Rendering Scattering and Emissive Media
- VMGNet: A Low Computational Complexity Robotic Grasping Network Based on VMamba with Multi-Scale Feature Fusion
- PINGS: Gaussian Splatting Meets Distance Fields within a Point-Based Implicit Neural Map
- BEAM: Bridging Physically-based Rendering and Gaussian Modeling for Relightable Volumetric Video
- GraspCoT: Integrating Physical Property Reasoning for 6-DoF Grasping under Flexible Language Instructions
- Semi-SMD: Semi-Supervised Metric Depth Estimation via Surrounding Cameras for Autonomous Driving
- IntuiTF: MLLM-Guided Transfer Function Optimization for Direct Volume Rendering
- Nonparametric Envelopes for Flexible Response Reduction
- Bayesian Pliable Lasso with Horseshoe Prior for Interaction Effects in GLMs with Missing Responses
- Physics-informed low-rank neural operators with application to parametric elliptic PDEs
- Feature Understanding and Sparsity Enhancement via 2-Layered kernel machines (2L-FUSE)
- Expected Signature Kernels for L\'evy Rough Paths
- Counterfactual Cocycles: A Framework for Robust and Coherent Counterfactual Transports
- Universality of High-Dimensional Logistic Regression and a Novel CGMT under Dependence with Applications to Data Augmentation
- Frequency Domain Enhanced U-Net for Low-Frequency Information-Rich Image Segmentation in Surgical and Deep-Sea Exploration Robots
- Evaluation of Alignment-Regularity Characteristics in Deformable Image Registration
- Large-scale Pre-training for Grounded Video Caption Generation
- A Decade of Wheat Mapping for Lebanon
- RealRep: Generalized SDR-to-HDR Conversion via Attribute-Disentangled Representation Learning
- SAMba-UNet: SAM2-Mamba UNet for Cardiac MRI in Medical Robotic Perception
- PATS: Proficiency-Aware Temporal Sampling for Multi-View Sports Skill Assessment
- SPACE-iT: Spatial-Aware Curriculum Exploration and Feedback-Driven Adaptive Augmentation for Vision Transformer Distillation
- Interpretable Text-Guided Image Clustering via Iterative Search
- Atomizer: Generalizing to new modalities by breaking satellite images down to a set of scalars
- Large Language Models for Crash Detection in Video: A Survey of Methods, Datasets, and Challenges
- HieraRS: A Hierarchical Segmentation Paradigm for Remote Sensing Enabling Multi-Granularity Interpretation and Cross-Domain Transfer
- $\pi^3$: Permutation-Equivariant Visual Geometry Learning
- SplatFill: 3D Scene Inpainting via Depth-Guided Gaussian Splatting
- Point Linguist Model: Segment Any Object via Bridged Large 3D-Language Model
- D-LEAF: Localizing and Correcting Hallucinations in Multimodal LLMs via Layer-to-head Attention Diagnostics
- Object-level Correlation for Few-Shot Segmentation
- ScoreHOI: Physically Plausible Reconstruction of Human-Object Interaction via Score-Guided Diffusion
- Dynamic Scene 3D Reconstruction of an Uncooperative Resident Space Object
- Feature Space Analysis by Guided Diffusion Model
- One View, Many Worlds: Single-Image to 3D Object Meets Generative Domain Randomization for One-Shot 6D Pose Estimation
- Visual Representation Alignment for Multimodal Large Language Models
- A smart fridge with AI-enabled food computing
- Neural Cone Radiosity for Interactive Global Illumination with Glossy Materials
- Understanding Ice Crystal Habit Diversity with Self-Supervised Learning
- A Challenging Benchmark of Anime Style Recognition
- SGCNeRF: Few-Shot Neural Rendering via Sparse Geometric Consistency Guidance
- TractGraphFormer: Anatomically Informed Hybrid Graph CNN-Transformer Network for Interpretable Sex and Age Prediction from Diffusion MRI Tractography
- MSCPT: Few-shot Whole Slide Image Classification with Multi-scale and Context-focused Prompt Tuning
- InteractPro: A Unified Framework for Motion-Aware Image Composition
- Self Supervised Networks for Learning Latent Space Representations of Human Body Scans and Motions
- OOD-SEG: Exploiting out-of-distribution detection techniques for learning image segmentation from sparse multi-class positive-only annotations
- Decoupled Sparse Priors Guided Diffusion Compression Model for Point Clouds
- Texture- and Shape-based Adversarial Attacks for Overhead Image Vehicle Detection
- Detect Changes like Humans: Incorporating Semantic Priors for Improved Change Detection
- DreamLifting: A Plug-in Module Lifting MV Diffusion Models for 3D Asset Generation
- In the Eye of MLLM: Benchmarking Egocentric Video Intent Understanding with Gaze-Guided Prompting
- XOCT: Enhancing OCT to OCTA Translation via Cross-Dimensional Supervised Multi-Scale Feature Learning
- ANYPORTAL: Zero-Shot Consistent Video Background Replacement
- LINR Bridge: Vector Graphic Animation via Neural Implicits and Video Diffusion Priors
- DiGS: Accurate and Complete Surface Reconstruction from 3D Gaussians via Direct SDF Learning
- MVAT: Multi-View Aware Teacher for Weakly Supervised 3D Object Detection
- Universal Few-Shot Spatial Control for Diffusion Models
- TextlessRAG: End-to-End Visual Document RAG by Speech Without Text
- PanoLAM: Large Avatar Model for Gaussian Full-Head Synthesis from One-shot Unposed Image
- Temporal Image Forensics: A Review and Critical Evaluation
- Bias in Gender Bias Benchmarks: How Spurious Features Distort Evaluation
- Data-Efficient Fine-Tuning of Vision-Language Models for Diagnosis of Alzheimer's Disease
- Self-Supervised Cross-Encoder for Neurodegenerative Disease Diagnosis
- Semantic Watermarking Reinvented: Enhancing Robustness and Generation Quality with Fourier Integrity
- Beyond Motion Cues and Structural Sparsity: Revisiting Small Moving Target Detection
- EDFFDNet: Towards Accurate and Efficient Unsupervised Multi-Grid Image Registration
- SEEC: Segmentation-Assisted Multi-Entropy Models for Learned Lossless Image Compression
- HairGS: Hair Strand Reconstruction based on 3D Gaussian Splatting
- RayGaussX: Accelerating Gaussian-Based Ray Marching for Real-Time and High-Quality Novel View Synthesis
- Faster, Self-Supervised Super-Resolution for Anisotropic Multi-View MRI Using a Sparse Coordinate Loss
- Understanding Museum Exhibits using Vision-Language Reasoning
- Personalized Attacks of Social Engineering in Multi-turn Conversations: LLM Agents for Simulation and Detection
- FedAPT: Federated Adversarial Prompt Tuning for Vision-Language Models
- Geospatial Foundational Embedder: Top-1 Winning Solution on EarthVision Embed2Scale Challenge (CVPR 2025)
- K-Syn: K-space Data Synthesis in Ultra Low-data Regimes
- Enhancing Classification of Streaming Data with Image Distillation
- Faster VGGT with Block-Sparse Global Attention
- Detection and Recovery of Adversarial Slow-Pose Drift in Offloaded Visual-Inertial Odometry
- Realism to Deception: Investigating Deepfake Detectors Against Face Enhancement
- G3CN: Gaussian Topology Refinement Gated Graph Convolutional Network for Skeleton-Based Action Recognition
- Parse Graph-Based Visual-Language Interaction for Human Pose Estimation
- Dual Knowledge-Enhanced Two-Stage Reasoner for Multimodal Dialog Systems
- Are Humans as Brittle as Large Language Models?
- From Detection to Mitigation: Addressing Gender Bias in Chinese Texts via Efficient Tuning and Voting-Based Rebalancing
- Biased Tales: Cultural and Topic Bias in Generating Children's Stories
- SimpleQA Verified: A Reliable Factuality Benchmark to Measure Parametric Knowledge
- Parallel-R1: Towards Parallel Thinking via Reinforcement Learning
- VLMs-in-the-Wild: Bridging the Gap Between Academic Benchmarks and Enterprise Reality
- Neurocognitive Modeling for Text Generation: Deep Learning Architecture for EEG Data
- GLEAM: Learning to Match and Explain in Cross-View Geo-Localization
- Visual-TableQA: Open-Domain Benchmark for Reasoning over Table Images
- UPLex: Fine-Grained Personality Control in Large Language Models via Unsupervised Lexical Modulation
- Dovetail: A CPU/GPU Heterogeneous Speculative Decoding for LLM inference
- M-ABSA: A Multilingual Dataset for Aspect-Based Sentiment Analysis
- MEBench: Benchmarking Large Language Models for Cross-Document Multi-Entity Question Answering
- When Large Language Models Meet Speech: A Survey on Integration Approaches
- Register Always Matters: Analysis of LLM Pretraining Data Through the Lens of Language Variation
- Beyond One-Size-Fits-All: Inversion Learning for Highly Effective NLG Evaluation Prompts
- A Japanese Language Model and Three New Evaluation Benchmarks for Pharmaceutical NLP
- FinRAGBench-V: A Benchmark for Multimodal RAG with Visual Citation in the Financial Domain
- Multimodal Emotion Recognition in Conversations: A Survey of Methods, Trends, Challenges and Prospects
- Are Economists Always More Introverted? Analyzing Consistency in Persona-Assigned LLMs
- Debatable Intelligence: Benchmarking LLM Judges via Debate Speech Evaluation
- GCN-Driven Reinforcement Learning for Probabilistic Real-Time Guarantees in Industrial URLLC
- Re-Bottleneck: Latent Re-Structuring for Neural Audio Autoencoders
- MedBench-IT: A Comprehensive Benchmark for Evaluating Large Language Models on Italian Medical Entrance Examinations
- The ML-SUPERB 2.0 Challenge: Towards Inclusive ASR Benchmarking for All Language Varieties
- Towards EnergyGPT: A Large Language Model Specialized for the Energy Sector
- Rule-Based Moral Principles for Explaining Uncertainty in Natural Language Generation
- PersonaFuse: A Personality Activation-Driven Framework for Enhancing Human-LLM Interactions
- The Role of Exploration Modules in Small Language Models for Knowledge Graph Question Answering
- LongEmotion: Measuring Emotional Intelligence of Large Language Models in Long-Context Interaction
- AIxcellent Vibes at GermEval 2025 Shared Task on Candy Speech Detection: Improving Model Performance by Span-Level Training
- Understanding Stigmatizing Language Lexicons: A Comparative Analysis in Clinical Contexts
- From Scarcity to Efficiency: Investigating the Effects of Data Augmentation on African Machine Translation
- VeriOS: Query-Driven Proactive Human-Agent-GUI Interaction for Trustworthy OS Agents
- MaLei at MultiClinSUM: Summarisation of Clinical Documents using Perspective-Aware Iterative Self-Prompting with LLMs
- MoLoRAG: Bootstrapping Document Understanding via Multi-modal Logic-aware Retrieval
- M-BRe: Discovering Training Samples for Relation Extraction from Unlabeled Texts with Large Language Models
- Factuality Beyond Coherence: Evaluating LLM Watermarking Methods for Medical Texts
- SciNLP: A Domain-Specific Benchmark for Full-Text Scientific Entity and Relation Extraction in NLP
- Closed-Loop Unsupervised Representation Disentanglement with $\beta$-VAE Distillation and Diffusion Probabilistic Feedback
- Prepared for the Worst: A Learning-Based Adversarial Attack for Resilience Analysis of the ICP Algorithm
- JoPA:Explaining Large Language Model's Generation via Joint Prompt Attribution
- Improving the Estimation of Lifetime Effects in A/B Testing via Treatment Locality
- Explainable Metrics for the Assessment of Neurodegenerative Diseases through Handwriting Analysis
- PnP-Flow: Plug-and-Play Image Restoration with Flow Matching
- Generalizable Humanoid Manipulation with 3D Diffusion Policies
- A Data-Free Analytical Quantization Scheme for Deep Learning Models
- Improved Physics-informed neural networks loss function regularization with a variance-based term
- Efficient Deep Learning-based Forward Solvers for Brain Tumor Growth Models
- Matrix Completion in Group Testing: Bounds and Simulations
- MEMIT-Merge: Addressing MEMIT's Key-Value Conflicts in Same-Subject Batch Editing for LLMs
- FilterRAG: Zero-Shot Informed Retrieval-Augmented Generation to Mitigate Hallucinations in VQA
- Local Normalization Distortion and the Thermodynamic Formalism of Decoding Strategies for Large Language Models
- SemCAFE: When Named Entities make the Difference Assessing Web Source Reliability through Entity-level Analytics
- Analytic theory of dropout regularization
- Inexact Column Generation for Bayesian Network Structure Learning via Difference-of-Submodular Optimization
- Learning to Upsample and Upmix Audio in the Latent Domain
- MEMOIR: Lifelong Model Editing with Minimal Overwrite and Informed Retention for LLMs
- Convergence of Momentum-Based Optimization Algorithms with Time-Varying Parameters
- Active Learning of Piecewise Gaussian Process Surrogates
- FilterFL: Knowledge Filtering-based Data-Free Backdoor Defense for Federated Learning
- Efficient Methods for Non-stationary Online Learning
- On the Benefits of Public Representations for Private Transfer Learning under Distribution Shift
- CoMMIT: Coordinated Multimodal Instruction Tuning
- BioNeMo Framework: a modular, high-performance library for AI model development in drug discovery
- Hybrid-Regularized Magnitude Pruning for Robust Federated Learning under Covariate Shift
- When Do Neural Networks Learn World Models?
- Contrastive MIM: A Contrastive Mutual Information Framework for Unified Generative and Discriminative Representation Learning
- SynLlama: Generating Synthesizable Molecules and Their Analogs with Large Language Models
- Highly Efficient Direct Analytics on Semantic-aware Time Series Data Compression
- M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models
- Scalable Autoregressive 3D Molecule Generation
- Closing the Gap between TD Learning and Supervised Learning with $Q$-Conditioned Maximization
- Equivariant U-Shaped Neural Operators for the Cahn-Hilliard Phase-Field Model
- Bootstrapping Task Spaces for Self-Improvement
- Time-Varying Graph Learning with Constraints on Graph Temporal Variation
- Challenging Bug Prediction and Repair Models with Synthetic Bugs
- LLM Analysis of 150+ years of German Parliamentary Debates on Migration Reveals Shift from Post-War Solidarity to Anti-Solidarity in the Last Decade
- Kernel VICReg for Self-Supervised Learning in Reproducing Kernel Hilbert Space
- Identifying Neural Signatures from fMRI using Hybrid Principal Components Regression
- Causal Attention with Lookahead Keys
- Instance-level Performance Prediction for Long-form Generation Tasks
- Reinforcement learning for online hyperparameter tuning in convex quadratic programming
- Synthetic Data Generation with Lorenzetti for Time Series Anomaly Detection in High-Energy Physics Calorimeters
- MedicalPatchNet: A Patch-Based Self-Explainable AI Architecture for Chest X-ray Classification
- RINO: Renormalization Group Invariance with No Labels
- Asynchronous Gossip Algorithms for Rank-Based Statistical Methods
- Exploring System Adaptations For Minimum Latency Real-Time Piano Transcription
- Neural Proxies for Sound Synthesizers: Learning Perceptually Informed Preset Representations
- Nearest Neighbor Projection Removal Adversarial Training
- CAViAR: Critic-Augmented Video Agentic Reasoning
- Building causation links in stochastic nonlinear systems from data
- Toward Quantum Utility in Finance: A Robust Data-Driven Algorithm for Asset Clustering
- Quantum Computing for Large-scale Network Optimization: Opportunities and Challenges
- Decentralized Online Riemannian Optimization Beyond Hadamard Manifolds
- Nuclear Data Adjustment for Nonlinear Applications in the OECD/NEA WPNCS SG14 Benchmark -- A Bayesian Inverse UQ-based Approach for Data Assimilation
- Smart Fast Finish: Preventing Overdelivery via Daily Budget Pacing at DoorDash
- Guided Reasoning in LLM-Driven Penetration Testing Using Structured Attack Trees
- RaC: Robot Learning for Long-Horizon Tasks by Scaling Recovery and Correction
- Bio-KGvec2go: Serving up-to-date Dynamic Biomedical Knowledge Graph Embeddings
- One Model for All Tasks: Leveraging Efficient World Models in Multi-Task Planning
- Customizing the Inductive Biases of Softmax Attention using Structured Matrices
- Theoretical Analysis on how Learning Rate Warmup Accelerates Convergence
- Toric geometry of ReLU neural networks
- DIET-CP: Lightweight and Data Efficient Self Supervised Continued Pretraining
- veScale: Consistent and Efficient Tensor Programming with Eager-Mode SPMD
- Private Queries with Sigma-Counting
- Physics-Guided Diffusion Transformer with Spherical Harmonic Posterior Sampling for High-Fidelity Angular Super-Resolution in Diffusion MRI
- TGLF-SINN: Deep Learning Surrogate Model for Accelerating Turbulent Transport Modeling in Fusion
- A Quantum Bagging Algorithm with Unsupervised Base Learners for Label Corrupted Datasets
- PUUMA (Placental patch and whole-Uterus dual-branch U-Mamba-based Architecture): Functional MRI Prediction of Gestational Age at Birth and Preterm Risk
- SAM$^{*}$: Task-Adaptive SAM with Physics-Guided Rewards
- End-to-End Efficiency in Keyword Spotting: A System-Level Approach for Embedded Microcontrollers
- Sequentially Auditing Differential Privacy
- ADHAM: Additive Deep Hazard Analysis Mixtures for Interpretable Survival Regression
- NestGNN: A Graph Neural Network Framework Generalizing the Nested Logit Model for Travel Mode Choice
- Avoiding Over-Personalization with Rule-Guided Knowledge Graph Adaptation for LLM Recommendations
- Beyond Sequential Reranking: Reranker-Guided Search Improves Reasoning Intensive Retrieval
- Dimensionally Reduced Open-World Clustering: DROWCULA
- Predicting person-level injury severity using crash narratives: A balanced approach with roadway classification and natural language process techniques
- Addressing the Cold-Start Problem for Personalized Combination Drug Screening
- Leveraging Support Vector Regression for Outcome Prediction in Personalized Ultra-fractionated Stereotactic Adaptive Radiotherapy
- A Survey of Graph Neural Networks for Drug Discovery: Recent Developments and Challenges
- Feasibility of In-Ear Single-Channel ExG for Wearable Sleep~Monitoring in Real-World Settings
- A Modular Algorithm for Non-Stationary Online Convex-Concave Optimization
- IP-Basis PINNs: Efficient Multi-Query Inverse Parameter Estimation
- GCond: Gradient Conflict Resolution via Accumulation-based Stabilization for Large-Scale Multi-Task Learning
- Learning Generalized Hamiltonian Dynamics with Stability from Noisy Trajectory Data
- CancerGUIDE: Cancer Guideline Understanding via Internal Disagreement Estimation
- FedTeddi: Temporal Drift and Divergence Aware Scheduling for Timely Federated Edge Learning
- EfficientNet in Digital Twin-based Cardiac Arrest Prediction and Analysis
- EMORF-II: Adaptive EM-based Outlier-Robust Filtering with Correlated Measurement Noise
- Conv4Rec: A 1-by-1 Convolutional AutoEncoder for User Profiling through Joint Analysis of Implicit and Explicit Feedbacks
- RoseCDL: Robust and Scalable Convolutional Dictionary Learning for Rare-event Detection
- uGMM-NN: Univariate Gaussian Mixture Model Neural Network
- Homogenization with Guaranteed Bounds via Primal-Dual Physically Informed Neural Networks
- K2-Think: A Parameter-Efficient Reasoning System
- Graph-based Integrated Gradients for Explaining Graph Neural Networks
- FUnc-SNE: A flexible, Fast, and Unconstrained algorithm for neighbour embeddings
- IBN: An Interpretable Bidirectional-Modeling Network for Multivariate Time Series Forecasting with Variable Missing
- MoE-Compression: How the Compression Error of Experts Affects the Inference Accuracy of MoE Model?
- GRADA: Graph-based Reranking against Adversarial Documents Attack
- Overflow Prevention Enhances Long-Context Recurrent LLMs
- Visuospatial Cognitive Assistant
- Towards Visuospatial Cognition via Hierarchical Fusion of Visual Experts
- Is Your LLM Overcharging You? Tokenization, Transparency, and Incentives
- SCIZOR: A Self-Supervised Approach to Data Curation for Large-Scale Imitation Learning
- Multi-output Classification using a Cross-talk Architecture for Compound Fault Diagnosis of Motors in Partially Labeled Condition
- Localizing Persona Representations in LLMs
- Understanding Behavioral Metric Learning: A Large-Scale Study on Distracting Reinforcement Learning Environments
- HueManity: Probing Fine-Grained Visual Perception in MLLMs
- From Images to Insights: Explainable Biodiversity Monitoring with Plain Language Habitat Explanations
- Language Models Might Not Understand You: Evaluating Theory of Mind via Story Prompting
- A Kriging-HDMR-based surrogate model with sample pool-free active learning strategy for reliability analysis
- Machine Generalize Learning in Agent-Based Models: Going Beyond Surrogate Models for Calibration in ABMs
- Recursive State Inference for Linear PASFA
- Benchmarking Vision Transformers and CNNs for Thermal Photovoltaic Fault Detection with Explainable AI Validation
- Of Graphs and Tables: Zero-Shot Node Classification with Tabular Foundation Models
- PLaID++: A Preference Aligned Language Model for Targeted Inorganic Materials Design
- Fed-REACT: Federated Representation Learning for Heterogeneous and Evolving Data
- Predicting effect of novel treatments using molecular pathways and real-world data
- CTourLLM: Enhancing LLMs with Chinese Tourism Knowledge
- Solving Truly Massive Budgeted Monotonic POMDPs with Oracle-Guided Meta-Reinforcement Learning
- TokenSelect: Efficient Long-Context Inference and Length Extrapolation for LLMs via Dynamic Token-Level KV Cache Selection
- Automatically Detecting Online Deceptive Patterns
- TrojanRobot: Physical-world Backdoor Attacks Against VLM-based Robotic Manipulation
- Cardiverse: Harnessing LLMs for Novel Card Game Prototyping
- VINP: Variational Bayesian Inference with Neural Speech Prior for Joint ASR-Effective Speech Dereverberation and Blind RIR Identification
- Robust Adaptation of Large Multimodal Models for Retrieval Augmented Hateful Meme Detection
- MIRROR: Multi-Modal Pathological Self-Supervised Representation Learning via Modality Alignment and Retention
- DistJoin: A Decoupled Join Cardinality Estimator based on Adaptive Neural Predicate Modulation
- Involution and BSConv Multi-Depth Distillation Network for Lightweight Image Super-Resolution
- The Model Hears You: Audio Language Model Deployments Should Consider the Principle of Least Privilege
- Audio-centric Video Understanding Benchmark without Text Shortcut
- Enhancing Traffic Incident Response through Sub-Second Temporal Localization with HybridMamba
- Llama-Nemotron: Efficient Reasoning Models
- Unlearning vs. Obfuscation: Are We Truly Removing Knowledge?
- Comparative Analysis of Lightweight Deep Learning Models for Memory-Constrained Devices
- OBLIVIATE: Robust and Practical Machine Unlearning for Large Language Models
- Multimodal Contrastive Pretraining of CBCT and IOS for Enhanced Tooth Segmentation
- GENUINE: Graph Enhanced Multi-level Uncertainty Estimation for Large Language Models
- Accelerating Local AI on Consumer GPUs: A Hardware-Aware Dynamic Strategy for YOLOv10s
- Breaking Android with AI: A Deep Dive into LLM-Powered Exploitation
- ImportSnare: Directed "Code Manual" Hijacking in Retrieval-Augmented Code Generation
- Bringing Multi-Modal Multi-Task Federated Foundation Models to Education Domain: Prospects and Challenges
- ACE and Diverse Generalization via Selective Disagreement
- Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search
- Self-Emotion-Mediated Exploration in Artificial Intelligence Mirrors: Findings from Cognitive Psychology
- Understanding the Language Model to Solve the Symbolic Multi-Step Reasoning Problem from the Perspective of Buffer Mechanism
- COMMA: A Communicative Multimodal Multi-Agent Benchmark
- Visualizing Thought: Conceptual Diagrams Enable Robust Combinatorial Planning in LMMs
- Automatic Reward Shaping from Confounded Offline Data
- GeoChain: Multimodal Chain-of-Thought for Geographic Reasoning
- Addition in Four Movements: Mapping Layer-wise Information Trajectories in LLMs
- MedGellan: LLM-Generated Medical Guidance to Support Physicians
- Attention Maps in 3D Shape Classification for Dental Stage Estimation with Class Node Graph Attention Networks
- BALI: Enhancing Biomedical Language Representations through Knowledge Graph and Language Model Alignment
- Can SSD-Mamba2 Unlock Reinforcement Learning for End-to-End Motion Control?
- Transformer-Based Approach to Optimal Sensor Placement for Structural Health Monitoring of Probe Cards
- Beyond Rebalancing: Benchmarking Binary Classifiers Under Class Imbalance Without Rebalancing Techniques
- From Classical Data to Quantum Advantage -- Quantum Policy Evaluation on Quantum Hardware
- Variational Quantum Circuits in Offline Contextual Bandit Problems
- Spectral Masking and Interpolation Attack (SMIA): A Black-box Adversarial Attack against Voice Authentication and Anti-Spoofing Systems
- Enhancing Online Learning by Integrating Biosensors and Multimodal Learning Analytics for Detecting and Predicting Student Behavior: A Review
- Spectral and Rhythm Feature Performance Evaluation for Category and Class Level Audio Classification with Deep Convolutional Neural Networks
- What Were You Thinking? An LLM-Driven Large-Scale Study of Refactoring Motivations in Open-Source Projects
- Are LLMs Enough for Hyperpartisan, Fake, Polarized and Harmful Content Detection? Evaluating In-Context Learning vs. Fine-Tuning
- XSRD-Net: EXplainable Stroke Relapse Detection
- Individual utilities of life satisfaction reveal inequality aversion unrelated to political alignment
- Enhanced SegNet with Integrated Grad-CAM for Interpretable Retinal Layer Segmentation in OCT Images
- Forecasting Russian Equipment Losses Using Time Series and Deep Learning Models
- Small Open Models Achieve Near Parity with Large Models in Low Resource Literary Translation at a Fraction of the Cost
- Deep Learning-Based Burned Area Mapping Using Bi-Temporal Siamese Networks and AlphaEarth Foundation Datasets
- Active Membership Inference Test (aMINT): Enhancing Model Auditability with Multi-Task Learning
- Uncovering Scaling Laws for Large Language Models via Inverse Problems
- Talking with Oompa Loompas: A novel framework for evaluating linguistic acquisition of LLM agents
- Hybrid GCN-GRU Model for Anomaly Detection in Cryptocurrency Transactions
- Toward Lifelong-Sustainable Electronic-Photonic AI Systems via Extreme Efficiency, Reconfigurability, and Robustness
- Benchmarking Universal Interatomic Potentials on Zeolite Structures
- The Choice of Divergence: A Neglected Key to Mitigating Diversity Collapse in Reinforcement Learning with Verifiable Reward
- Text2Touch: Tactile In-Hand Manipulation with LLM-Designed Reward Functions
- Bias-Aware Machine Unlearning: Towards Fairer Vision Models via Controllable Forgetting
- DepthVision: Robust Vision-Language Understanding through GAN-Based LiDAR-to-RGB Synthesis
- HALT-RAG: A Task-Adaptable Framework for Hallucination Detection with Calibrated NLI Ensembles and Abstention
- Fine-Tuning Vision-Language Models for Visual Navigation Assistance
- Generating Transferrable Adversarial Examples via Local Mixing and Logits Optimization for Remote Sensing Object Recognition
- Astra: A Multi-Agent System for GPU Kernel Performance Optimization
- ALLabel: Three-stage Active Learning for LLM-based Entity Recognition using Demonstration Retrieval
- Water Demand Forecasting of District Metered Areas through Learned Consumer Representations
- EHWGesture -- A dataset for multimodal understanding of clinical gestures
- Competitive Audio-Language Models with Data-Efficient Single-Stage Training on Public Data
- FLeW: Facet-Level and Adaptive Weighted Representation Learning of Scientific Documents
- HU-based Foreground Masking for 3D Medical Masked Image Modeling
- Avoiding Knowledge Edit Skipping in Multi-hop Question Answering with Guided Decomposition
- $\Delta L$ Normalization: Rethink Loss Aggregation in RLVR
- Towards Generalized Routing: Model and Agent Orchestration for Adaptive and Efficient Inference
- A multi-strategy improved gazelle optimization algorithm for solving numerical optimization and engineering applications
- XBusNet: Text-Guided Breast Ultrasound Segmentation via Multimodal Vision-Language Learning
- Explaining How Quantization Disparately Skews a Model
- A transformer-based generative model for planetary systems
- Breaking the Conventional Forward-Backward Tie in Neural Networks: Activation Functions
- Systematic Optimization of Open Source Large Language Models for Mathematical Reasoning
- Benchmarking Information Retrieval Models on Complex Retrieval Tasks
- Datasets for Navigating Sensitive Topics in Recommendation Systems
- Breast Cancer Detection in Thermographic Images via Diffusion-Based Augmentation and Nonlinear Feature Fusion
- ALICE: An Interpretable Neural Architecture for Generalization in Substitution Ciphers
- Paladin: Defending LLM-enabled Phishing Emails with a New Trigger-Tag Paradigm
- zkUnlearner: A Zero-Knowledge Framework for Verifiable Unlearning with Multi-Granularity and Forgery-Resistance
- Reconstruction Alignment Improves Unified Multimodal Models
- Basis Vector Metric: A Method for Robust Open-Ended State Change Detection
- Does This Look Familiar to You? Knowledge Analysis via Model Internal Representations
- MEGG: Replay via Maximally Extreme GGscore in Incremental Learning for Neural Recommendation Models
- Mitigating Attention Localization in Small Scale: Self-Attention Refinement via One-step Belief Propagation
- DEPF: A UAV Multispectral Object Detector with Dual-Domain Enhancement and Priority-Guided Mamba Fusion
- General Demographic Foundation Models for Enhancing Predictive Performance Across Diseases
- Word2Spike: Poisson Rate Coding for Associative Memories and Neuromorphic Algorithms
- SBS: Enhancing Parameter-Efficiency of Neural Representations for Neural Networks via Spectral Bias Suppression
- Random Forest Stratified K-Fold Cross Validation on SYN DoS Attack SD-IoV
- An efficient deep reinforcement learning environment for flexible job-shop scheduling
- MEGS$^{2}$: Memory-Efficient Gaussian Splatting via Spherical Gaussians and Unified Pruning
- Preventing Another Tessa: Modular Safety Middleware For Health-Adjacent AI Assistants
- 1 bit is all we need: binary normalized neural networks
- Contradictions
- Moment- and Power-Spectrum-Based Gaussianity Regularization for Text-to-Image Models
- The Impact of Artificial Intelligence on Traditional Art Forms: A Disruption or Enhancement
- A Minimalist Bayesian Framework for Stochastic Optimization
- A Maslow-Inspired Hierarchy of Engagement with AI Model
- Methodological Insights into Structural Causal Modelling and Uncertainty-Aware Forecasting for Economic Indicators
- Controllable Singing Voice Synthesis using Phoneme-Level Energy Sequence
- Automated Evaluation of Gender Bias Across 13 Large Multimodal Models
- Lookup multivariate Kolmogorov-Arnold Networks
- Riemannian Batch Normalization: A Gyro Approach
- SVGauge: Towards Human-Aligned Evaluation for SVG Generation
- SoK: Security and Privacy of AI Agents for Blockchain
- Adversarial Attacks on Audio Deepfake Detection: A Benchmark and Comparative Study
- Toward Purpose-oriented Topic Model Evaluation enabled by Large Language Models
- Measuring Uncertainty in Transformer Circuits with Effective Information Consistency
- DischargeSim: A Simulation Benchmark for Educational Doctor-Patient Communication at Discharge
- Evaluation of Machine Learning Reconstruction Techniques for Accelerated Brain MRI Scans
- Cross-device Zero-shot Label Transfer via Alignment of Time Series Foundation Model Embeddings
- Cross-field SNR Analysis and Tensor Channel Estimation for Multi-UAV Near-field Communications
- Deep Learning-based Techniques for Integrated Sensing and Communication Systems: State-of-the-Art, Challenges, and Opportunities
- Association of Timing and Duration of Moderate-to-Vigorous Physical Activity with Cognitive Function and Brain Aging: A Population-Based Study Using the UK Biobank
- Impact of Neuron Models on Spiking Neural Networks performance. A Complexity Based Classification Approach
- Individualized and Interpretable Sleep Forecasting via a Two-Stage Adaptive Spatial-Temporal Model
- GSTBench: A Benchmark Study on the Transferability of Graph Self-Supervised Learning
- A Knowledge-Guided Cross-Modal Feature Fusion Model for Local Traffic Demand Prediction
- Toward Reproducible Cross-Backend Compatibility for Deep Learning: A Configuration-First Framework with Three-Tier Verification
- Exploring Over-stationarization in Deep Learning-based Bus/Tram Arrival Time Prediction: Analysis and Non-stationary Effect Recovery
- RLFactory: A Plug-and-Play Reinforcement Learning Post-Training Framework for LLM Multi-Turn Tool-Use
- CARE: Decoding Time Safety Alignment via Rollback and Introspection Intervention
- FediLoRA: Heterogeneous LoRA for Federated Multimodal Fine-tuning under Missing Modalities
- CellPainTR: Generalizable Representation Learning for Cross-Dataset Cell Painting Analysis
- FusWay: Multimodal hybrid fusion approach. Application to Railway Defect Detection
- Frustratingly Easy Feature Reconstruction for Out-of-Distribution Detection
- The Protocol Genome A Self Supervised Learning Framework from DICOM Headers
- Visible Yet Unreadable: A Systematic Blind Spot of Vision Language Models Across Writing Systems
- Not All Splits Are Equal: Rethinking Attribute Generalization Across Unrelated Categories
- ArGen: Auto-Regulation of Generative AI via GRPO and Policy-as-Code
- Computational Concept of the Psyche
- Human-in-the-Loop: Quantitative Evaluation of 3D Models Generation by Large Language Models
- Performative Thinking? The Brittle Correlation Between CoT Length and Problem Complexity
- Autonomous Code Evolution Meets NP-Completeness
- Language Self-Play For Data-Free Training
- SheetDesigner: MLLM-Powered Spreadsheet Layout Generation with Rule-Based and Vision-Based Reflection
- Towards explainable decision support using hybrid neural models for logistic terminal automation
- Transferable Direct Prompt Injection via Activation-Guided MCMC Sampling
- Getting In Contract with Large Language Models -- An Agency Theory Perspective On Large Language Model Alignment
- DeepGraphLog for Layered Neurosymbolic AI
- Unleashing the True Potential of LLMs: A Feedback-Triggered Self-Correction with Long-Term Multipath Decoding
- FHIR-RAG-MEDS: Integrating HL7 FHIR with Retrieval-Augmented Large Language Models for Enhanced Medical Decision Support
- RIMO: An Easy-to-Evaluate, Hard-to-Solve Olympiad Benchmark for Advanced Mathematical Reasoning
- BDPM: A Machine Learning-Based Feature Extractor for Parkinson's Disease Classification via Gut Microbiota Analysis
- The Carbon Footprint Wizard: A Knowledge-Augmented AI Interface for Streamlining Food Carbon Footprint Analysis
- Certainty-Guided Reasoning in Large Language Models: A Dynamic Thinking Budget Approach
- Aligning LLMs for the Classroom with Knowledge-Based Retrieval -- A Comparative RAG Study
- SCoder: Iterative Self-Distillation for Bootstrapping Small-Scale Data Synthesizers to Empower Code LLMs
- CP-Model-Zoo: A Natural Language Query System for Constraint Programming Models
- HiPhO: How Far Are (M)LLMs from Humans in the Latest High School Physics Olympiad Benchmark?
- Probing the Preferences of a Language Model: Integrating Verbal and Behavioral Tests of AI Welfare
- Estimating forest carbon stocks from high-resolution remote sensing imagery by reducing domain shift with style transfer
- VoltanaLLM: Feedback-Driven Frequency Control and State-Space Routing for Energy-Efficient LLM Serving
- Renewable Energy Sources Selection Analysis with the Maximizing Deviation Method
- From Eigenmodes to Proofs: Integrating Graph Spectral Operators with Symbolic Interpretable Reasoning
- Statistical Methods in Generative AI
- Instruction Agent: Enhancing Agent with Expert Demonstration
- Neuro-Symbolic Frameworks: Conceptual Characterization and Empirical Comparative Analysis
- Autoencoder-Based Denoising of Muscle Artifacts in ECG to Preserve Skin Nerve Activity (SKNA) for Cognitive Stress Detection
- PaVeRL-SQL: Text-to-SQL via Partial-Match Rewards and Verbal Reinforcement Learning
- That's So FETCH: Fashioning Ensemble Techniques for LLM Classification in Civil Legal Intake and Referral
- A Hybrid CNN-LSTM Deep Learning Model for Intrusion Detection in Smart Grid
- BlendedNet: A Blended Wing Body Aircraft Dataset and Surrogate Model for Aerodynamic Predictions
- OmniAcc: Personalized Accessibility Assistant Using Generative AI
- HealthSLM-Bench: Benchmarking Small Language Models for Mobile and Wearable Healthcare Monitoring
Research Sources: 418 | Generated: 9/10/2025