AI RESEARCH PAPERS & ACADEMIC SOURCES
- TR-Gaussians: High-fidelity Real-time Rendering of Planar Transmission and Reflection with 3D Gaussian Splatting
- MEGA-GUI: Multi-stage Enhanced Grounding Agents for GUI Elements
- MM-Telco: Benchmarks and Multimodal Large Language Models for Telecom Applications
- PIGEON: VLM-Driven Object Navigation via Points of Interest Selection
- DAP: A Discrete-token Autoregressive Planner for Autonomous Driving
- Trust in Vision-Language Models: Insights from a Participatory User Workshop
- HIBMatch: Hypergraph Information Bottleneck for Semi-supervised Alzheimer's Progression
- DiffProtect: Generate Adversarial Examples with Diffusion Models for Facial Privacy Protection
- Lane Graph Extraction from Aerial Imagery via Lane Segmentation Refinement with Diffusion Models
- 3D-free meets 3D priors: Novel View Synthesis from a Single Image with Pretrained Diffusion Guidance
- BadVim: Unveiling Backdoor Threats in Visual State Space Model
- An Efficient Watermarking Method for Latent Diffusion Models via Low-Rank Adaptation and Dynamic Loss Weighting
- Revisiting Long-Tailed Learning: Insights from an Architectural Perspective
- Density-aware global-local attention network for point cloud segmentation
- Towards Collective Intelligence: Uncertainty-aware SAM Adaptation for Ambiguous Medical Image Segmentation
- Subjective and Objective Quality Evaluation of Super-Resolution Enhanced Broadcast Images on a Novel SR-IQA Dataset
- MeshCone: Second-Order Cone Programming for Geometrically-Constrained Mesh Enhancement
- FGNet: Leveraging Feature-Guided Attention to Refine SAM2 for 3D EM Neuron Segmentation
- RobustGait: Robustness Analysis for Appearance Based Gait Recognition
- Decoupling Scene Perception and Ego Status: A Multi-Context Fusion Approach for Enhanced Generalization in End-to-End Autonomous Driving
- MergeSlide: Continual Model Merging and Task-to-Class Prompt-Aligned Inference for Lifelong Learning on Whole Slide Images
- CapeNext: Rethinking and refining dynamic support information for category-agnostic pose estimation
- PlugTrack: Multi-Perceptive Motion Analysis for Adaptive Fusion in Multi-Object Tracking
- Low-Level Dataset Distillation for Medical Image Enhancement
- DGS-Net: Distillation-Guided Gradient Surgery for CLIP Fine-Tuning in AI-Generated Image Detection
- Learning Implicit Neural Degradation Representation for Unpaired Image Dehazing
- Semantics and Content Matter: Towards Multi-Prior Hierarchical Mamba for Image Deraining
- A Lightweight 3D Anomaly Detection Method with Rotationally Invariant Features
- CloseUpShot: Close-up Novel View Synthesis from Sparse-views via Point-conditioned Diffusion Model
- VEIL: Jailbreaking Text-to-Video Models via Visual Exploitation from Implicit Language
- Shedding Light on VLN Robustness: A Black-box Framework for Indoor Lighting-based Adversarial Attack
- MedGEN-Bench: Contextually entangled benchmark for open-ended multimodal medical generation
- WinMamba: Multi-Scale Shifted Windows in State Space Model for 3D Object Detection
- Automated Road Distress Detection Using Vision Transformersand Generative Adversarial Networks
- Skeletons Speak Louder than Text: A Motion-Aware Pretraining Paradigm for Video-Based Person Re-Identification
- SOMA: Feature Gradient Enhanced Affine-Flow Matching for SAR-Optical Registration
- THIR: Topological Histopathological Image Retrieval
- HDW-SR: High-Frequency Guided Diffusion Model based on Wavelet Decomposition for Image Super-Resolution
- GenTract: Generative Global Tractography
- Large Language Models Meet Extreme Multi-label Classification: Scaling and Multi-modal Framework
- Video Spatial Reasoning with Object-Centric 3D Rollout
- Birth of a Painting: Differentiable Brushstroke Reconstruction
- Difficulty-Aware Label-Guided Denoising for Monocular 3D Object Detection
- Self-Supervised Ultrasound Screen Detection
- RefineVAD: Semantic-Guided Feature Recalibration for Weakly Supervised Video Anomaly Detection
- End-to-End Multi-Person Pose Estimation with Pose-Aware Video Transformer
- 3DAlign-DAER: Dynamic Attention Policy and Efficient Retrieval Strategy for Fine-grained 3D-Text Alignment at Scale
- Hybrid-Domain Adaptative Representation Learning for Gaze Estimation
- MRIQT: Physics-Aware Diffusion Model for Image Quality Transfer in Neonatal Ultra-Low-Field MRI
- MMD-Thinker: Adaptive Multi-Dimensional Thinking for Multimodal Misinformation Detection
- Referring Camouflaged Object Detection With Multi-Context Overlapped Windows Cross-Attention
- GeoX-Bench: Benchmarking Cross-View Geo-Localization and Pose Estimation Capabilities of Large Multimodal Models
- Building Egocentric Procedural AI Assistant: Methods, Benchmarks, and Challenges
- SymGS : Leveraging Local Symmetries for 3D Gaussian Splatting Compression
- Is your VLM Sky-Ready? A Comprehensive Spatial Intelligence Benchmark for UAV Navigation
- Recognition of Abnormal Events in Surveillance Videos using Weakly Supervised Dual-Encoder Models
- SF-Recon: Simplification-Free Lightweight Building Reconstruction via 3D Gaussian Splatting
- Towards Metric-Aware Multi-Person Mesh Recovery by Jointly Optimizing Human Crowd in Camera Space
- TabFlash: Efficient Table Understanding with Progressive Question Conditioning and Token Focusing
- SkyReels-Text: Fine-grained Font-Controllable Text Editing for Poster Design
- CorrectAD: A Self-Correcting Agentic System to Improve End-to-end Planning in Autonomous Driving
- DriveLiDAR4D: Sequential and Controllable LiDAR Scene Generation for Autonomous Driving
- Computer Vision based group activity detection and action spotting
- YOLO Meets Mixture-of-Experts: Adaptive Expert Routing for Robust Object Detection
- Semi-Supervised Multi-Task Learning for Interpretable Quality As- sessment of Fundus Images
- Generalized Denoising Diffusion Codebook Models (gDDCM): Tokenizing images using a pre-trained diffusion model
- Descriptor: Distance-Annotated Traffic Perception Question Answering (DTPQA)
- TripleFDS: Triple Feature Disentanglement and Synthesis for Scene Text Editing
- What Color Is It? A Text-Interference Multimodal Hallucination Benchmark
- Delineate Anything Flow: Fast, Country-Level Field Boundary Detection from Any Source
- VOPE: Revisiting Hallucination of Vision-Language Models in Voluntary Imagination Task
- FUSE: A Flow-based Mapping Between Shapes
- Unlocking the Forgery Detection Potential of Vanilla MLLMs: A Novel Training-Free Pipeline
- InterMoE: Individual-Specific 3D Human Interaction Generation via Dynamic Temporal-Selective MoE
- Language-Guided Invariance Probing of Vision-Language Models
- Mapping the Vanishing and Transformation of Urban Villages in China
- Minimax Multi-Target Conformal Prediction with Applications to Imaging Inverse Problems
- Accuracy is Not Enough: Poisoning Interpretability in Federated Learning via Color Skew
- Robust Defense Strategies for Multimodal Contrastive Learning: Efficient Fine-tuning Against Backdoor Attacks
- TSE-Net: Semi-supervised Monocular Height Estimation from Single Remote Sensing Images
- Opt3DGS: Optimizing 3D Gaussian Splatting with Adaptive Exploration and Curvature-Aware Exploitation
- Hierarchical Prompt Learning for Image- and Text-Based Person Re-Identification
- Adaptive Multi-Scale Integration Unlocks Robust Cell Annotation in Histopathology Images
- VVS: Accelerating Speculative Decoding for Visual Autoregressive Generation via Partial Verification Skipping
- ICLR: Inter-Chrominance and Luminance Interaction for Natural Color Restoration in Low-Light Image Enhancement
- Tissue Aware Nuclei Detection and Classification Model for Histopathology Images
- A Real-Time Driver Drowsiness Detection System Using MediaPipe and Eye Aspect Ratio
- Alpha Divergence Losses for Biometric Verification
- CacheFlow: Compressive Streaming Memory for Efficient Long-Form Video Understanding
- Part-X-MLLM: Part-aware 3D Multimodal Large Language Model
- PhysX-Anything: Simulation-Ready Physical 3D Assets from Single Image
- Distribution Matching Distillation Meets Reinforcement Learning
- TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative Models
- Free-Form Scene Editor: Enabling Multi-Round Object Manipulation like in a 3D Engine
- Segment Anything Across Shots: A Method and Benchmark
- Back to Basics: Let Denoising Generative Models Denoise
- Image-based Morphological Characterization of Filamentous Biological Structures with Non-constant Curvature Shape Feature
- Slow - Motion Video Synthesis for Basketball Using Frame Interpolation
- Range Asymmetric Numeral Systems-Based Lightweight Intermediate Feature Compression for Split Computing of Deep Neural Networks
- Understanding the Representation of Older Adults in Motion Capture Locomotion Datasets
- Large Language Models and 3D Vision for Intelligent Robotic Perception and Autonomy: A Review
- End to End AI System for Surgical Gesture Sequence Recognition and Clinical Outcome Prediction
- TIMERIPPLE: Accelerating vDiTs by Understanding the Spatio-Temporal Correlations in Latent Space
- AttackVLA: Benchmarking Adversarial and Backdoor Attacks on Vision-Language-Action Models
- Recursive Threshold Median Filter and Autoencoder for Salt-and-Pepper Denoising: SSIM analysis of Images and Entropy Maps
- AURA: Development and Validation of an Augmented Unplanned Removal Alert System using Synthetic ICU Videos
- Deep Unfolded BM3D: Unrolling Non-local Collaborative Filtering into a Trainable Neural Network
- Bregman geometry-aware split Gibbs sampling for Bayesian Poisson inverse problems
- Multimodal RGB-HSI Feature Fusion with Patient-Aware Incremental Heuristic Meta-Learning for Oral Lesion Classification
- RAA-MIL: A Novel Framework for Classification of Oral Cytology
- MTMed3D: A Multi-Task Transformer-Based Model for 3D Medical Imaging
- DEMIST: \underline{DE}coupled \underline{M}ulti-stream latent d\underline{I}ffusion for Quantitative Myelin Map \underline{S}yn\underline{T}hesis
- Predicting upcoming visual features during eye movements yields scene representations aligned with human visual cortex
- Improving the Generalisation of Learned Reconstruction Frameworks
- BrainNormalizer: Anatomy-Informed Pseudo-Healthy Brain Reconstruction from Tumor MRI via Edge-Guided ControlNet
- Neo: Real-Time On-Device 3D Gaussian Splatting with Reuse-and-Update Sorting Acceleration
- Yanyun-3: Enabling Cross-Platform Strategy Game Operation with Vision-Language Models
- Inertia-Informed Orientation Priors for Event-Based Optical Flow Estimation
- SafeGRPO: Self-Rewarded Multimodal Safety Alignment via Rule-Governed Policy Optimization
- Scalable Vision-Guided Crop Yield Estimation
- ZoomEarth: Active Perception for Ultra-High-Resolution Geospatial Vision-Language Tasks
- TM-UNet: Token-Memory Enhanced Sequential Modeling for Efficient Medical Image Segmentation
- One target to align them all: LiDAR, RGB and event cameras extrinsic calibration for Autonomous Driving
- Rethinking Bias in Generative Data Augmentation for Medical AI: a Frequency Recalibration Method
- LiDAR-GS++:Improving LiDAR Gaussian Reconstruction via Diffusion Priors
- SpaceVLM: Sub-Space Modeling of Negation in Vision-Language Models
- Explainable AI-Generated Image Detection RewardBench
- Constructing and Interpreting Digital Twin Representations for Visual Reasoning via Reinforcement Learning
- Fast Reasoning Segmentation for Images and Videos
- Changes in Real Time: Online Scene Change Detection with Multi-View Fusion
- Reasoning Text-to-Video Retrieval via Digital Twin Video Representations and Large Language Models
- Leveraging Quantum-Based Architectures for Robust Diagnostics
- Calibrated Decomposition of Aleatoric and Epistemic Uncertainty in Deep Features for Inference-Time Adaptation
- MSLoRA: Multi-Scale Low-Rank Adaptation via Attention Reweighting
- VLA-R: Vision-Language Action Retrieval toward Open-World End-to-End Autonomous Driving
- Self-Supervised Visual Prompting for Cross-Domain Road Damage Detection
- Towards Rotation-only Imaging Geometry: Rotation Estimation
- Seeing Through the Rain: Resolving High-Frequency Conflicts in Deraining and Super-Resolution via Diffusion Guidance
- MFI-ResNet: Efficient ResNet Architecture Optimization via MeanFlow Compression and Selective Incubation
- RedVTP: Training-Free Acceleration of Diffusion Vision-Language Models Inference via Masked Token-Guided Visual Token Pruning
- Text-Guided Channel Perturbation and Pretrained Knowledge Integration for Unified Multi-Modality Image Fusion
- CoTBox-TTT: Grounding Medical VQA with Visual Chain-of-Thought Boxes During Test-time Training
- MaskAnyNet: Rethinking Masked Image Regions as Valuable Information in Supervised Learning
- Towards Temporal Fusion Beyond the Field of View for Camera-based Semantic Scene Completion
- Visible Structure Retrieval for Lightweight Image-Based Relocalisation
- MdaIF: Robust One-Stop Multi-Degradation-Aware Image Fusion with Language-Driven Semantics
- D$^{2}$-VPR: A Parameter-efficient Visual-foundation-model-based Visual Place Recognition Method via Knowledge Distillation and Deformable Aggregation
- ReaSon: Reinforced Causal Search with Information Bottleneck for Video Understanding
- HiGFA: Hierarchical Guidance for Fine-grained Data Augmentation with Diffusion Models
- EmoVerse: A MLLMs-Driven Emotion Representation Dataset for Interpretable Visual Emotion Analysis
- SEMC: Structure-Enhanced Mixture-of-Experts Contrastive Learning for Ultrasound Standard Plane Recognition
- Through-Foliage Surface-Temperature Reconstruction for early Wildfire Detection
- Beyond Pixels: Semantic-aware Typographic Attack for Geo-Privacy Protection
- TempoMaster: Efficient Long Video Generation via Next-Frame-Rate Prediction
- Rank-Aware Agglomeration of Foundation Models for Immunohistochemistry Image Cell Counting
- Fine-Grained Representation for Lane Topology Reasoning
- Seg-VAR: Image Segmentation with Visual Autoregressive Modeling
- LoRA-Enhanced Vision Transformer for Single Image based Morphing Attack Detection via Knowledge Distillation from EfficientNet
- Pixels or Positions? Benchmarking Modalities in Group Activity Recognition
- Open-World Test-Time Adaptation with Hierarchical Feature Aggregation and Attention Affine
- C3Net: Context-Contrast Network for Camouflaged Object Detection
- Multivariate Diffusion Transformer with Decoupled Attention for High-Fidelity Mask-Text Collaborative Facial Generation
- Denoising Vision Transformer Autoencoder with Spectral Self-Regularization
- Medical Knowledge Intervention Prompt Tuning for Medical Image Classification
- DPVO-QAT++: Heterogeneous QAT and CUDA Kernel Fusion for High-Performance Deep Patch Visual Odometry
- Toward Real-world Text Image Forgery Localization: Structured and Interpretable Data Synthesis
- Hi-Reco: High-Fidelity Real-Time Conversational Digital Humans
- DensePercept-NCSSD: Vision Mamba towards Real-time Dense Visual Perception with Non-Causal State Space Duality
- Appreciate the View: A Task-Aware Evaluation Framework for Novel View Synthesis
- BridgeEQA: Virtual Embodied Agents for Real Bridge Inspections
- R$^{2}$Seg: Training-Free OOD Medical Tumor Segmentation via Anatomical Reasoning and Statistical Rejection
- HEDGE: Hallucination Estimation via Dense Geometric Entropy for VQA with Vision-Language Models
- Counting Through Occlusion: Framework for Open World Amodal Counting
- FSDAM: Few-Shot Driving Attention Modeling via Vision-Language Coupling
- Backdoor Attacks on Open Vocabulary Object Detectors via Multi-Modal Prompt Tuning
- Direct Visual Grounding by Directing Attention of Visual Tokens
- Deep Imbalanced Multi-Target Regression: 3D Point Cloud Voxel Content Estimation in Simulated Forests
- SAGE: Saliency-Guided Contrastive Embeddings
- Which Way from B to A: The role of embedding geometry in image interpolation for Stable Diffusion
- Lightweight Optimal-Transport Harmonization on Edge Devices
- Enhancing Neuro-Oncology Through Self-Assessing Deep Learning Models for Brain Tumor Unified Model for MRI Segmentation
- MSRNet: A Multi-Scale Recursive Network for Camouflaged Object Detection
- SAGA: Source Attribution of Generative AI Videos
- Video Finetuning Improves Reasoning Between Frames
- View-aware Cross-modal Distillation for Multi-view Action Recognition
- Uni-Hand: Universal Hand Motion Forecasting in Egocentric Views
- Simple Lines, Big Ideas: Towards Interpretable Assessment of Human Creativity from Drawings
- ActVAR: Activating Mixtures of Weights and Tokens for Efficient Visual Autoregressive Generation
- Reconstructing 3D Scenes in Native High Dynamic Range
- FDP: A Frequency-Decomposition Preprocessing Pipeline for Unsupervised Anomaly Detection in Brain MRI
- DeepSport: A Multimodal Large Language Model for Comprehensive Sports Video Reasoning via Agentic Reinforcement Learning
- CASL: Curvature-Augmented Self-supervised Learning for 3D Anomaly Detection
- Explore How to Inject Beneficial Noise in MLLMs
- CoordAR: One-Reference 6D Pose Estimation of Novel Objects via Autoregressive Coordinate Map Generation
- Generative Photographic Control for Scene-Consistent Video Cinematic Editing
- Text2Traffic: A Text-to-Image Generation and Editing Method for Traffic Scenes
- PFAvatar: Pose-Fusion 3D Personalized Avatar Reconstruction from Real-World Outfit-of-the-Day Photos
- ProtoAnomalyNCD: Prototype Learning for Multi-class Novel Anomaly Discovery in Industrial Scenarios
- Semi-Supervised High Dynamic Range Image Reconstructing via Bi-Level Uncertain Area Masking
- Recurrent Autoregressive Diffusion: Global Memory Meets Local Attention
- T2I-Based Physical-World Appearance Attack against Traffic Sign Recognition Systems in Autonomous Driving
- EndoSight AI: Deep Learning-Driven Real-Time Gastrointestinal Polyp Detection and Segmentation for Enhanced Endoscopic Diagnostics
- CalibrateMix: Guided-Mixup Calibration of Image Semi-Supervised Models
- GrOCE:Graph-Guided Online Concept Erasure for Text-to-Image Diffusion Models
- HiFusion: Hierarchical Intra-Spot Alignment and Regional Context Fusion for Spatial Gene Expression Prediction from Histopathology
- ArtiWorld: LLM-Driven Articulation of 3D Objects in Scenes
- Concept Regions Matter: Benchmarking CLIP with a New Cluster-Importance Approach
- UNSEEN: Enhancing Dataset Pruning from a Generalization Perspective
- Semantic Prioritization in Visual Counterfactual Explanations with Weighted Segmentation and Auto-Adaptive Region Selection
- PerTouch: VLM-Driven Agent for Personalized and Semantic Image Retouching
- Medal S: Spatio-Textual Prompt Model for Medical Segmentation
- Infinite-Story: A Training-Free Consistent Text-to-Image Generation
- SAGE: Spuriousness-Aware Guided Prompt Exploration for Mitigating Multimodal Bias
- Beyond Darkness: Thermal-Supervised 3D Gaussian Splatting for Low-Light Novel View Synthesis
- You Only Look Omni Gradient Backpropagation for Moving Infrared Small Target Detection
- Geometry Meets Light: Leveraging Geometric Priors for Universal Photometric Stereo under Limited Multi-Illumination Cues
- SpectralAdapt: Semi-Supervised Domain Adaptation with Spectral Priors for Human-Centered Hyperspectral Image Reconstruction
- REVISOR: Beyond Textual Reflection, Towards Multimodal Introspective Reasoning in Long-Form Video Understanding
- Towards 3D Object-Centric Feature Learning for Semantic Scene Completion
- Uni-Inter: Unifying 3D Human Motion Synthesis Across Diverse Interaction Contexts
- uCLIP: Parameter-Efficient Multilingual Extension of Vision-Language Models with Unpaired Data
- MGCA-Net: Multi-Grained Category-Aware Network for Open-Vocabulary Temporal Action Localization
- DiffPixelFormer: Differential Pixel-Aware Transformer for RGB-D Indoor Scene Segmentation
- ViSS-R1: Self-Supervised Reinforcement Video Reasoning
- Monocular 3D Lane Detection via Structure Uncertainty-Aware Network with Curve-Point Queries
- LLM-Driven Robots Risk Enacting Discrimination, Violence, and Unlawful Actions
- Psychological stress during Examination and its estimation by handwriting in answer script
- Real-time pothole detection with onboard sensors and camera on vehicles
- A Method for Identifying Farmland System Habitat Types Based on the Dynamic-Weighted Feature Fusion Network Model
- AGENet: Adaptive Edge-aware Geodesic Distance Learning for Few-Shot Medical Image Segmentation
- EPSegFZ: Efficient Point Cloud Semantic Segmentation for Few- and Zero-Shot Scenarios with Language Guidance
- Task-Aware 3D Affordance Segmentation via 2D Guidance and Geometric Refinement
- LE-CapsNet: A Light and Enhanced Capsule Network
- Target-Balanced Score Distillation
- CompressNAS : A Fast and Efficient Technique for Model Compression using Decomposition
- AdaptFly: Prompt-Guided Adaptation of Foundation Models for Low-Altitude UAV Networks
- Do Blind Spots Matter for Word-Referent Mapping? A Computational Study with Infant Egocentric Video
- GROVER: Graph-guided Representation of Omics and Vision with Expert Regulation for Adaptive Spatial Multi-omics Fusion
- Exposing DeepFakes via Hyperspectral Domain Mapping
- Toward bilipshiz geometric models
- Concept-RuleNet: Grounded Multi-Agent Neurosymbolic Reasoning in Vision Language Models
- Batch Transformer Architecture: Case of Synthetic Image Generation for Emotion Expression Facial Recognition
- Image-POSER: Reflective RL for Multi-Expert Image Generation and Editing
- SOTFormer: A Minimal Transformer for Unified Object Tracking and Trajectory Prediction
- Defending Unauthorized Model Merging via Dual-Stage Weight Protection
- FocusSDF: Boundary-Aware Learning for Medical Image Segmentation via Signed Distance Supervision
- Lacking Data? No worries! How synthetic images can alleviate image scarcity in wildlife surveys: a case study with muskox (Ovibos moschatus)
- Advancing Annotat3D with Harpia: A CUDA-Accelerated Library For Large-Scale Volumetric Data Segmentation
- Prompt Triage: Structured Optimization Enhances Vision-Language Model Performance on Medical Imaging Benchmarks
- PI-NAIM: Path-Integrated Neural Adaptive Imputation Model
- Seeing the Forest and the Trees: Query-Aware Tokenizer for Long-Video Multimodal Language Models
- From Events to Clarity: The Event-Guided Diffusion Framework for Dehazing
- Evaluation of Attention Mechanisms in U-Net Architectures for Semantic Segmentation of Brazilian Rock Art Petroglyphs
- From Classification to Cross-Modal Understanding: Leveraging Vision-Language Models for Fine-Grained Renal Pathology
- BeyondFacial: Identity-Preserving Personalized Generation Beyond Facial Close-ups
- LithoSeg: A Coarse-to-Fine Framework for High-Precision Lithography Segmentation
- LIHE: Linguistic Instance-Split Hyperbolic-Euclidean Framework for Generalized Weakly-Supervised Referring Expression Comprehension
- Null-Space Diffusion Distillation for Efficient Photorealistic Lensless Imaging
- Bridging Vision and Language for Robust Context-Aware Surgical Point Tracking: The VL-SurgPT Dataset and Benchmark
- GCAgent: Long-Video Understanding via Schematic and Narrative Episodic Memory
- VPHO: Joint Visual-Physical Cue Learning and Aggregation for Hand-Object Pose Estimation
- Improved Masked Image Generation with Knowledge-Augmented Token Representations
- SRSplat: Feed-Forward Super-Resolution Gaussian Splatting from Sparse Multi-View Images
- FedSDA: Federated Stain Distribution Alignment for Non-IID Histopathological Image Classification
- DCMM-Transformer: Degree-Corrected Mixed-Membership Attention for Medical Imaging
- DeiTFake: Deepfake Detection Model using DeiT Multi-Stage Training
- UniABG: Unified Adversarial View Bridging and Graph Correspondence for Unsupervised Cross-View Geo-Localization
- PipeDiT: Accelerating Diffusion Transformers in Video Generation with Task Pipelining and Model Decoupling
- MovSemCL: Movement-Semantics Contrastive Learning for Trajectory Similarity
- DCA-LUT: Deep Chromatic Alignment with 5D LUT for Purple Fringing Removal
- Learning to Hear by Seeing: It's Time for Vision Language Models to Understand Artistic Emotion from Sight and Sound
- Point Cloud Quantization through Multimodal Prompting for 3D Understanding
- Supervised Multilabel Image Classification Using Residual Networks with Probabilistic Reasoning
- SemanticStitch: Enhancing Image Coherence through Foreground-Aware Seam Carving
- Teaching Prompts to Coordinate: Hierarchical Layer-Grouped Prompt Tuning for Continual Learning
- Learning from Dense Events: Towards Fast Spiking Neural Networks Training via Event Dataset Distillatio
- Sparse by Rule: Probability-Based N:M Pruning for Spiking Neural Networks
- DINOv3-Guided Cross Fusion Framework for Semantic-aware CT generation from MRI and CBCT
- Adaptive Begin-of-Video Tokens for Autoregressive Video Diffusion Models
- Did Models Sufficient Learn? Attribution-Guided Training via Subset-Selected Counterfactual Augmentation
- BdSL-SPOTER: A Transformer-Based Framework for Bengali Sign Language Recognition with Cultural Adaptation
- Fine-Grained DINO Tuning with Dual Supervision for Face Forgery Detection
- MediRound: Multi-Round Entity-Level Reasoning Segmentation in Medical Images
- RadarMP: Motion Perception for 4D mmWave Radar in Autonomous Driving
- OAD-Promoter: Enhancing Zero-shot VQA using Large Language Models with Object Attribute Description
- Compression and Inference of Spiking Neural Networks on Resource-Constrained Hardware
- MAVIS: A Benchmark for Multimodal Source Attribution in Long-form Visual Question Answering
- Breaking the Modality Wall: Time-step Mixup for Efficient Spiking Knowledge Transfer from Static to Event Domain
- FIA-Edit: Frequency-Interactive Attention for Efficient and High-Fidelity Inversion-Free Text-Guided Image Editing
- Rethinking Multimodal Point Cloud Completion: A Completion-by-Correction Perspective
- MMRINet: Efficient Mamba-Based Segmentation with Dual-Path Refinement for Low-Resource MRI Analysis
- Cross-View Cross-Modal Unsupervised Domain Adaptation for Driver Monitoring System
- Bridging Granularity Gaps: Hierarchical Semantic Learning for Cross-domain Few-shot Segmentation
- OmniSparse: Training-Aware Fine-Grained Sparse Attention for Long-Video MLLMs
- LSS3D: Learnable Spatial Shifting for Consistent and High-Quality 3D Generation from Single-Image
- GeoMVD: Geometry-Enhanced Multi-View Generation Model Based on Geometric Information Extraction
- A Novel AI-Driven System for Real-Time Detection of Mirror Absence, Helmet Non-Compliance, and License Plates Using YOLOv8 and OCR
- Mixture of States: Routing Token-Level Dynamics for Multimodal Generation
- FaNe: Towards Fine-Grained Cross-Modal Contrast with False-Negative Reduction and Text-Conditioned Sparse Attention
- Model Inversion Attack Against Deep Hashing
- Fusionista2.0: Efficiency Retrieval System for Large-Scale Datasets
- Prompt-Conditioned FiLM and Multi-Scale Fusion on MedSigLIP for Low-Dose CT Quality Assessment
- A Disease-Aware Dual-Stage Framework for Chest X-ray Report Generation
- CrossVid: A Comprehensive Benchmark for Evaluating Cross-Video Reasoning in Multimodal Large Language Models
- Critical or Compliant? The Double-Edged Sword of Reasoning in Chain-of-Thought Explanations
- CURE: Cultural Understanding and Reasoning Evaluation - A Framework for "Thick" Culture Alignment Evaluation in LLMs
- Exploring Parameter-Efficient Fine-Tuning and Backtranslation for the WMT 25 General Translation Task
- LLMLagBench: Identifying Temporal Training Boundaries in Large Language Models
- PRISM of Opinions: A Persona-Reasoned Multimodal Framework for User-centric Conversational Stance Detection
- AI-Salesman: Towards Reliable Large Language Model Driven Telemarketing
- Seeing is Believing: Rich-Context Hallucination Detection for MLLMs via Backward Visual Grounding
- CriticSearch: Fine-Grained Credit Assignment for Search Agents via a Retrospective Critic
- MME-RAG: Multi-Manager-Expert Retrieval-Augmented Generation for Fine-Grained Entity Recognition in Task-Oriented Dialogues
- ViConBERT: Context-Gloss Aligned Vietnamese Word Embedding for Polysemous and Sense-Aware Representations
- AugAbEx : Way Forward for Extractive Case Summarization
- Do LLMs and Humans Find the Same Questions Difficult? A Case Study on Japanese Quiz Answering
- Don't Think of the White Bear: Ironic Negation in Transformer Models Under Cognitive Load
- From Phonemes to Meaning: Evaluating Large Language Models on Tamil
- Probing Preference Representations: A Multi-Dimensional Evaluation and Analysis Method for Reward Models
- Assessing LLMs for Serendipity Discovery in Knowledge Graphs: A Case for Drug Repurposing
- SGuard-v1: Safety Guardrail for Large Language Models
- QA-Noun: Representing Nominal Semantics via Natural Language Question-Answer Pairs
- TAdaRAG: Task Adaptive Retrieval-Augmented Generation via On-the-Fly Knowledge Graph Construction
- Mitigating Length Bias in RLHF through a Causal Lens
- MMWOZ: Building Multimodal Agent for Task-oriented Dialogue
- Uni-MoE-2.0-Omni: Scaling Language-Centric Omnimodal Large Model with Advanced MoE, Training and Data
- Knots: A Large-Scale Multi-Agent Enhanced Expert-Annotated Dataset and LLM Prompt Optimization for NOTAM Semantic Parsing
- Reason-KE++: Aligning the Process, Not Just the Outcome, for Faithful LLM Knowledge Editing
- Evolve the Method, Not the Prompts: Evolutionary Synthesis of Jailbreak Attacks on LLMs
- Adaptive Focus Memory for Language Models
- On the Brittleness of LLMs: A Journey around Set Membership
- Evidence of Phase Transitions in Small Transformer-Based Language Models
- LLM Reinforcement in Context
- Evaluating Autoformalization Robustness via Semantically Similar Paraphrasing
- BioMedJImpact: A Comprehensive Dataset and LLM Pipeline for AI Engagement and Scientific Impact Analysis of Biomedical Journals
- From Passive to Persuasive: Steering Emotional Nuance in Human-AI Negotiation
- Quantifying consistency and accuracy of Latent Dirichlet Allocation
- NeuroLex: A Lightweight Domain Language Model for EEG Report Understanding and Generation
- From Perception to Reasoning: Deep Thinking Empowers Multimodal Large Language Models
- Auditing Google's AI Overviews and Featured Snippets: A Case Study on Baby Care and Pregnancy
- Visual Room 2.0: Seeing is Not Understanding for MLLMs
- Fine-Tuned LLMs Know They Don't Know: A Parameter-Efficient Approach to Recovering Honesty
- AA-Omniscience: Evaluating Cross-Domain Knowledge Reliability in Large Language Models
- How Good is BLI as an Alignment Measure: A Study in Word Embedding Paradigm
- Spark-Prover-X1: Formal Theorem Proving Through Diverse Data Training
- BeDiscovER: The Benchmark of Discourse Understanding in the Era of Reasoning Language Models
- Evaluating the Ability of Large Language Models to Identify Adherence to CONSORT Reporting Guidelines in Randomized Controlled Trials: A Methodological Evaluation Study
- Extracting Events Like Code: A Multi-Agent Programming Framework for Zero-Shot Event Extraction
- A Comparative Analysis of Recurrent and Attention Architectures for Isolated Sign Language Recognition
- Zero-Shot Grammar Competency Estimation Using Large Language Model Generated Pseudo Labels
- Distinguishing Repetition Disfluency from Morphological Reduplication in Bangla ASR Transcripts: A Novel Corpus and Benchmarking Analysis
- TCM-5CEval: Extended Deep Evaluation Benchmark for LLM's Comprehensive Clinical Research Competence in Traditional Chinese Medicine
- Translation Entropy: A Statistical Framework for Evaluating Translation Systems
- Evaluating Large Language Models for Diacritic Restoration in Romanian Texts: A Comparative Study
- Seeing isn't Hearing: Benchmarking Vision Language Models at Interpreting Spectrograms
- Souper-Model: How Simple Arithmetic Unlocks State-of-the-Art LLM Performance
- RegionMarker: A Region-Triggered Semantic Watermarking Framework for Embedding-as-a-Service Copyright Protection
- AHaSIS: Shared Task on Sentiment Analysis for Arabic Dialects
- Donors and Recipients: On Asymmetric Transfer Across Tasks and Languages with Parameter-Efficient Fine-Tuning
- Can Large Language Models Function as Qualified Pediatricians? A Systematic Evaluation in Real-World Clinical Contexts
- Mem-PAL: Towards Memory-based Personalized Dialogue Assistants for Long-term User-Agent Interaction
- Non-Linear Scoring Model for Translation Quality Evaluation
- Aspect-Level Obfuscated Sentiment in Thai Financial Disclosures and Its Impact on Abnormal Returns
- Applying Large Language Models to Characterize Public Narratives
- Toward Conversational Hungarian Speech Recognition: Introducing the BEA-Large and BEA-Dialogue Datasets
- Beyond SELECT: A Comprehensive Taxonomy-Guided Benchmark for Real-World Text-to-SQL Translation
- Omni Memory System for Personalized, Long Horizon, Self-Evolving Agents
- Crossing Borders: A Multimodal Challenge for Indian Poetry Translation and Image Generation
- LLM-Generated Negative News Headlines Dataset: Creation and Benchmarking Against Real Journalism
- CLINB: A Climate Intelligence Benchmark for Foundational Models
- SynBullying: A Multi LLM Synthetic Conversational Dataset for Cyberbullying Detectio
- EduAgentQG: A Multi-Agent Workflow Framework for Personalized Question Generation
- Automatic generation of DRI Statements
- Generative AI as a Linguistic Equalizer in Global Science
- Do LLMs Really Struggle at NL-FOL Translation? Revealing their Strengths via a Novel Benchmarking Strategy
- Leveraging Large Language Models for Career Mobility Analysis: A Study of Gender, Race, and Job Change Using U.S. Online Resume Profiles
- How Far Do SSL Speech Models Listen for Tone? Temporal Focus of Tone Representation under Low-resource Transfer
- VoiceCraft-X: Unifying Multilingual, Voice-Cloning Speech Synthesis and Speech Editing
- DenseAnnotate: Enabling Scalable Dense Caption Collection for Images and 3D Scenes via Spoken Descriptions
- Co-Layout: LLM-driven Co-optimization for Interior Layout
- Evolving Prompts for Toxicity Search in Large Language Models
- Accepted with Minor Revisions: Value of AI-Assisted Scientific Writing
- A Content-Preserving Secure Linguistic Steganography
- WebCoach: Self-Evolving Web Agents with Cross-Session Memory Guidance
- PragWorld: A Benchmark Evaluating LLMs' Local World Model under Minimal Linguistic Alterations and Conversational Dynamics
- Dropouts in Confidence: Moral Uncertainty in Human-LLM Alignment
- Attention Grounded Enhancement for Visual Document Retrieval
- ForgeDAN: An Evolutionary Framework for Jailbreaking Aligned Large Language Models
- Historical/temporal necessities/possibilities, and a logical theory of them in branching time
- Simultaneous Machine Translation with Large Language Models
- Vashantor: A Large-scale Multilingual Benchmark Dataset for Automated Translation of Bangla Regional Dialects to Bangla Language
- Conversational SimulMT: Efficient Simultaneous Translation with Large Language Models
- ProFuser: Progressive Fusion of Large Language Models
- Contextual Breach: Assessing the Robustness of Transformer-based QA Models
- Is deeper always better? Replacing linear mappings with deep learning networks in the Discriminative Lexicon Model
- Uncovering Factor Level Preferences to Improve Human-Model Alignment
- Is Our Chatbot Telling Lies? Assessing Correctness of an LLM-based Dutch Support Chatbot
- DeepMIDE: A Multi-Output Spatio-Temporal Method for Ultra-Scale Offshore Wind Energy Forecasting
- EXAGREE: Mitigating Explanation Disagreement with Stakeholder-Aligned Models
- Fair In-Context Learning via Latent Concept Variables
- Competence-Aware AI Agents with Metacognition for Unknown Situations and Environments (MUSE)
- Toward Explainable Offline RL: Analyzing Representations in Intrinsically Motivated Decision Transformers
- Neutron Reflectometry by Gradient Descent
- A Comparative Benchmark of Federated Learning Strategies for Mortality Prediction on Heterogeneous and Imbalanced Clinical Data
- Learning at the Speed of Physics: Equilibrium Propagation on Oscillator Ising Machines
- Using Self-Supervised Auxiliary Tasks to Improve Fine-Grained Facial Representation
- FinGPT: Open-Source Financial Large Language Models
- Foundations of Structural Causal Models with Latent Selection
- A comprehensive and easy-to-use multi-domain multi-task medical imaging meta-dataset
- Architectures and random properties of symplectic quantum circuits
- Learning Optimal Distributionally Robust Stochastic Control in Continuous State Spaces
- Emulation with uncertainty quantification of regional sea-level change caused by the Antarctic Ice Sheet
- MLR-Copilot: Autonomous Machine Learning Research based on Large Language Models Agents
- On the Limitations of Language Targeted Pruning: Investigating the Calibration Language Impact in Multilingual LLM Pruning
- Identify As A Human Does: A Pathfinder of Next-Generation Anti-Cheat Framework for First-Person Shooter Games
- A Framework for Real-Time Volcano-Seismic Event Recognition Based on Multi-Station Seismograms and Semantic Segmentation Models
- Time-Series-Informed Closed-loop Learning for Sequential Decision Making and Control
- NoLBERT: A No Lookahead(back) Foundational Language Model
- Evaluating Multiple Instance Learning Strategies for Automated Sebocyte Droplet Counting
- qc-kmeans: A Quantum Compressive K-Means Algorithm for NISQ Devices
- TimeStampEval: A Simple LLM Eval and a Little Fuzzy Matching Trick to Improve Search Accuracy
- MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling
- On the Notion that Language Models Reason
- Scaling Open-Weight Large Language Models for Hydropower Regulatory Information Extraction: A Systematic Analysis
- Towards Autoformalization of LLM-generated Outputs for Requirement Verification
- Three Stage Narrative Analysis; Plot-Sentiment Breakdown, Structure Learning and Concept Detection
- Identifying Imaging Follow-Up in Radiology Reports: A Comparative Analysis of Traditional ML and LLM Approaches
- MedPT: A Massive Medical Question Answering Dataset for Brazilian-Portuguese Speakers
- Context-Emotion Aware Therapeutic Dialogue Generation: A Multi-component Reinforcement Learning Approach to Language Models for Mental Health Support
- A Reasoning Paradigm for Named Entity Recognition
- Ground Plane Projection for Improved Traffic Analytics at Intersections
- CLAReSNet: When Convolution Meets Latent Attention for Hyperspectral Image Classification
- More Than Irrational: Modeling Belief-Biased Agents
- AGGRNet: Selective Feature Extraction and Aggregation for Enhanced Medical Image Classification
- Multi-Domain EEG Representation Learning with Orthogonal Mapping and Attention-based Fusion for Cognitive Load Classification
- Stochastic Predictive Analytics for Stocks in the Newsvendor Problem
- From Black Box to Bijection: Interpreting Machine Learning to Build a Zeta Map Algorithm
- GRAPHTEXTACK: A Realistic Black-Box Node Injection Attack on LLM-Enhanced GNNs
- Real-Time Drivers' Drowsiness Detection and Analysis through Deep Learning
- MOON2.0: Dynamic Modality-balanced Multimodal Representation Learning for E-commerce Product Understanding
- A Multicollinearity-Aware Signal-Processing Framework for Cross-$\beta$ Identification via X-ray Scattering of Alzheimer's Tissue
- Discovering autonomous quantum error correction via deep reinforcement learning
- Iris: First-Class Multi-GPU Programming Experience in Triton
- DINO-Detect: A Simple yet Effective Framework for Blur-Robust AI-Generated Image Detection
- FERMI-ML: A Flexible and Resource-Efficient Memory-In-Situ SRAM Macro for TinyML acceleration
- DLMMPR:Deep Learning-based Measurement Matrix for Phase Retrieval
- Group-Aware Reinforcement Learning for Output Diversity in Large Language Models
- OPFormer: Object Pose Estimation leveraging foundation model with geometric encoding
- LLM4SCREENLIT: Recommendations on Assessing the Performance of Large Language Models for Screening Literature in Systematic Reviews
- Auto-encoder model for faster generation of effective one-body gravitational waveform approximations
- Adaptive Dual-Layer Web Application Firewall (ADL-WAF) Leveraging Machine Learning for Enhanced Anomaly and Threat Detection
- Scalable Hierarchical AI-Blockchain Framework for Real-Time Anomaly Detection in Large-Scale Autonomous Vehicle Networks
- AI Bill of Materials and Beyond: Systematizing Security Assurance through the AI Risk Scanning (AIRS) Framework
- Accelerated Distributional Temporal Difference Learning with Linear Function Approximation
- Improving Direct Persian-English Speech-to-Speech Translation with Discrete Units and Synthetic Parallel Data
- X-VMamba: Explainable Vision Mamba
- An Evaluation Framework for Network IDS/IPS Datasets: Leveraging MITRE ATT&CK and Industry Relevance Metrics
- TSB-HB: A Hierarchical Bayesian Extension of the TSB Model for Intermittent Demand Forecasting
- Adaptively Coordinating with Novel Partners via Learned Latent Strategies
- Prompt-Driven Domain Adaptation for End-to-End Autonomous Driving via In-Context RL
- RoCoISLR: A Romanian Corpus for Isolated Sign Language Recognition
- Event-CausNet: Unlocking Causal Knowledge from Text with Large Language Models for Reliable Spatio-Temporal Forecasting
- Function-on-Function Bayesian Optimization
- Neuro-Logic Lifelong Learning
- Practical Causal Evaluation Metrics for Biological Networks
- Enhancing LLM Code Generation Capabilities through Test-Driven Development and Code Interpreter
- Efficient Adversarial Malware Defense via Trust-Based Raw Override and Confidence-Adaptive Bit-Depth Reduction
- DIGing--SGLD: Decentralized and Scalable Langevin Sampling over Time--Varying Networks
- Benign Overfitting in Linear Classifiers with a Bias Term
- Scalable learning of macroscopic stochastic dynamics
- Mapping fNIRS Signals to Agent Performance: Toward Reinforcement Learning from Neural Feedback
- Structured Imitation Learning of Interactive Policies through Inverse Games
- Bootstrapping LLMs via Preference-Based Policy Optimization
- Classification of Hope in Textual Data using Transformer-Based Models
- Tokenize Once, Recommend Anywhere: Unified Item Tokenization for Multi-domain LLM-based Recommendation
- MCAQ-YOLO: Morphological Complexity-Aware Quantization for Efficient Object Detection with Curriculum Learning
- Revealing the dynamic responses of Pb under shock loading based on DFT-accuracy machine learning potential
- GEM: Generative Entropy-Guided Preference Modeling for Few-shot Alignment of LLMs
- MeanFlow Transformers with Representation Autoencoders
- Reconstruction of Manifold Distances from Noisy Observations
- Orientation-Free Neural Network-Based Bias Estimation for Low-Cost Stationary Accelerometers
- Rethinking Saliency Maps: A Cognitive Human Aligned Taxonomy and Evaluation Framework for Explanations
- STEP: Success-Rate-Aware Trajectory-Efficient Policy Optimization
- NuBench: An Open Benchmark for Deep Learning-Based Event Reconstruction in Neutrino Telescopes
- Region-Point Joint Representation for Effective Trajectory Similarity Learning
- InteractiveGNNExplainer: A Visual Analytics Framework for Multi-Faceted Understanding and Probing of Graph Neural Network Predictions
- Learning to Solve Resource-Constrained Project Scheduling Problems with Duration Uncertainty using Graph Neural Networks
- Likelihood-guided Regularization in Attention Based Models
- Case study of a differentiable heterogeneous multiphysics solver for a nuclear fusion application
- Causal Inference, Biomarker Discovery, Graph Neural Network, Feature Selection
- EL3DD: Extended Latent 3D Diffusion for Language Conditioned Multitask Manipulation
- AutoMalDesc: Large-Scale Script Analysis for Cyber Threat Research
- Moving Pictures of Thought: Extracting Visual Knowledge in Charles S. Peirce's Manuscripts with Vision-Language Models
- Uncovering Causal Drivers of Energy Efficiency for Industrial Process in Foundry via Time-Series Causal Inference
- Taming Barren Plateaus in Arbitrary Parameterized Quantum Circuits Without Sacrificing Expressibility
- Exploring Multi-Table Retrieval Through Iterative Search
- Semantic Document Derendering: SVG Reconstruction via Vision-Language Modeling
- Systematic evaluation of time-frequency features for binaural sound source localization
- The Shape of Data: Topology Meets Analytics. A Practical Introduction to Topological Analytics and the Stability Index (TSI) in Business
- AI Fairness Beyond Complete Demographics: Current Achievements and Future Directions
- BootOOD: Self-Supervised Out-of-Distribution Detection via Synthetic Sample Exposure under Neural Collapse
- Power Homotopy for Zeroth-Order Non-Convex Optimizations
- A Gentle Introduction to Conformal Time Series Forecasting
- AtlasMorph: Learning conditional deformable templates for brain MRI
- Live-SWE-agent: Can Software Engineering Agents Self-Evolve on the Fly?
- OlmoEarth: Stable Latent Image Modeling for Multimodal Earth Observation
- Why is "Chicago" Predictive of Deceptive Reviews? Using LLMs to Discover Language Phenomena from Lexical Cues
- Cost-Driven Synthesis of Sound Abstract Interpreters
- T-SAR: A Full-Stack Co-design for CPU-Only Ternary LLM Inference via In-Place SIMD ALU Reorganization
- QUILL: An Algorithm-Architecture Co-Design for Cache-Local Deformable Attention
- Training-Free Multi-View Extension of IC-Light for Textual Position-Aware Scene Relighting
- Generalist Foundation Models Are Not Clinical Enough for Hospital Operations
- From Power to Precision: Learning Fine-grained Dexterity for Multi-fingered Robotic Hands
- UnSAMv2: Self-Supervised Learning Enables Segment Anything at Any Granularity
- Scaling Spatial Intelligence with Multimodal Foundation Models
- Loss Patterns of Neural Networks
- Achieving Fairness with a Simple Ridge Penalty
- State-Space Constraints Can Improve the Generalisation of the Differentiable Neural Computer to Input Sequences With Unseen Length
- Beyond Statistical Similarity: Rethinking Metrics for Deep Generative Models in Engineering Design
- CG-FedLLM: How to Compress Gradients in Federated Fune-tuning for Large Language Models
- GLANCE: Global Actions in a Nutshell for Counterfactual Explainability
- Uncertainty Quantification for Deep Learning
- Finite basis Kolmogorov-Arnold networks: domain decomposition for data-driven and physics-informed problems
- Deep deterministic policy gradient with symmetric data augmentation for lateral attitude tracking control of a fixed-wing aircraft
- Temporal Test-Time Adaptation with State-Space Models
- Efficiently Computing Compact Formal Explanations
- Exploiting Missing Data Remediation Strategies using Adversarial Missingness Attacks
- Communication-Efficient Federated Low-Rank Update Algorithm and its Connection to Implicit Regularization
- Fira: Can We Achieve Full-rank Training of LLMs Under Low-rank Constraint?
- Explanation-Preserving Augmentation for Semi-Supervised Graph Representation Learning
- Finding Kissing Numbers with Game-theoretic Reinforcement Learning
- Fast and Robust Simulation-Based Inference With Optimization Monte Carlo
- PAST: A Primary-Auxiliary Spatio-Temporal Network for Traffic Time Series Imputation
- MMWSTM-ADRAN+: A Novel Hybrid Deep Learning Architecture for Enhanced Climate Time Series Forecasting and Extreme Event Prediction
- Larger Datasets Can Be Repeated More: A Theoretical Analysis of Multi-Epoch Scaling in Linear Regression
- Discovering Operational Patterns Using Image-Based Convolutional Clustering and Composite Evaluation: A Case Study in Foundry Melting Processes
- Hardware optimization on Android for inference of AI models
- Artificial Intelligence-Enabled Spirometry for Early Detection of Right Heart Failure
- Multi-task GINN-LP for Multi-target Symbolic Regression
- AdamX: An Adam improvement algorithm based on a novel exponential decay mechanism for the second-order moment estimate
- GREAT: Generalizable Representation Enhancement via Auxiliary Transformations for Zero-Shot Environmental Prediction
- Quantum Machine Learning via Contrastive Training
- Naga: Vedic Encoding for Deep State Space Models
- A Quantum Tensor Network-Based Viewpoint for Modeling and Analysis of Time Series Data
- Mitigating Spurious Correlations in Patch-wise Tumor Classification on High-Resolution Multimodal Images
- Fairness-Aware Graph Representation Learning with Limited Demographic Information
- Graph Out-of-Distribution Detection via Test-Time Calibration with Dual Dynamic Dictionaries
- RAC-DMVC: Reliability-Aware Contrastive Deep Multi-View Clustering under Multi-Source Noise
- P1: Mastering Physics Olympiads with Reinforcement Learning
- Batch Acquisition Function Evaluations and Decouple Optimizer Updates for Faster Bayesian Optimization
- Towards Multimodal Representation Learning in Paediatric Kidney Disease
- Data Value in the Age of Scaling: Understanding LLM Scaling Dynamics Under Real-Synthetic Data Mixtures
- FuseSampleAgg: Fused Neighbor Sampling and Aggregation for Mini-batch GNNs
- Weight-sparse transformers have interpretable circuits
- Tuning for Two Adversaries: Enhancing the Robustness Against Transfer and Query-Based Attacks using Hyperparameter Tuning
- Scientific Data Compression and Super-Resolution Sampling
- Cross-Learning from Scarce Data via Multi-Task Constrained Optimization
- Protein Secondary Structure Prediction Using 3D Graphs and Relation-Aware Message Passing Transformers
- Efficient Calibration for Decision Making
- Learning stochasticity: a nonparametric framework for intrinsic noise estimation
- ST-ProC: A Graph-Prototypical Framework for Robust Semi-Supervised Travel Mode Identification
- Rare Genomic Subtype Discovery from RNA-seq via Autoencoder Embeddings and Stability-Aware Clustering
- From Black Box to Insight: Explainable AI for Extreme Event Preparedness
- Limitations of Quantum Advantage in Unsupervised Machine Learning
- LLM Architecture, Scaling Laws, and Economics: A Quick Summary
- Social and Physical Attributes-Defined Trust Evaluation for Effective Collaborator Selection in Human-Device Coexistence Systems
- Mind the Gap: Revealing Inconsistencies Across Heterogeneous AI Accelerators
- Quantifying Skill and Chance: A Unified Framework for the Geometry of Games
- Physics-Informed Neural Network-based Reliability Analysis of Buried Pipelines
- Lightweight Hopfield Neural Networks for Bioacoustic Detection and Call Monitoring of Captive Primates
- Hierarchical Federated Graph Attention Networks for Scalable and Resilient UAV Collision Avoidance
- Characterizing and Understanding Energy Footprint and Efficiency of Small Language Model on Edges
- Omics-scale polymer computational database transferable to real-world artificial intelligence applications
- Tactile Data Recording System for Clothing with Motion-Controlled Robotic Sliding
- Exploring Parallelism in FPGA-Based Accelerators for Machine Learning Applications
- The Environmental Impact of Ensemble Techniques in Recommender Systems
- GroupRank: A Groupwise Reranking Paradigm Driven by Reinforcement Learning
- A Structure-Agnostic Co-Tuning Framework for LLMs and SLMs in Cloud-Edge Systems
- Generalized Inequality-based Approach for Probabilistic WCET Estimation
- Value-Aligned Prompt Moderation via Zero-Shot Agentic Rewriting for Safe Image Generation
- Harli: Harvest Underutilized Resources in LLM Serving with Finetuning Tasks
- Noise-Aware Optimization in Nominally Identical Manufacturing and Measuring Systems for High-Throughput Parallel Workflows
- Socrates-Mol: Self-Oriented Cognitive Reasoning through Autonomous Trial-and-Error with Empirical-Bayesian Screening for Molecules
- Learning to Refine: An Agentic RL Approach for Iterative SPARQL Query Construction
- On the Measure of a Model: From Intelligence to Generality
- Towards Mitigating Systematics in Large-Scale Surveys via Few-Shot Optimal Transport-Based Feature Alignment
- FreDN: Spectral Disentanglement for Time Series Forecasting via Learnable Frequency Decomposition
- A Computational Method for Solving the Stochastic Joint Replenishment Problem in High Dimensions
- TopoPerception: A Shortcut-Free Evaluation of Global Visual Perception in Large Vision-Language Models
- MP-GFormer: A 3D-Geometry-Aware Dynamic Graph Transformer Approach for Machining Process Planning
- Modeling X-ray photon pile-up with a normalizing flow
- ClinStructor: AI-Powered Structuring of Unstructured Clinical Texts
- Forgetting-MarI: LLM Unlearning via Marginal Information Regularization
- Additive Large Language Models for Semi-Structured Text
- PCA recovery thresholds in low-rank matrix inference with sparse noise
- Enhancing XR Auditory Realism via Multimodal Scene-Aware Acoustic Rendering
- InData: Towards Secure Multi-Step, Tool-Based Data Analysis
- A Deep Learning Framework for Thyroid Nodule Segmentation and Malignancy Classification from Ultrasound Images
- Improving Neutrino Oscillation Measurements through Event Classification
- Augmenting The Weather: A Hybrid Counterfactual-SMOTE Algorithm for Improving Crop Growth Prediction When Climate Changes
- Improving LLM's Attachment to External Knowledge In Dialogue Generation Tasks Through Entity Anonymization
- Temporal Micro-Doppler Spectrogram-based ViT Multiclass Target Classification
- On the Entropy Calibration of Language Models
- Bayesian--AI Fusion for Epidemiological Decision Making: Calibrated Risk, Honest Uncertainty, and Hyperparameter Intelligence
- Goal-Oriented Multi-Agent Reinforcement Learning for Decentralized Agent Teams
- Dynamic Parameter Optimization for Highly Transferable Transformation-Based Attacks
- Uncertainty-Guided Selective Adaptation Enables Cross-Platform Predictive Fluorescence Microscopy
- Adaptive Diagnostic Reasoning Framework for Pathology with Multimodal Large Language Models
- Enhancing Road Safety Through Multi-Camera Image Segmentation with Post-Encroachment Time Analysis
- Calibrated Multimodal Representation Learning with Missing Modalities
- Preference Learning from Physics-Based Feedback: Tuning Language Models to Design BCC/B2 Superalloys
- BackWeak: Backdooring Knowledge Distillation Simply with Weak Triggers and Fine-tuning
- Aggregating Conformal Prediction Sets via {\alpha}-Allocation
- Informed Bootstrap Augmentation Improves EEG Decoding
- From Scaling to Structured Expressivity: Rethinking Transformers for CTR Prediction
- Explainable Transformer-Based Email Phishing Classification with Adversarial Robustness
- Decoupled Action Head: Confining Task Knowledge to Conditioning Layers
- TEMPO: Global Temporal Building Density and Height Estimation from Satellite Imagery
- Codebook-Centric Deep Hashing: End-to-End Joint Learning of Semantic Hash Centers and Neural Hash Function
- Rapid Machine Learning-Driven Detection of Pesticides and Dyes Using Raman Spectroscopy
- MixAR: Mixture Autoregressive Image Generation
- Chemistry-Enhanced Diffusion-Based Framework for Small-to-Large Molecular Conformation Generation
- Suppressing VLM Hallucinations with Spectral Representation Filtering
- Consistency Is the Key: Detecting Hallucinations in LLM Generated Text By Checking Inconsistencies About Key Facts
- Reinforcement Learning for Chemical Ordering in Alloy Nanoparticles
- PCA++: How Uniformity Induces Robustness to Background Noise in Contrastive Learning
- D$^{3}$ToM: Decider-Guided Dynamic Token Merging for Accelerating Diffusion MLLMs
- Cmprsr: Abstractive Token-Level Question-Agnostic Prompt Compressor
- Learning Time in Static Classifiers
- Linear time small coresets for k-mean clustering of segments with applications
- Enhancing Machine Learning Model Efficiency through Quantization and Bit Depth Optimization: A Performance Analysis on Healthcare Data
- LMM-IR: Large-Scale Netlist-Aware Multimodal Framework for Static IR-Drop Prediction
- Symmetry-Aware Graph Metanetwork Autoencoders: Model Merging through Parameter Canonicalization
- PID-controlled Langevin Dynamics for Faster Sampling of Generative Models
- FedTopo: Topology-Informed Representation Alignment in Federated Learning under Non-I.I.D. Conditions
- NFQ2.0: The CartPole Benchmark Revisited
- Sample Complexity of Agnostic Multiclass Classification: Natarajan Dimension Strikes Back
- FLClear: Visually Verifiable Multi-Client Watermarking for Federated Learning
- Attention-Enhanced Convolutional Autoencoder and Structured Delay Embeddings for Weather Prediction
- A Closer Look at Personalized Fine-Tuning in Heterogeneous Federated Learning
- Beyond Fixed Tasks: Unsupervised Environment Design for Task-Level Pairs
- Adaptive Graph Rewiring to Mitigate Over-Squashing in Mesh-Based GNNs for Fluid Dynamics Simulations
- Oxytrees: Model Trees for Bipartite Learning
- On Robustness of Linear Classifiers to Targeted Data Poisoning
- LAYA: Layer-wise Attention Aggregation for Interpretable Depth-Aware Neural Networks
- Convolutional Model Trees
- Stabilizing Self-Consuming Diffusion Models with Latent Space Filtering
- DIVIDE: A Framework for Learning from Independent Multi-Mechanism Data Using Deep Encoders and Gaussian Processes
- Are LLMs The Way Forward? A Case Study on LLM-Guided Reinforcement Learning for Decentralized Autonomous Driving
- Conformal Online Learning of Deep Koopman Linear Embeddings
- INC: An Indirect Neural Corrector for Auto-Regressive Hybrid PDE Solvers
- MolEdit: Knowledge Editing for Multimodal Molecule Language Models
- Scalable Multi-Objective and Meta Reinforcement Learning via Gradient Estimation
- Physics-Constrained Adaptive Neural Networks Enable Real-Time Semiconductor Manufacturing Optimization with Minimal Training Data
- Optimal Look-back Horizon for Time Series Forecasting in Federated Learning
- Genomic Next-Token Predictors are In-Context Learners
- The Alignment Game: A Theory of Long-Horizon Alignment Through Recursive Curation
- Expressive Temporal Specifications for Reward Monitoring
- Assessing Automated Fact-Checking for Medical LLM Responses with Knowledge Graphs
- Catastrophic Forgetting in Kolmogorov-Arnold Networks
- An Evaluation of Representation Learning Methods in Particle Physics Foundation Models
- Connectivity-Guided Sparsification of 2-FWL GNNs: Preserving Full Expressivity with Improved Efficiency
- RoS-Guard: Robust and Scalable Online Change Detection with Delay-Optimal Guarantees
- From Black-Box to White-Box: Control-Theoretic Neural Network Interpretability
- An approach of deep reinforcement learning for maximizing the net present value of stochastic projects
- On the Fundamental Limits of LLMs at Scale
- On the Information Processing of One-Dimensional Wasserstein Distances with Finite Samples
- Method of Manufactured Learning for Solver-free Training of Neural Operators
- Functional Mean Flow in Hilbert Space
- Contrastive Entropy Bounds for Density and Conditional Density Decomposition
- LinkedIn Profile Characteristics and Professional Success Indicators
- AIF: Asynchronous Inference Framework for Cost-Effective Pre-Ranking
- APT: Affine Prototype-Timestamp For Time Series Forecasting Under Distribution Shift
- A FEDformer-Based Hybrid Framework for Anomaly Detection and Risk Forecasting in Financial Time Series
- Global Cross-Time Attention Fusion for Enhanced Solar Flare Prediction from Multivariate Time Series
- RAGPulse: An Open-Source RAG Workload Trace to Optimize RAG Serving Systems
- Angular Gradient Sign Method: Uncovering Vulnerabilities in Hyperbolic Networks
- Learning Branching Policies for MILPs with Proximal Policy Optimization
- Are Graph Transformers Necessary? Efficient Long-Range Message Passing with Fractal Nodes in MPNNs
- The Good, The Bad, and The Hybrid: A Reward Structure Showdown in Reasoning Models Training
- The Final-Stage Bottleneck: A Systematic Dissection of the R-Learner for Network Causal Inference
- Learning Time-Scale Invariant Population-Level Neural Representations
- SLMQuant:Benchmarking Small Language Model Quantization for Practical Deployment
- One-Step Generative Policies with Q-Learning: A Reformulation of MeanFlow
- Bi-View Embedding Fusion: A Hybrid Learning Approach for Knowledge Graph's Nodes Classification Addressing Problems with Limited Data
- Generalization Bounds for Semi-supervised Matrix Completion with Distributional Side Information
- Learning from the Undesirable: Robust Adaptation of Language Models without Forgetting
- Self-Organization of Attractor Landscapes in High-Capacity Kernel Logistic Regression Hopfield Networks
- Latency and Ordering Effects in Online Decisions
- MACKO: Sparse Matrix-Vector Multiplication for Low Sparsity
- Self-Adaptive Graph Mixture of Models
- A Smart-Glasses for Emergency Medical Services via Multimodal Multitask Learning
- Real-time prediction of breast cancer sites using deformation-aware graph neural network
- Transformer-Based Scalable Multi-Agent Reinforcement Learning for Networked Systems with Long-Range Interactions
- Synthetic Forgetting without Access: A Few-shot Zero-glance Framework for Machine Unlearning
- Departures: Distributional Transport for Single-Cell Perturbation Prediction with Neural Schr\"odinger Bridges
- Soft Conflict-Resolution Decision Transformer for Offline Multi-Task Reinforcement Learning
- Personalized Federated Learning with Bidirectional Communication Compression via One-Bit Random Sketching
- OTARo: Once Tuning for All Precisions toward Robust On-Device LLMs
- Warm-starting active-set solvers using graph neural networks
- Real-time distortion prediction in metallic additive manufacturing via a physics-informed neural operator approach
- Uncertainty-aware Physics-informed Neural Networks for Robust CARS-to-Raman Signal Reconstruction
- DiffFP: Learning Behaviors from Scratch via Diffusion-based Fictitious Play
- ParaDySe: A Parallel-Strategy Switching Framework for Dynamic Sequence Lengths in Transformer
- TokenSqueeze: Performance-Preserving Compression for Reasoning LLMs
- Laplace Learning in Wasserstein Space
- MorphBoost: Self-Organizing Universal Gradient Boosting with Adaptive Tree Morphing
- Counterfactual Explainable AI (XAI) Method for Deep Learning-Based Multivariate Time Series Classification
- Computational Measurement of Political Positions: A Review of Text-Based Ideal Point Estimation Algorithms
- Incoherent Beliefs & Inconsistent Actions in Large Language Models
- Uncovering and Mitigating Transient Blindness in Multimodal Model Editing
- Seek and You Shall Fold
- Edge-aware baselines for ogbn-proteins in PyTorch Geometric: species-wise normalization, post-hoc calibration, and cost-accuracy trade-offs
- KForge: Program Synthesis for Diverse AI Hardware Accelerators
- Explainable RL Policies by Distilling to Locally-Specialized Linear Policies with Voronoi State Partitioning
- Tab-PET: Graph-Based Positional Encodings for Tabular Transformers
- Statistically Accurate and Robust Generative Prediction of Rock Discontinuities with A Tabular Foundation Model
- Dual-LoRA and Quality-Enhanced Pseudo Replay for Multimodal Continual Food Learning
- A Novel Hierarchical Integration Method for Efficient Model Merging in Medical LLMs
- Federated Learning for Pediatric Pneumonia Detection: Enabling Collaborative Diagnosis Without Sharing Patient Data
- Multiscale Grassmann Manifolds for Single-Cell Data Analysis
- Fast 3D Surrogate Modeling for Data Center Thermal Management
- Optimizing Input of Denoising Score Matching is Biased Towards Higher Score Norm
- Physics-Informed Neural ODEs with Scale-Aware Residuals for Learning Stiff Biophysical Dynamics
- KAN/H: Kolmogorov-Arnold Network using Haar-like bases
- DK-Root: A Joint Data-and-Knowledge-Driven Framework for Root Cause Analysis of QoE Degradations in Mobile Networks
- Uncertainty Makes It Stable: Curiosity-Driven Quantized Mixture-of-Experts
- Diffusion Models: A Mathematical Introduction
- IDOL: Meeting Diverse Distribution Shifts with Prior Physics for Tropical Cyclone Multi-Task Estimation
- Improving a Hybrid Graphsage Deep Network for Automatic Multi-objective Logistics Management in Supply Chain
- Sumudu Neural Operator for ODEs and PDEs
- Learning Fair Representations with Kolmogorov-Arnold Networks
- CATCHFed: Efficient Unlabeled Data Utilization for Semi-Supervised Federated Learning in Limited Labels Environments
- Coordinate Descent for Network Linearization
- Simplicial covering dimension of extremal concept classes
- Conformal Constrained Policy Optimization for Cost-Effective LLM Agents
- Volatility in Certainty (VC): A Metric for Detecting Adversarial Perturbations During Inference in Neural Network Classifiers
- On the Trade-Off Between Transparency and Security in Adversarial Machine Learning
- Leveraging Exogenous Signals for Hydrology Time Series Forecasting
- Transformers vs. Recurrent Models for Estimating Forest Gross Primary Production
- Better LLM Reasoning via Dual-Play
- FLEX: Feature Importance from Layered Counterfactual Explanations
- Chain-of-Generation: Progressive Latent Diffusion for Text-Guided Molecular Design
- Robust Bidirectional Associative Memory via Regularization Inspired by the Subspace Rotation Algorithm
- A Systematic Study of Model Extraction Attacks on Graph Foundation Models
- Batch Matrix-form Equations and Implementation of Multilayer Perceptrons
- Beyond the Laplacian: Interpolated Spectral Augmentation for Graph Neural Networks
- A Systematic Analysis of Out-of-Distribution Detection Under Representation and Training Paradigm Shifts
- SurvBench: A Standardised Preprocessing Pipeline for Multi-Modal Electronic Health Record Survival Analysis
- Learning the relative composition of EEG signals using pairwise relative shift pretraining
- Computation-aware Energy-harvesting Federated Learning: Cyclic Scheduling with Selective Participation
- Quantile Q-Learning: Revisiting Offline Extreme Q-Learning with Quantile Regression
- ReCast: Reliability-aware Codebook Assisted Lightweight Time Series Forecasting
- Selecting Fine-Tuning Examples by Quizzing VLMs
- EARL: Entropy-Aware RL Alignment of LLMs for Reliable RTL Code Generation
- Mesh-based Super-resolution of Detonation Flows with Multiscale Graph Transformers
- Improving Graph Embeddings in Machine Learning Using Knowledge Completion with Validation in a Case Study on COVID-19 Spread
- Treatment Stitching with Schr\"odinger Bridge for Enhancing Offline Reinforcement Learning in Adaptive Treatment Strategies
- SenseRay-3D: Generalizable and Physics-Informed Framework for End-to-End Indoor Propagation Modeling
- To Align or Not to Align: Strategic Multimodal Representation Alignment for Optimal Performance
- Dynamic Anomaly Identification in Accounting Transactions via Multi-Head Self-Attention Networks
- HCPO: Hierarchical Conductor-Based Policy Optimization in Multi-Agent Reinforcement Learning
- FairGSE: Fairness-Aware Graph Neural Network without High False Positive Rates
- Fusion-ResNet: A Lightweight multi-label NILM Model Using PCA-ICA Feature Fusion
- Variation-Bounded Loss for Noise-Tolerant Learning
- Finding Time Series Anomalies using Granular-ball Vector Data Description
- Open Banking Foundational Model: Learning Language Representations from Few Financial Transactions
- Rethinking Deep Alignment Through The Lens Of Incomplete Learning
- Data-Efficient Self-Supervised Algorithms for Fine-Grained Birdsong Analysis
- FGM optimization in complex domains using Gaussian process regression based profile generation algorithm
- TSGDiff: Rethinking Synthetic Time Series Generation from a Pure Graph Perspective
- Understanding InfoNCE: Transition Probability Matrix Induced Feature Clustering
- Scaling Law Analysis in Federated Learning: How to Select the Optimal Model Size?
- Evaluation of Multi- and Single-objective Learning Algorithms for Imbalanced Data
- MPD-SGR: Robust Spiking Neural Networks with Membrane Potential Distribution-Driven Surrogate Gradient Regularization
- AlignTree: Efficient Defense Against LLM Jailbreak Attacks
- Chicken Swarm Kernel Particle Filter: A Structured Rejuvenation Approach with KLD-Efficient Sampling
- SCI: An Equilibrium for Signal Intelligence
- Cross-view Joint Learning for Mixed-Missing Multi-view Unsupervised Feature Selection
- Calibrated Adversarial Sampling: Multi-Armed Bandit-Guided Generalization Against Unforeseen Attacks
- MMSense: Adapting Vision-based Foundation Model for Multi-task Multi-modal Wireless Sensing
- Optimal Self-Consistency for Efficient Reasoning with Large Language Models
- Active Learning of Symbolic Automata Over Rational Numbers
- BlinDNO: A Distributional Neural Operator for Dynamical System Reconstruction from Time-Label-Free data
- LILogic Net: Compact Logic Gate Networks with Learnable Connectivity for Efficient Hardware Deployment
- Dynamic Reward Scaling for Multivariate Time Series Anomaly Detection: A VAE-Enhanced Reinforcement Learning Approach
- BitSnap: Checkpoint Sparsification and Quantization in LLM Training
- CEDL: Centre-Enhanced Discriminative Learning for Anomaly Detection
- On the Dimension-Free Approximation of Deep Neural Networks for Symmetric Korobov Functions
- Interpretable Fine-Gray Deep Survival Model for Competing Risks: Predicting Post-Discharge Foot Complications for Diabetic Patients in Ontario
- The 'Sure' Trap: Multi-Scale Poisoning Analysis of Stealthy Compliance-Only Backdoors in Fine-Tuned Large Language Models
- Integrating Neural Differential Forecasting with Safe Reinforcement Learning for Blood Glucose Regulation
- Tailored Primitive Initialization is the Secret Key to Reinforcement Learning
- VISAGNN: Versatile Staleness-Aware Efficient Training on Large-Scale Graphs
- Global-Lens Transformers: Adaptive Token Mixing for Dynamic Link Prediction
- Personality-guided Public-Private Domain Disentangled Hypergraph-Former Network for Multimodal Depression Detection
- Redundancy-optimized Multi-head Attention Networks for Multi-View Multi-Label Feature Selection
- Logarithmic Regret and Polynomial Scaling in Online Multi-step-ahead Prediction
- Diffusion Model Based Signal Recovery Under 1-Bit Quantization
- SculptDrug : A Spatial Condition-Aware Bayesian Flow Model for Structure-based Drug Design
- Uncover and Unlearn Nuisances: Agnostic Fully Test-Time Adaptation
- Towards Better IncomLDL: We Are Unaware of Hidden Labels in Advance
- BSO: Binary Spiking Online Optimization Algorithm
- Hierarchical Frequency-Decomposition Graph Neural Networks for Road Network Representation Learning
- Spectral Bias Mitigation via xLSTM-PINN: Memory-Gated Representation Refinement for Physics-Informed Learning
- Regret Guarantees for Linear Contextual Stochastic Shortest Path
- Center-Outward q-Dominance: A Sample-Computable Proxy for Strong Stochastic Dominance in Multi-Objective Optimisation
- CAO: Curvature-Adaptive Optimization via Periodic Low-Rank Hessian Sketching
- Training Instabilities Induce Flatness Bias in Gradient Descent
- Softmax as a Lagrangian-Legendrian Seam
- LLM on a Budget: Active Knowledge Distillation for Efficient Classification of Large Text Corpora
- Detecting Statistically Significant Fairness Violations in Recidivism Forecasting Algorithms
- DAOpt: Modeling and Evaluation of Data-Driven Optimization under Uncertainty with LLMs
- Decoupling Positional and Symbolic Attention Behavior in Transformers
- The Anatomy of a Triton Attention Kernel
- Parallel and Multi-Stage Knowledge Graph Retrieval for Behaviorally Aligned Financial Asset Recommendations
- Output Supervision Can Obfuscate the Chain of Thought
- Parameter-Efficient and Personalized Federated Training of Generative Models at the Edge
- WildfireGenome: Interpretable Machine Learning Reveals Local Drivers of Wildfire Risk and Their Cross-County Variation
- Mind Your Entropy: From Maximum Entropy to Trajectory Entropy-Constrained RL
- Sound Logical Explanations for Mean Aggregation Graph Neural Networks
- Loss Given Default Prediction Under Measurement-Induced Mixture Distributions: An Information-Theoretic Approach
- Aspiration-based Perturbed Learning Automata in Games with Noisy Utility Measurements. Part A: Stochastic Stability in Non-zero-Sum Games
- Enhancing failure prediction in nuclear industry: Hybridization of knowledge- and data-driven techniques
- Clustering-Based Weight Orthogonalization for Stabilizing Deep Reinforcement Learning
- Small Vocabularies, Big Gains: Pretraining and Tokenization in Time Series Models
- Early GVHD Prediction in Liver Transplantation via Multi-Modal Deep Learning on Imbalanced EHR Data
- MedFedPure: A Medical Federated Framework with MAE-based Detection and Diffusion Purification for Inference-Time Attacks
- SA-EMO: Structure-Aligned Encoder Mixture of Operators for Generalizable Full-waveform Inversion
- Global Feature Enhancing and Fusion Framework for Strain Gauge Time Series Classification
- Predicting Grain Growth in Polycrystalline Materials Using Deep Learning Time Series Models
- Toward Better Generalization in Few-Shot Learning through the Meta-Component Combination
- An Explainable and Fair AI Tool for PCOS Risk Assessment: Calibration, Subgroup Equity, and Interactive Clinical Deployment
- Enhancing PINN Accuracy for the RLW Equation: Adaptive and Conservative Approaches
- EcoSpa: Efficient Transformer Training with Coupled Sparsity
- A Deep Learning Model to Predicting Changes in Consumer Attributes for New Line-extended Products
- Environment-Aware Transfer Reinforcement Learning for Sustainable Beam Selection
- Lightweight Time Series Data Valuation on Time Series Foundation Models via In-Context Finetuning
- Enhanced Water Leak Detection with Convolutional Neural Networks and One-Class Support Vector Machine
- Incomplete Depression Feature Selection with Missing EEG Channels
- How many stations are sufficient? Exploring the effect of urban weather station density reduction on imputation accuracy of air temperature and humidity
- Convergence of Multiagent Learning Systems for Traffic control
- On the Probabilistic Learnability of Compact Neural Network Preimage Bounds
- SpecQuant: Spectral Decomposition and Adaptive Truncation for Ultra-Low-Bit LLMs Quantization
- Clifford Algebraic Rotor Embeddings : Maybe embeddings should start to CARE
- Adaptive Stepsizing for Stochastic Gradient Langevin Dynamics in Bayesian Neural Networks
- Beyond Superficial Forgetting: Thorough Unlearning through Knowledge Density Estimation and Block Re-insertion
- Do traveling waves make good positional encodings?
- H-Model: Dynamic Neural Architectures for Adaptive Processing
- Evaluation of LLM-based Explanations for a Learning Analytics Dashboard
- Synergistic Feature Fusion for Latent Lyrical Classification: A Gated Deep Learning Architecture
- Beyond One-Way Pruning: Bidirectional Pruning-Regrowth for Extreme Accuracy-Sparsity Tradeoff
- Learning with Preserving for Continual Multitask Learning
- Homotopy-Guided Self-Supervised Learning of Parametric Solutions for AC Optimal Power Flow
- A neural optimization framework for free-boundary diffeomorphic mapping problems and its applications
- Probabilistic Wildfire Susceptibility from Remote Sensing Using Random Forests and SHAP
- MPCM-Net: Multi-scale network integrates partial attention convolution with Mamba for ground-based cloud image segmentation
- Stratified Knowledge-Density Super-Network for Scalable Vision Transformers
- A Bayesian Model for Multi-stage Censoring
- R-Tuning: Wavelet-Decomposed Replay and Semantic Alignment for Continual Adaptation of Pretrained Time-Series Models
- Regularized Schr\"odinger: Alleviating Distortion and Exposure Bias in Solving Inverse Problems
- Hierarchical Schedule Optimization for Fast and Robust Diffusion Model Sampling
- Doubly Debiased Test-Time Prompt Tuning for Vision-Language Models
- Beyond saliency: enhancing explanation of speech emotion recognition with expert-referenced acoustic cues
- AnchorDS: Anchoring Dynamic Sources for Semantically Consistent Text-to-3D Generation
- Toward Dignity-Aware AI: Next-Generation Elderly Monitoring from Fall Detection to ADL
- Benchmarking GNNs for OOD Materials Property Prediction with Uncertainty Quantification
- Moirai 2.0: When Less Is More for Time Series Forecasting
- Tighter Truncated Rectangular Prism Approximation for RNN Robustness Verification
- Bayesian Neural Networks with Monte Carlo Dropout for Probabilistic Electricity Price Forecasting
- Enhancing Reinforcement Learning in 3D Environments through Semantic Segmentation: A Case Study in ViZDoom
- Simple Vision-Language Math Reasoning via Rendered Text
- Multimodal ML: Quantifying the Improvement of Calorie Estimation Through Image-Text Pairs
- Context-Aware Multimodal Representation Learning for Spatio-Temporally Explicit Environmental modelling
- FSC-Net: Fast-Slow Consolidation Networks for Continual Learning
- Which Sparse Autoencoder Features Are Real? Model-X Knockoffs for False Discovery Rate Control
- Reasoning: From Reflection to Solution
Research Sources: 877 | Generated: 11/18/2025
