AI Research News Feeds for August 28th, 2025

AI RESEARCH PAPERS & ACADEMIC SOURCES

MTS-Net: Dual-Enhanced Positional Multi-Head Self-Attention for 3D CT Diagnosis of May-Thurner Syndrome
Analysis and Synthesis Denoisers for Forward-Backward Plug-and-Play Algorithms
DVM-SLAM: Decentralized Visual Monocular Simultaneous Localization and Mapping for Multi-Agent Systems
TAGS: 3D Tumor-Adaptive Guidance for SAM
Personalized MR-Informed Diffusion Models for 3D PET Image Reconstruction
Deep Learning in Mild Cognitive Impairment Diagnosis using Eye Movements and Image Content in Visual Memory Tasks
Reduced-Order Modeling of Cyclo-Stationary Time Series Using Score-Based Generative Methods
Weighted Levenberg-Marquardt methods for fitting multichannel nuclear cross section data
Eigenvalue distribution of the Neural Tangent Kernel in the quadratic scaling
Neural Conditional Simulation for Complex Spatial Processes
Scalable Bayesian Structure Learning for Gaussian Graphical Models Using Marginal Pseudo-likelihood
The Bayesian Context Trees State Space Model for time series modelling and forecasting
AutoQ-VIS: Improving Unsupervised Video Instance Segmentation via Automatic Quality Assessment
Image Quality Assessment for Machines: Paradigm, Large-scale Database, and Models
Ego-centric Predictive Model Conditioned on Hand Trajectories
Self-supervised structured object representation learning
PersonaAnimator: Personalized Motion Transfer from Unconstrained Videos
Hyperspectral Sensors and Autonomous Driving: Technologies, Limitations, and Opportunities
Streamlining the Development of Active Learning Methods in Real-World Object Detection
Integrating SAM Supervision for 3D Weakly Supervised Point Cloud Segmentation
Reimagining Image Segmentation using Active Contour: From Chan Vese Algorithm into a Proposal Novel Functional Loss Framework
Assessing the Geolocation Capabilities, Limitations and Societal Risks of Generative Vision-Language Models
GS: Generative Segmentation via Label Diffusion
Segmentation Assisted Incremental Test Time Adaptation in an Open World
OpenM3D: Open Vocabulary Multi-view Indoor 3D Object Detection without Human Annotations
PAUL: Uncertainty-Guided Partition and Augmentation for Robust Cross-View Geo-Localization under Noisy Correspondence
Seam360GS: Seamless 360{\deg} Gaussian Splatting from Real-World Omnidirectional Images
AudioStory: Generating Long-Form Narrative Audio with Large Language Models
Bridging Domain Gaps for Fine-Grained Moth Classification Through Expert-Informed Adaptation and Foundation Model Priors
Saccade crossing avoidance as a visual search strategy
Modeling spectral filtering effects on color-matching functions: Implications for observer variability
A Technical Review on Comparison and Estimation of Steganographic Tools
Mind the Third Eye! Benchmarking Privacy Awareness in MLLM-powered Smartphone Agents
DATR: Diffusion-based 3D Apple Tree Reconstruction Framework with Sparse-View
Fast Texture Transfer for XR Avatars via Barycentric UV Conversion
Addressing Deepfake Issue in Selfie banking through camera based authentication
Context-Aware Risk Estimation in Home Environments: A Probabilistic Framework for Service Robots
Variational Bayes image restoration with compressive autoencoders
Latent space configuration for improved generalization in supervised autoencoder neural networks
REPARO: Compositional 3D Assets Generation with Differentiable 3D Layout Alignment
TraceNet: Segment one thing efficiently
Multiple Object Detection and Tracking in Panoramic Videos for Cycling Safety Analysis
DiffArtist: Towards Structure and Appearance Controllable Image Stylization
ReCLIP++: Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation
LV-CadeNet: A Long-View Feature Convolution-Attention Fusion Encoder-Decoder Network for EEG/MEG Spike Analysis
Online Writer Retrieval with Chinese Handwritten Phrases: A Synergistic Temporal-Frequency Representation Learning Approach
Solving Inverse Problems using Diffusion with Iterative Colored Renoising
Active Learning for Deep Learning-Based Hemodynamic Parameter Estimation
End-to-End Action Segmentation Transformer
Evaluating Text-to-Image and Text-to-Video Synthesis with a Conditional Fr\'{e}chet Distance
OPAL: Visibility-aware LiDAR-to-OpenStreetMap Place Recognition via Adaptive Radial Fusion
Cross-Modal Geometric Hierarchy Fusion: An Implicit-Submap Driven Framework for Resilient 3D Place Recognition
Pixel-Optimization-Free Patch Attack on Stereo Depth Estimation
LDRFusion: A LiDAR-Dominant multimodal refinement framework for 3D object detection
PRISM: A Framework Harnessing Unsupervised Visual Representations and Textual Prompts for Explainable MACE Survival Prediction from Cardiac Cine MRI
EffNetViTLoRA: An Efficient Hybrid Deep Learning Approach for Alzheimer's Disease Diagnosis
JVLGS: Joint Vision-Language Gas Leak Segmentation
Weed Detection in Challenging Field Conditions: A Semi-Supervised Framework for Overcoming Shadow Bias and Data Scarcity
MotionFlux: Efficient Text-Guided Motion Generation through Rectified Flow Matching and Preference Alignment
CVBench: Evaluating Cross-Video Synergies for Complex Multimodal Understanding and Reasoning
MonoRelief V2: Leveraging Real Data for High-Fidelity Monocular Relief Recovery
DNP-Guided Contrastive Reconstruction with a Reverse Distillation Transformer for Medical Anomaly Detection
High-Speed FHD Full-Color Video Computer-Generated Holography
Guiding Noisy Label Conditional Diffusion Models with Score-based Discriminator Correction
Generalizing Monocular 3D Object Detection
Quantization Robustness to Input Degradations for Object Detection
Controllable Skin Synthesis via Lesion-Focused Vector Autoregression Model
UTAL-GNN: Unsupervised Temporal Action Localization using Graph Neural Networks
IDF: Iterative Dynamic Filtering Networks for Generalizable Image Denoising
Video-LevelGauge: Investigating Contextual Positional Bias in Large Video Language Models
Scalable Object Detection in the Car Interior With Vision Foundation Models
Self-Rewarding Vision-Language Model via Reasoning Decomposition
Hardware-aware vs. Hardware-agnostic Energy Estimation for SNN in Space Applications
A Frequency-Aware Self-Supervised Learning for Ultra-Wide-Field Image Enhancement
SAT: Supervisor Regularization and Animation Augmentation for Two-process Monocular Texture 3D Human Reconstruction
Synthetic Image Detection via Spectral Gaps of QC-RBIM Nishimori Bethe-Hessian Operators
LabelGS: Label-Aware 3D Gaussian Splatting for 3D Scene Segmentation
FreeVPS: Repurposing Training-Free SAM2 for Generalizable Video Polyp Segmentation
Improving Generalization in Deepfake Detection with Face Foundation Models and Metric Learning
POEv2: a flexible and robust framework for generic line segment detection and wireframe line segment detection
SPLF-SAM: Self-Prompting Segment Anything Model for Light Field Salient Object Detection
FastAvatar: Towards Unified Fast High-Fidelity 3D Avatar Reconstruction with Large Gaussian Reconstruction Transformers
BuzzSet v1.0: A Dataset for Pollinator Detection in Field Conditions
AIM: Adaptive Intra-Network Modulation for Balanced Multimodal Learning
The Return of Structural Handwritten Mathematical Expression Recognition
MAPo : Motion-Aware Partitioning of Deformable 3D Gaussian Splatting for High-Fidelity Dynamic Scene Reconstruction
StableIntrinsic: Detail-preserving One-step Diffusion Model for Multi-view Material Estimation
Not Every Gift Comes in Gold Paper or with a Red Ribbon: Exploring Color Perception in Text-to-Image Models
FusionSort: Enhanced Cluttered Waste Segmentation with Advanced Decoding and Comprehensive Modality Optimization
Context-aware Sparse Spatiotemporal Learning for Event-based Vision
News is More than a Collection of Facts: Moral Frame Preserving News Summarization
ICL CIPHERS: Quantifying "Learning" in In-Context Learning via Substitution Ciphers
Hydra: Structured Cross-Source Enhanced Large Language Model Reasoning
Refining Czech GEC: Insights from a Multi-Experiment Approach
PhoniTale: Phonologically Grounded Mnemonic Generation for Typologically Distant Language Pairs
Doc2Chart: Intent-Driven Zero-Shot Chart Generation from Documents
Reducing Biases towards Minoritized Populations in Medical Curricular Content via Artificial Intelligence for Fairer Health Outcomes
Unifying the Extremes: Developing a Unified Model for Detecting and Predicting Extremist Traits and Radicalization
Know "No" Better: A Data-Driven Approach for Enhancing Negation Awareness in CLIP
Do Vision Encoders Truly Explain Object Hallucination?: Mitigating Object Hallucination via Simple Fine-Grained CLIPScore
Exploring Typographic Visual Prompts Injection Threats in Cross-Modality Generation Models
AgentCoMa: A Compositional Benchmark Mixing Commonsense and Mathematical Reasoning in Real-World Scenarios
ReSURE: Regularizing Supervision Unreliability for Multi-turn Dialogue Fine-tuning
Selective Retrieval-Augmentation for Long-Tail Legal Text Classification
Forewarned is Forearmed: Pre-Synthesizing Jailbreak-like Instructions to Enhance LLM Safety Guardrail to Potential Attacks
AraHealthQA 2025 Shared Task Description Paper
Capabilities of GPT-5 across critical domains: Is it the next breakthrough?
Beat-Based Rhythm Quantization of MIDI Performances
Geopolitical Parallax: Beyond Walter Lippmann Just After Large Language Models
Functional Consistency of LLM Code Embeddings: A Self-Evolving Data Synthesis Framework for Benchmarking
Word Chain Generators for Prefix Normal Words
KRETA: A Benchmark for Korean Reading and Reasoning in Text-Rich VQA Attuned to Diverse Visual Contexts
Disabling Self-Correction in Retrieval-Augmented Generation via Stealthy Retriever Poisoning
Cross-lingual Offensive Language Detection: A Systematic Review of Datasets, Transfer Approaches and Challenges
NPHardEval4V: Dynamic Evaluation of Large Vision-Language Models with Effects of Vision
FiRST: Finetuning Router-Selective Transformers for Input-Adaptive Latency Reduction
Safeguard Fine-Tuned LLMs Through Pre- and Post-Tuning Model Merging
Agent-as-Judge for Factual Summarization of Long Narratives
Chain-of-Reasoning: Towards Unified Mathematical Reasoning in Large Language Models via a Multi-Paradigm Perspective
MDEval: Evaluating and Enhancing Markdown Awareness in Large Language Models
Efficient Response Generation Strategy Selection for Fine-Tuning Large Language Models Through Self-Aligned Perplexity
KoWit-24: A Richly Annotated Dataset of Wordplay in News Headlines
RAGAPHENE: A RAG Annotation Platform with Human Enhancements and Edits
Leveraging Language Models and Machine Learning in Verbal Autopsy Analysis
Context-Adaptive Synthesis and Compression for Enhanced Retrieval-Augmented Generation in Complex Domains
Heterogeneous LLM Methods for Ontology Learning (Few-Shot Prompting, Ensemble Typing, and Attention-Based Taxonomies)
Rule Synergy Analysis using LLMs: State of the Art and Implications
Blockwise SFT for Diffusion Language Models: Reconciling Bidirectional Attention and Autoregressive Decoding
Alignment with Fill-In-the-Middle for Enhancing Code Generation
Emotion Transfer with Enhanced Prototype for Unseen Emotion Recognition in Conversation
ArgCMV: An Argument Summarization Benchmark for the LLM-era
Understanding and Leveraging the Expert Specialization of Context Faithfulness in Mixture-of-Experts LLMs
A Symbolic Adversarial Learning Framework for Evolving Fake News Generation and Detection
Automatic integration of SystemC in the FMI standard for Software-defined Vehicle design
Building Task Bots with Self-learning for Enhanced Adaptability, Extensibility, and Factuality
Continuously Steering LLMs Sensitivity to Contextual Knowledge with Proxy Models
CAM\~OES: A Comprehensive Automatic Speech Recognition Benchmark for European Portuguese
Spotlight Attention: Towards Efficient LLM Generation via Non-linear Hashing-based KV Cache Retrieval
Uncovering the Bigger Picture: Comprehensive Event Understanding Via Diverse News Retrieval
Principled Personas: Defining and Measuring the Intended Effects of Persona Prompting on Task Performance
T2R-bench: A Benchmark for Generating Article-Level Reports from Real World Industrial Tables
Memory-R1: Enhancing Large Language Model Agents to Manage and Utilize Memories via Reinforcement Learning
Scalable and consistent few-shot classification of survey responses using text embeddings
TokenVerse++: Towards Flexible Multitask Learning with Dynamic Task Activation
Beyond Shallow Heuristics: Leveraging Human Intuition for Curriculum Learning
Bangla-Bayanno: A 52K-Pair Bengali Visual Question Answering Dataset with LLM-Assisted Translation Refinement
Your AI Bosses Are Still Prejudiced: The Emergence of Stereotypes in LLM-Based Multi-Agent Systems
HEAL: A Hypothesis-Based Preference-Aware Analysis Framework
Online-Score-Aided Federated Learning: Taming the Resource Constraints in Wireless Networks
Enhancing Sample Efficiency and Exploration in Reinforcement Learning through the Integration of Diffusion Models and Proximal Policy Optimization
LLM-based feature generation from text for interpretable machine learning
Machine Learning for Asymptomatic Ratoon Stunting Disease Detection With Freely Available Satellite Based Multispectral Imaging
k-HyperEdge Medoids for Clustering Ensemble
PAC Learnability of Scenario Decision-Making Algorithms: Necessary Conditions and Sufficient Conditions
Training LLMs with MXFP4
Human locomotor control timescales depend on the environmental context and sensory input modality
NAPER: Fault Protection for Real-Time Resource-Constrained Deep Neural Networks
R-TPT: Improving Adversarial Robustness of Vision-Language Models through Test-Time Prompt Tuning
SubROC: AUC-Based Discovery of Exceptional Subgroup Performance for Binary Classifiers
Towards a Spatiotemporal Fusion Approach to Precipitation Nowcasting
Unfolding AlphaFold's Bayesian Roots in Probability Kinematics
Forecasting Multivariate Urban Data via Decomposition and Spatio-Temporal Graph Analysis
Computation- and Communication-Efficient Online FL for Resource-Constrained Aerial Vehicles
Small Batch Size Training for Language Models: When Vanilla SGD Works, and Why Gradient Accumulation Is Wasteful
Deep Learning of Semi-Competing Risk Data via a New Neural Expectation-Maximization Algorithm
Predicting the cardinality and maximum degree of a reduced Gr\"obner basis
To the Noise and Back: Diffusion for Shared Autonomy
From Optimization to Control: Quasi Policy Iteration
Bayes-Optimal Fair Classification with Linear Disparity Constraints via Pre-, In-, and Post-processing
A Statistical Framework of Watermarks for Large Language Models: Pivot, Detection Efficiency and Optimal Rules
Which Spaces can be Embedded in $L_p$-type Reproducing Kernel Banach Space? A Characterization via Metric Entropy
Robust Detection of Watermarks for Large Language Models Under Human Edits
On Domain-Adaptive Post-Training for Multimodal Large Language Models
GIMS: Image Matching System Based on Adaptive Graph Construction and Graph Neural Network
Benchmarking Diffusion Annealing-Based Bayesian Inverse Problem Solvers
TERL: Large-Scale Multi-Target Encirclement Using Transformer-Enhanced Reinforcement Learning
SuperBPE: Space Travel for Language Models
Graphical Transformation Models
Predicting Forced Responses of Probability Distributions via the Fluctuation-Dissipation Theorem and Generative Modeling
Decoding Dense Embeddings: Sparse Autoencoders for Interpreting and Discretizing Dense Retrieval
Multilevel neural simulation-based inference
mSTEB: Massively Multilingual Evaluation of LLMs on Speech and Text Tasks
Hierarchical Decentralized Stochastic Control for Cyber-Physical Systems
Escaping Stability-Plasticity Dilemma in Online Continual Learning for Motion Forecasting via Synergetic Memory Rehearsal
Delta-Audit: Explaining What Changes When Models Change
Encouraging Good Processes Without the Need for Good Answers: Reinforcement Learning for LLM Agent Planning
ALSA: Anchors in Logit Space for Out-of-Distribution Accuracy Estimation
SCAR: A Characterization Scheme for Multi-Modal Dataset
Exploration of Low-Power Flexible Stress Monitoring Classifiers for Conformal Wearables
$\mathcal{C}^1$-approximation with rational functions and rational neural networks
Metric spaces of walks and Lipschitz duality on graphs
Tune My Adam, Please!
InfraredGP: Efficient Graph Partitioning via Spectral Graph Neural Networks with Negative Corrections
Fast 3D Diffusion for Scalable Granular Media Synthesis
Interestingness First Classifiers
Symplectic convolutional neural networks
Physics-Informed DeepONet Coupled with FEM for Convective Transport in Porous Media with Sharp Gaussian Sources
Quantum latent distributions in deep generative models
Parameter-Free Structural-Diversity Message Passing for Graph Neural Networks
NM-Hebb: Coupling Local Hebbian Plasticity with Metric Learning for More Accurate and Interpretable CNNs
Adaptive Scaling of Policy Constraints for Offline Reinforcement Learning
GegenNet: Spectral Convolutional Neural Networks for Link Sign Prediction in Signed Bipartite Graphs
Ontology-Based Concept Distillation for Radiology Report Retrieval and Labeling
FlowletFormer: Network Behavioral Semantic Aware Pre-training Model for Traffic Classification
Constraint Learning in Multi-Agent Dynamic Games from Demonstrations of Local Nash Interactions
Global Permutation Entropy
Short-Horizon Predictive Maintenance of Industrial Pumps Using Time-Series Features and Machine Learning
Reducing Street Parking Search Time via Smart Assignment Strategies
Evaluating Language Model Reasoning about Confidential Information
Self-Supervised Pre-Training with Equilibrium Constraints
FairLoop: Software Support for Human-Centric Fairness in Predictive Business Process Monitoring
Using item recommendations and LLMs in marketing email titles
Pruning Strategies for Backdoor Defense in LLMs
Reinforcement Learning for Search Tree Size Minimization in Constraint Programming: New Results on Scheduling Benchmarks
Large VLM-based Stylized Sports Captioning
Aggregate Fictitious Play for Learning in Anonymous Polymatrix Games (Extended Version)
GENIE-ASI: Generative Instruction and Executable Code for Analog Subcircuit Identification
Is data-efficient learning feasible with quantum models?
Stack Trace-Based Crash Deduplication with Transformer Adaptation
MRExtrap: Longitudinal Aging of Brain MRIs using Linear Modeling in Latent Space
Towards 6G Intelligence: The Role of Generative AI in Future Wireless Networks
UNIFORM: Unifying Knowledge from Large-scale and Diverse Pre-trained Models
A Lightweight Crowd Model for Robot Social Navigation
Simple Stepsize for Quasi-Newton Methods with Global Convergence Guarantees
Inferring geometry and material properties from Mueller matrices with machine learning
Fractal Flow: Hierarchical and Interpretable Normalizing Flow via Topic Modeling and Recursive Strategy
Fourier Feature Networks for High-Fidelity Prediction of Perturbed Optical Fields
Benchmarking Hindi LLMs: A New Suite of Datasets and a Comparative Analysis
Conditional Normalizing Flow Surrogate for Monte Carlo Prediction of Radiative Properties in Nanoparticle-Embedded Layers
Multimodal Conditional MeshGAN for Personalized Aneurysm Growth Prediction
TrajFusionNet: Pedestrian Crossing Intention Prediction via Fusion of Sequential and Visual Trajectory Representations
Sky Background Building of Multi-objective Fiber spectra Based on Mutual Information Network
On-chip wave chaos for photonic extreme learning
Experimental End-to-End Optimization of Directly Modulated Laser-based IM/DD Transmission
11Plus-Bench: Demystifying Multimodal LLM Spatial Reasoning with Cognitive-Inspired Analysis
Discrete Diffusion VLA: Bringing Discrete Diffusion to Action Decoding in Vision-Language-Action Policies
Anomaly Detection in Networked Bandits
Conditional Wasserstein Distances with Applications in Bayesian OT Flow Matching
FraGNNet: A Deep Probabilistic Model for Tandem Mass Spectrum Prediction
MEraser: An Effective Fingerprint Erasure Approach for Large Language Models
Analyzing Character Representation in Media Content using Multimodal Foundation Model: Effectiveness and Trust
RoboTwin 2.0: A Scalable Data Generator and Benchmark with Strong Domain Randomization for Robust Bimanual Robotic Manipulation
DATABench: Evaluating Dataset Auditing in Deep Learning from an Adversarial Perspective
PyVision: Agentic Vision with Dynamic Tooling
Optimistic Exploration for Risk-Averse Constrained Reinforcement Learning
Scaling Decentralized Learning with FLock
MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning
Physics-Informed Regression: Parameter Estimation in Parameter-Linear Nonlinear Dynamic Models
Memorization in Graph Neural Networks
Efficient Multi-Source Knowledge Transfer by Model Merging
Graph Data Modeling: Molecules, Proteins, & Chemical Processes
Towards Quantum Machine Learning for Malicious Code Analysis
DETNO: A Diffusion-Enhanced Transformer Neural Operator for Long-Term Traffic Forecasting
Quantum-Classical Hybrid Molecular Autoencoder for Advancing Classical Decoding
Kolmogorov-Arnold Representation for Symplectic Learning: Advancing Hamiltonian Neural Networks
Differentiable multiphase flow model for physics-informed machine learning in reservoir pressure management
MS-ConTab: Multi-Scale Contrastive Learning of Mutation Signatures for Pan Cancer Representation and Stratification
Efficiently Generating Multidimensional Calorimeter Data with Tensor Decomposition Parameterization
On Surjectivity of Neural Networks: Can you elicit any behavior from your model?
The Sample Complexity of Membership Inference and Privacy Auditing
DeepAtlas: a tool for effective manifold learning
Distribution Shift Aware Neural Tabular Learning
MobText-SISA: Efficient Machine Unlearning for Mobility Logs with Spatio-Temporal and Natural-Language Data
Counterfactual Reward Model Training for Bias Mitigation in Multimodal Reinforcement Learning
Topological Uncertainty for Anomaly Detection in the Neural-network EoS Inference with Neutron Star Data
Safety Alignment Should Be Made More Than Just A Few Attention Heads
Attention is also needed for form design
NLKI: A lightweight Natural Language Knowledge Integration Framework for Improving Small VLMs in Commonsense VQA Tasks
A bag of tricks for real-time Mitotic Figure detection
Bootstrapping Learned Cost Models with Synthetic SQL Queries
ERSR: An Ellipse-constrained pseudo-label refinement and symmetric regularization framework for semi-supervised fetal head segmentation in ultrasound images
From Research to Reality: Feasibility of Gradient Inversion Attacks in Federated Learning
Gradient Rectification for Robust Calibration under Distribution Shift
PSO-Merging: Merging Models Based on Particle Swarm Optimization
SoK: Large Language Model Copyright Auditing via Fingerprinting
Multispectral LiDAR data for extracting tree points in urban and suburban areas
Generative AI for Testing of Autonomous Driving Systems: A Survey
AI-Powered Detection of Inappropriate Language in Medical School Curricula
The Information Dynamics of Generative Diffusion
Logical Reasoning with Outcome Reward Models for Test-Time Scaling
The Next Layer: Augmenting Foundation Models with Structure-Preserving and Attention-Guided Learning for Local Patches to Global Context Awareness in Computational Pathology
WaveHiT-SR: Hierarchical Wavelet Network for Efficient Image Super-Resolution
Dhati+: Fine-tuned Large Language Models for Arabic Subjectivity Evaluation
GLSim: Detecting Object Hallucinations in LVLMs via Global-Local Similarity
Diffusion Language Models Know the Answer Before Decoding
MathBuddy: A Multimodal System for Affective Math Tutoring
Linear-Time Demonstration Selection for In-Context Learning via Gradient Estimation
Cross-Platform E-Commerce Product Categorization and Recategorization: A Multimodal Hierarchical Classification Approach
Decomposing Behavioral Phase Transitions in LLMs: Order Parameters for Emergent Misalignment
HPC Digital Twins for Evaluating Scheduling Policies, Incentive Structures and their Impact on Power and Cooling
Symphony: A Decentralized Multi-Agent Framework for Scalable Collective Intelligence
Large Language Models (LLMs) for Electronic Design Automation (EDA)
DeepScholar-Bench: A Live Benchmark and Automated Evaluation for Generative Research Synthesis
Patch Progression Masked Autoencoder with Fusion CNN Network for Classifying Evolution Between Two Pairs of 2D OCT Slices
Discrete-Guided Diffusion for Scalable and Safe Multi-Robot Motion Planning
CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning
From Evidence to Decision: Exploring Evaluative AI
Think Smart, Act SMARL! Analyzing Probabilistic Logic Shields for Multi-Agent Reinforcement Learning
AirRAG: Autonomous Strategic Planning and Reasoning Steer Retrieval Augmented Generation
Demonstrating specification gaming in reasoning models
Preference Elicitation for Multi-objective Combinatorial Optimization with Active Learning and Maximum Likelihood Estimation
Synthesizing High-Quality Programming Tasks with LLM-based Expert and Student Agents
Fitness Landscape of Large Language Model-Assisted Automated Algorithm Search
Approximate Lifted Model Construction
General agents contain world models
HoneyBee: A Scalable Modular Framework for Creating Multimodal Oncology Datasets with Foundational Embedding Models
TabSketchFM: Sketch-based Tabular Representation Learning for Data Discovery over Data Lakes
Generation of Geodesics with Actor-Critic Reinforcement Learning to Predict Midpoints
Training with Explanations Alone: A New Paradigm to Prevent Shortcut Learning
GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models
Understanding Fairness-Accuracy Trade-offs in Machine Learning Models: Does Promoting Fairness Undermine Performance?
X-Prompt: Towards Universal In-Context Image Generation in Auto-Regressive Vision Language Foundation Models
PromptKeeper: Safeguarding System Prompts for LLMs
Score-based Generative Diffusion Models for Social Recommendations
Statistical learning does not always entail knowledge
Efficient PINNs via Multi-Head Unimodular Regularization of the Solutions Space
An Empirical Risk Minimization Approach for Offline Inverse RL and Dynamic Discrete Choice Model
Constructing a Norm for Children's Scientific Drawing: Distribution Features Based on Semantic Similarity of Large Language Models
PGAD: Prototype-Guided Adaptive Distillation for Multi-Modal Learning in AD Diagnosis
Evaluating the Fitness of Ontologies for the Task of Question Generation
Pricing AI Model Accuracy
Multi-Type Context-Aware Conversational Recommender Systems via Mixture-of-Experts
Bidirectional Task-Motion Planning Based on Hierarchical Reinforcement Learning for Strategic Confrontation
Heat Diffusion Models -- Interpixel Attention Mechanism
X-Sim: Cross-Embodiment Learning via Real-to-Sim-to-Real
EnvInjection: Environmental Prompt Injection Attack to Multi-modal Web Agents
FaceEditTalker: Controllable Talking Head Generation with Facial Attribute Editing
BinConv: A Neural Architecture for Ordinal Encoding in Time-Series Forecasting
Pseudo-Simulation for Autonomous Driving
DreamActor-H1: High-Fidelity Human-Product Demonstration Video Generation via Motion-designed Diffusion Transformers
CoQuIR: A Comprehensive Benchmark for Code Quality-Aware Information Retrieval
Language Models Identify Ambiguities and Exploit Loopholes
Taming the Chaos: Coordinated Autoscaling for Heterogeneous and Disaggregated LLM Inference
Just Because You Can, Doesn't Mean You Should: LLMs for Data Fitting
Bi-LoRA: Efficient Sharpness-Aware Minimization for Fine-Tuning Large-Scale Models
FlowDet: Overcoming Perspective and Scale Challenges in Real-Time End-to-End Traffic Detection
Energy-Efficient Learning-Based Beamforming for ISAC-Enabled V2X Networks
Generative Models for Synthetic Data: Transforming Data Mining in the GenAI Era
Multimodal Prototype Alignment for Semi-supervised Pathology Image Segmentation
Interact-Custom: Customized Human Object Interaction Image Generation
Towards a Holistic and Automated Evaluation Framework for Multi-Level Comprehension of LLMs in Book-Length Contexts
Towards stable AI systems for Evaluating Arabic Pronunciations
Hallucinating with AI: AI Psychosis as Distributed Delusions
Complementary Learning System Empowers Online Continual Learning of Vehicle Motion Forecasting in Smart Cities
CompLex: Music Theory Lexicon Constructed by Autonomous Agents for Automatic Music Generation
IELDG: Suppressing Domain-Specific Noise with Inverse Evolution Layers for Domain Generalized Semantic Segmentation
FinCast: A Foundation Model for Financial Time-Series Forecasting
LFD: Layer Fused Decoding to Exploit External Knowledge in Retrieval-Augmented Generation
A Scenario-Oriented Survey of Federated Recommender Systems: Techniques, Challenges, and Future Directions
Towards Instance-wise Personalized Federated Learning via Semi-Implicit Bayesian Prompt Tuning
Training for Obsolescence? The AI-Driven Education Trap
Divide, Weight, and Route: Difficulty-Aware Optimization with Dynamic Expert Fusion for Long-tailed Recognition
Invited Paper: Feature-to-Classifier Co-Design for Mixed-Signal Smart Flexible Wearables for Healthcare at the Extreme Edge
Beyond BEV: Optimizing Point-Level Tokens for Collaborative Perception
Intellectual Property in Graph-Based Machine Learning as a Service: Attacks and Defenses
Arbitrary Precision Printed Ternary Neural Networks with Holistic Evolutionary Approximation
Survey of Specialized Large Language Model
Efficient Model-Based Purification Against Adversarial Attacks for LiDAR Segmentation
Stand on The Shoulders of Giants: Building JailExpert from Previous Attack Experience
Object Detection with Multimodal Large Vision-Language Models: An In-depth Review
DemoBias: An Empirical Study to Trace Demographic Biases in Vision Foundation Models
CellINR: Implicitly Overcoming Photo-induced Artifacts in 4D Live Fluorescence Microscopy
2D Ultrasound Elasticity Imaging of Abdominal Aortic Aneurysms Using Deep Neural Networks
Epistemic Trade-Off: An Analysis of the Operational Breakdown and Ontological Limits of "Certainty-Scope" in AI
Geo2Vec: Shape- and Distance-Aware Neural Representation of Geospatial Entities
Advancements in Crop Analysis through Deep Learning and Explainable AI
Sistema de Reconocimiento Facial Federado en Conjuntos Abiertos basado en OpenMax
Are Companies Taking AI Risks Seriously? A Systematic Analysis of Companies' AI Risk Disclosures in SEC 10-K forms
Automated classification of natural habitats using ground-level imagery
What Makes AI Applications Acceptable or Unacceptable? A Predictive Moral Framework
(DEMO) Deep Reinforcement Learning Based Resource Allocation in Distributed IoT Systems
MedVQA-TREE: A Multimodal Reasoning and Retrieval Framework for Sarcopenia Prediction
MIDAS: Multimodal Interactive Digital-human Synthesis via Real-time Autoregressive Video Generation
An Investigation on Group Query Hallucination Attacks
AT-CXR: Uncertainty-Aware Agentic Triage for Chest X-rays
Deep Data Hiding for ICAO-Compliant Face Images: A Survey
Quantum Entanglement as Super-Confounding: From Bell's Theorem to Robust Machine Learning
Re:Frame -- Retrieving Experience From Associative Memory
Reflective Agreement: Combining Self-Mixture of Agents with a Sequence Tagger for Robust Event Extraction
Atrial Fibrillation Prediction Using a Lightweight Temporal Convolutional and Selective State Space Architecture
LongReasonArena: A Long Reasoning Benchmark for Large Language Models
Grounding the Ungrounded: A Spectral-Graph Framework for Quantifying Hallucinations in multimodal LLMs
Inference of Human-derived Specifications of Object Placement via Demonstration
Database Entity Recognition with Data Augmentation and Deep Learning
Fine-Tuning Vision-Language Models for Neutrino Event Analysis in High-Energy Physics Experiments
One Joke to Rule them All? On the (Im)possibility of Generalizing Humor
Even Heads Fix Odd Errors: Mechanistic Discovery and Surgical Repair in Transformer Attention
A perishable ability? The future of writing in the face of generative artificial intelligence
Data-Augmented Few-Shot Neural Stencil Emulation for System Identification of Computer Models
"She was useful, but a bit too optimistic": Augmenting Design with Interactive Virtual Personas
Bridging Language Gaps: Enhancing Few-Shot Language Adaptation
Addressing Weak Authentication like RFID, NFC in EVs and EVCs using AI-powered Adaptive Authentication
Incentivized Lipschitz Bandits
Inference Gap in Domain Expertise and Machine Intelligence in Named Entity Recognition: Creation of and Insights from a Substance Use-related Dataset
SIExVulTS: Sensitive Information Exposure Vulnerability Detection System using Transformer Models and Static Analysis
Automatic Question & Answer Generation Using Generative Large Language Model (LLM)
Concurrent validity of computer-vision artificial intelligence player tracking software using broadcast footage
Improving Low-Resource Translation with Dictionary-Guided Fine-Tuning and RL: A Spanish-to-Wayuunaiki Study
Data-Efficient Symbolic Regression via Foundation Model Distillation
PoolFlip: A Multi-Agent Reinforcement Learning Security Environment for Cyber Defense
Sat2Flow: A Structure-Aware Diffusion Framework for Human Flow Generation from Satellite Imagery
Servant, Stalker, Predator: How An Honest, Helpful, And Harmless (3H) Agent Unlocks Adversarial Skills
Learning Game-Playing Agents with Generative Code Optimization
A Self-Supervised Mixture-of-Experts Framework for Multi-behavior Recommendation
Orchid: Orchestrating Context Across Creative Workflows with Generative AI
WEBEYETRACK: Scalable Eye-Tracking for the Browser via On-Device Few-Shot Personalization
Sycophancy as compositions of Atomic Psychometric Traits
Aleks: AI powered Multi Agent System for Autonomous Scientific Discovery via Data-Driven Approaches in Plant Science
Quantized but Deceptive? A Multi-Dimensional Truthfulness Evaluation of Quantized LLMs
Reliable Weak-to-Strong Monitoring of LLM Agents
SLIM: Subtrajectory-Level Elimination for More Effective Reasoning
Caught in the Act: a mechanistic approach to detecting deception
Democracy-in-Silico: Institutional Design as Alignment in AI-Governed Polities
Skill-based Explanations for Serendipitous Course Recommendation
ReST-RL: Achieving Accurate Code Reasoning of LLMs with Optimized Self-Training and Decoding
Instructional Agents: LLM Agents on Automated Course Material Generation for Teaching Faculties
InquireMobile: Teaching VLM-based Mobile Agent to Request Human Assistance via Reinforcement Fine-Tuning
Analysing Chain of Thought Dynamics: Active Guidance or Unfaithful Post-hoc Rationalisation?
Tracking World States with Language Models: State-Based Evaluation Using Chess
CASE: An Agentic AI Framework for Enhancing Scam Intelligence in Digital Payments
Flocking Behavior: An Innovative Inspiration for the Optimization of Production Plants
SWIRL: A Staged Workflow for Interleaved Reinforcement Learning in Mobile GUI Control
Model Science: getting serious about verification, explanation and control of AI systems
Federated Fine-Tuning of Sparsely-Activated Large Language Models on Resource-Constrained Devices
MuSpike: A Benchmark and Evaluation Framework for Symbolic Music Generation with Spiking Neural Networks
Real-Time Intuitive AI Drawing System for Collaboration: Enhancing Human Creativity through Formal and Contextual Intent Integration
TTF-VLA: Temporal Token Fusion via Pixel-Attention Integration for Vision-Language-Action Models
Emotional Manipulation by AI Companions
Lossless Compression of Neural Network Components: Weights, Checkpoints, and K/V Caches in Low-Precision Formats
A Theory of Information, Variation, and Artificial Intelligence
The Aegis Protocol: A Foundational Security Framework for Autonomous AI Agents
MultiPL-MoE: Multi-Programming-Lingual Extension of Large Language Models through Hybrid Mixture-of-Experts
Should LLMs be WEIRD? Exploring WEIRDness and Human Rights in Large Language Models
Whisper based Cross-Lingual Phoneme Recognition between Vietnamese and English
Rethinking Reasoning in LLMs: Neuro-Symbolic Local RetoMaton Beyond ICL and CoT
MixGAN: A Hybrid Semi-Supervised and Generative Approach for DDoS Detection in Cloud-Integrated IoT Networks
POT: Inducing Overthinking in LLMs via Black-Box Iterative Optimization
Towards Production-Worthy Simulation for Autonomous Cyber Operations
FLAIRR-TS -- Forecasting LLM-Agents with Iterative Refinement and Retrieval for Time Series
CORTEX: Composite Overlay for Risk Tiering and Exposure in Operational AI Systems
CORE: Lossless Compression for Retrieval-Augmented LLMs via Reinforcement Learning
RL-Finetuned LLMs for Privacy-Preserving Synthetic Rewriting
Prompt-in-Content Attacks: Exploiting Uploaded Inputs to Hijack LLM Behavior
Tricking LLM-Based NPCs into Spilling Secrets
Seeing Like a Designer Without One: A Study on Unsupervised Slide Quality Assessment via Designer Cue Augmentation

Research Sources: 445 | Generated: 9/27/2025