Below is the list of accepted papers for BMVC 2025. Congratulations! You will receive an email with further information and the next steps soon!

If your paper is not listed, it has been rejected. We understand how disappointing it can be to have a paper rejected—we’ve all been there. We hope the feedback from the reviews (when you receive the email) will provide valuable insights for revising the work and that you will consider resubmitting it in the future.

This year, BMVC received 865 submissions of which 276 papers were accepted. Each paper had 3 reviews, including a meta-review. All papers were discussed among the reviewers and the assigned Area Chairs (AC). Meta-reviews were verified by our Programme Chairs (PCs). All this was done while preserving author anonymity and avoiding domain conflicts.


Number Table
IDTitle
12Part Segmentation and Motion Estimation for Articulated Objects with Dynamic 3D Gaussians
18Volumetric Temporal Texture for Smoke Stylization using Dynamic Radiance Fields
22Learning from Synthetic Data for Visual Grounding
24MSMVD: Exploiting Multi-scale Image Features via Multi-scale BEV Features for Multi-view Pedestrian Detection
26Guiding a diffusion model with itself using sliding windows
28Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition
29SPARTAN: Spatiotemporal Pose-Aware Retrieval for Text-guided Autonomous Navigation
37Cross-Modal Scene Semantic Alignment for Image Complexity Assessment
38GC-Font: Few-Shot Font Generation via Global Contextual Feature Modelling
39DocAttentionRect: Attention-Guided Document Image Rectification
43Split Matching for Inductive Zero-shot Semantic Segmentation
55Piezoelectric Acoustic Sensing for Sitting Pose Classification
56Pointly-Supervised Weak-Shot Semantic Segmentation via Dual Mapping Transfer
59CMAMRNet: A Contextual Mask-Aware Network Enhancing Mural Restoration Through Comprehensive Mask Guidance
66ST-GDance: Long-Term and Collision-Free Group Choreography from Music
69TACTFL: Temporal Contrastive Training for Multi-modal Federated Learning with Similarity-guided Model Aggregation
77Training-Free Synthetic Data Generation with Dual IP-Adapter Guidance
81SSNeRF: Sparse View Semi-supervised Neural Radiance Fields with Augmentation
83RPD-Diff: Region-Adaptive Physics-Guided Diffusion Model for Visibility Enhancement under Dense and Non-Uniform Haze
84Continual Vision-and-Language Navigation
88CRCE: Coreference-Retention Concept Erasure in Text-to-Image Diffusion Models
102STAIN: Smooth Tile-Aware Instance Normalisation for Virtual Staining
113Prompt-Based Exemplar Super-Compression and Regeneration for Class-Incremental Learning
117RAPrivacy: a Readable Anonymizer for Privacy Preserving Action Recognition
121BOTM: Echocardiography Segmentation via Bi-directional Optimal Token Matching
122Distribution-guided Generative Replay with Semantic Prompts for Class-Incremental Chest X-ray Diagnosis
124Mask2Act: Predictive Multi-Object Tracking as Video Pre-Training for Robot Manipulation
126CLAIR: CLIP-Aided Weakly Supervised Zero-Shot Cross-Domain Image Retrieval
128Uncertainty Diffusion: Parameter-Efficient Depth Refinement via Uncertainty-Guided Diffusion Models
131Dual-Stream Attention with Multi-Modal Queries for Object Detection in Transportation Applications
133M$^2$StyleGS: Multi-Modality 3D Style Transfer with Gaussian Splatting
136Lang4D: Weakly Supervised Learning of 4D Language Splatting
138SemanticControl: A Training-Free Approach for Handling Loosely Aligned Visual Conditions in ControlNet
147Spatial-Frequency Domain Aggregation for Visual Place Recognition
150WTNet: A Weather Transfer Network for Domain-Adaptive All-In-One Adverse Weather Image Restoration
154Seed-to-Seed: Unpaired Image Translation in Diffusion Seed Space
159Identity-Motion Trade-offs in Text-to-Video Generation
165Four eyes see more than two: Dataset Distillation with Mixture-of-Experts
168PSScreen: Partially Supervised Multiple Retinal Disease Screening
173CellMamba: Adaptive Mamba for Accurate and Efficient Cell Detection
181LuKAN: A Kolmogorov-Arnold Network Framework for 3D Human Motion Prediction
182eXtended Multimodal Composite Association Score (xMCAS): A Gender Inclusive Approach to Measurement of Bias in Text-To-Image Diffusion Models
184Graph Similarity Learning of Floor Plans
195Proto-FG3D: Prototype-based Interpretable Fine-Grained 3D Shape Classification
211Efficient Image Restoration via Latent Consistency Flow Matching
218Bridging Visual-Textual Modalities: Weakly Supervised Histopathology Segmentation
220LiDAR MOT-DETR: A LiDAR-based Two-Stage Transformer for 3D Multiple Object Tracking
228Dual-Expert Collaborative Network for Fake News Detection with External Knowledge Integration
232Unsupervised Video Continual Learning via Non-Parametric Deep Embedded Clustering
238Capture and Reconstruct 3D Clothed Human from Images
239REACT: Real-time Efficiency and Accuracy Compromise for Tradeoffs in Scene Graph Generation
257S$^2$V2V: Training-Free Video-to-Video with Sparse Points and Motion Guidance
259Revisiting Entropy Minimization for Long-Sequence Continual Test-Time Adaptation
261ADIR: Adaptive Diffusion for Image Reconstruction
265Enhancing Visual Tracking by Leveraging High-frequency Information within Event Signals
270FaceGCD: Generalized Face Discovery via Dynamic Prefix Generation
272B-RIGHT: Benchmark Re-evaluation for Integrity in Generalized Human-Object Interaction Testing
276Learning Event-guided Exposure-agnostic Video Frame Interpolation via Adaptive Feature Blending
277CFFlow: An Optical Flow Estimation Hinging on Cross-Frequency Attention
281CLIP meets DINO for Tuning Zero-Shot Classifier using Unlabeled Image Collections
283AegisRF: Adversarial Perturbations Guided with Sensitivity for Protecting Intellectual Property of Neural Radiance Fields
284Multimodal Feature Collaboration and Fusion for Fine-Grained Action Recognition
288Benchmarking Microsaccade Recognition with Event Cameras: A Novel Dataset and Evaluation
289CHIP: A multi-sensor dataset for 6D pose estimation of chairs in industrial settings
291Detection Transformers Under the Knife: A Neuroscience-Inspired Approach to Ablations
292SVAC: Scaling Is All You Need For Referring Video Object Segmentation
294Task Progressive Curriculum Learning for Robust Visual Question Answering
310PME3D: An Adaptive and Efficient Multi-modal Feature Extraction Plug-in for 3D Object Detection
313OctreeNCA: Single-Pass 184 MP Segmentation on Consumer Hardware
320Extreme Model Compression with Structured Sparsity at Low Precision
328DepthHMR: Leveraging Depth Around Humans for Multi-Human Mesh Generation
330FaceCrafter: Identity-Conditional Diffusion with Disentangled Control over Facial Pose, Expression, and Emotion
333Fast Self-Supervised depth and mask aware Association for Multi-Object Tracking
334DAOVI: Distortion-Aware Omnidirectional Video Inpainting
338GLip: A Global-Local Integrated Progressive Framework for Robust Visual Speech Recognition
339Events Meet Dynamic Mode Decomposition: Capturing the Spatiotemporal Dynamics of Moving Objects
342Flatness-aware Curriculum Learning via Adversarial Difficulty
346Coarse Attribute Prediction with Task Agnostic Distillation for Real World Clothes Changing ReID
348PosBridge: Multi-View Positional Embedding Transplant for Identity-Aware Image Editing
354Dual-Branch Network via Multiple Illumination-Aware Representation Learning for Steel Surface Defect Classification
357What Can We Learn from Harry Potter? An Exploratory Study of Visual Representation Learning from Atypical Videos
358Pose-Robust Calibration Strategy for Point-of-Gaze Estimation on Mobile Phones
359SteerPose: Simultaneous Extrinsic Camera Calibration and Matching from Articulation
364Learning from Silence and Noise for Visual Sound Source Localization
366HuGeDiff: 3D Human Generation via Diffusion with Gaussian Splatting
367RUSplatting: Robust 3D Gaussian Splatting for Sparse-View Underwater Scene Reconstruction
370RGB-Event Fusion for Robust Lane Detection
373Back To The Drawing Board: Rethinking Scene-Level Sketch-Based Image Retrieval
376Multi-Rationale Explainable Object Recognition via Contrastive Conditional Inference
381Permutation-Invariant Polar Harmonic Pooling for Point-based Neural Networks
391Depth Inconsistency-based spatial-channel attention gate for Mirror Segmentation
392UMM: A Unified Multi-Modal Model for Low-Level Vision Tasks with Dual-Driven Prompting
398Knowledge Distillation via Cross Supervising with Attention for Remote Sensing Object Detection
3993D Shape Reconstruction from Autonomous Driving Radars
402Log NeRF: Comparing Spaces for Learning Radiance Fields
404Exploring Histogram-based Color Constancy
405HVLO-YOLO: An Ultra-Lightweight Detection Model for High-voltage Line Obstacles
408Multimodal Hate Detection Using Dual-Stream Graph Neural Networks
416Beyond Softmax: Dual-Branch Sigmoid Architecture for Accurate Class Activation Maps
423JDATT: A Joint Distillation Framework for Atmospheric Turbulence Mitigation and Target Detection
427Interactive Occlusion Boundary Estimation through Exploitation of Synthetic Data
444Answering from Sure to Uncertain: Uncertainty-Aware Curriculum Learning for Video Question Answering
445Frequency-Temporal Feature Integration for Compressed Video Action Recognition
453HalfMix Augmentation and Regularized Dual-Path Learning for Cross-Domain Gaze Estimation
457A Novel Local Focusing Mechanism for Deepfake Detection Generalization
458Intra-Modal Divergence-Weighted Distillation for Vision-Language Models
461FSF3A: Federated Spatial Feature Alignment and Adaptive Aggregation for Heterogeneous Brain Tumor Segmentation
478Bézier Curve-Based Stroke Extraction for Handwritten Characters
479CleverDistiller: Simple and Spatially Consistent Cross-modal Distillation
486FreqSelect: Frequency-Aware fMRI-to-Image Reconstruction
493Audio-Visual Separation with Hierarchical Fusion and Representation Alignment
495An Explorative Study on Abstract Images and Visual Representations Learned from Them
499One target to align them all: LiDAR, RGB and event cameras extrinsic calibration for Autonomous Driving
500UFD-KD: Unified Frequency Decoupled Knowledge Distillation
502DefectGPT: Towards Multi-Class Defect Detection with Limited Electrical Samples
507Robust and Label-efficient Deep Waste Detection
509Dynamic Convolution and Graph-Coupled Attention for Cross-Subject EEG-Vision Decoding
511LieMorph: Transformer-based Image Registration Using Flows on Lie Groups
516Making Rotation Averaging Fast and Robust with Anisotropic Coordinate Descent
517Catching the Unknown with Limited Data: Bi-Directional Prompt Tuning in CLIP for Few-Shot Open-Set Adaptation
528OmniSegNet: Towards Scalable, Efficient & Universal Medical Image Segmentation
540OptSplat: Recurrent Optimization for Generalizable Reconstruction and Novel View Renderings
543Temporally Compressed 3D Gaussian Splatting for Dynamic Scenes
546Multi-Method Ensemble for Out-of-Distribution Detection
548Pandora: Articulated 3D Scene Graphs from Egocentric Vision
558EZIGen: Enhancing zero-shot personalized image generation with precise subject encoding and decoupled guidance
566OpenHuman4D: Open-Vocabulary 4D Human Parsing
567Zero-Shot Anomaly Detection with Dual-Branch Prompt Selection
577Zero-Shot CFC: Fast Real-World Image Denoising based on Cross-Frequency Consistency
588CoT-SD: Chain-of-Thought Semantic Denoising
593PanoHair: Detailed Hair Strand Synthesis on Volumetric Heads
594Video Dataset Condensation with Diffusion Models
596TopoMortar: A Dataset to Evaluate Topology Accuracy in Image Segmentation
602Dynamic Try-On: Taming Video Virtual Try-on with Dynamic Attention Mechanism
603Dual-Stream Adapters for Open-Set Segmentation in Driving Scenes
610HERO-VQL: Hierarchical, Egocentric and Robust Visual Query Localization
614PerSense: Training-Free Personalized Instance Segmentation in Dense Images
626Beyond Gloss: A Hand-Centric Framework for Gloss-Free Sign Language Translation
627TopoDiT-3D: Topology-Aware Diffusion Transformer with Bottleneck Structure for 3D Point Cloud Generation
628MuDG: Taming Multi-modal Diffusion with Gaussian Splatting for Urban Scene Reconstruction
630Incremental Multi-Scene Modeling via Continual Neural Graphics Primitives
632Segmentation Assisted Incremental Test Time Adaptation in an Open World
644Solving Zero-Shot 3D Visual Grounding as Constraint Satisfaction Problems
646Towards Data-Efficient Medical Imaging: A Generative and Semi-Supervised Framework
649Towards Open-Vocabulary Multimodal 3D Object Detection with Attributes
650Prompt-Informed Reinforcement Learning for Visual Coverage Path Planning
6563D Curvix: From Multiview 2D Edges to 3D Curve Segments
661Leveraging Sparsity for Efficient Inference of High-Resolution Vision Foundation Models
664Canonical Makeup Transfer
666Mitigating Hallucinations in Multimodal LLMs via Object-aware Preference Optimization
668MLoRQ: Bridging Low-Rank and Quantization for Transformer Compression
675Jack of many Faces: A Step Towards Facial Expression and Physiological State Analysis with a Single Network
680On the Role of Individual Differences in Current Approaches to Computational Image Aesthetics
690MonoGSDF: Exploring Monocular Geometric Cues for Gaussian Splatting-Guided Implicit Surface Reconstruction
694Occam’s LGS: An Efficient Approach for Language Gaussian Splatting
698SAMWave: Adapting Segment Anything Model to difficult tasks
700Q-Align: Alleviating Attention Leakage in Zero-Shot Appearance Transfer via Query-Query Alignment
701Category-level Text-to-Image Retrieval Improved: Bridging the Domain Gap with Diffusion Models and Vision Encoders
704JOG3R: Towards 3D-Consistent Video Generators
705ALSA: Anchors in Logit Space for Out-of-Distribution Accuracy Estimation
707Catch Your Concepts: A Flexible Concept Locator for Interpretable Visual Recognition
709TAPM-Net: Trajectory-Aware Perturbation Modeling for Infrared Small Target Detection
716Size-aware Contrastive Imitation Learning for Language-conditioned Multi-task Robotic Manipulation
717From Open Vocabulary to Open World: Teaching Vision Language Models to Detect Novel Objects
718PADS: Plug-and-Play 3D Human Pose Analysis via Diffusion Generative Modeling
721Gromov Wasserstein Optimal Transport for Semantic Correspondences
722Unsupervised Multimodal Deepfake Detection Through Explicit Intra-Modal and Cross-Modal Inconsistency Discovery
732TryOffDiff: Virtual-Try-Off via High-Fidelity Garment Reconstruction using Diffusion Models
736EWMBench: Evaluating Scene, Motion, and Semantic Quality in Embodied World Models
737Self-Intersection-Aware 3D Human Motion Generation Using an Efficient Human Sphere Proxy
745Mapping like a Skeptic: Probabilistic BEV Projection for Online HD Mapping
751TPA: Temporal Prompt Alignment for Fetal Congenital Heart Defect Classification
753Boosting Camera Motion Control for Video Diffusion Transformers
755Audio-Guided Visual Editing with Complex Multi-Modal Prompts
762Learning a Neural Association Network for Self-supervised Multi-Object Tracking
766Specify and Edit: Overcoming Ambiguity in Text-Based Image Editing
768Interpretable Text-Guided Image Clustering via Iterative Search
770DTFSal: Audio-Visual Dynamic Token Fusion for Video Saliency Prediction
782Is Safety Checker Still Safe? A Study on the Covert NSFW Text
784UDT : Unsupervised Discovery of Transformations between Fine-Grained Classes in Diffusion Models
786TAG: A Simple Yet Effective Temporal-Aware Approach for Zero-Shot Video Temporal Grounding
787${C}^{3}$-GS: Learning Context-aware, Cross-dimension, Cross-scale Feature for Generalizable Gaussian Splatting
788Vision Backbone Efficient Selection for Image Classification in Low-Data Regimes
811DEAD: Data-Efficient Audiovisual Dubbing using Neural Rendering Priors
824A Unified Framework for High-Frame-Rate HDR Video Synthesis
825Modular Embedding Recomposition for Incremental Learning
827MonoTracker: Monocular RGB-Only 6D Tracking of Unknown Object
828Leveraging Modality Tags for Enhanced Cross-Modal Video Retrieval
831LOGen: Toward LiDAR Object Generation by Point Diffusion
832SIMULDITEX: Single Image Multiscale & Lightweight Diffusion for Texture Modelling
835Cloud-Stereo: A Dataset and Benchmark for Reconstructing Atmospheric Clouds from Stereo Images
850C-SWAP: Explainability-Aware Structured Pruning for Efficient Neural Networks Compression
851Asymmetric Event-Image Stereo with Temporal Feature Gating and Iterative Structure-Detail Refinement
852LoFT: LoRA-fused Training Dataset Generation with Few-shot Guidance
857Lost in Time: A New Temporal Benchmark for VideoLLMs
859Improving Multimodal Distillation for 3D Semantic Segmentation under Domain Shift
864Geometry-Aware Diffusion Models for Multiview Scene Inpainting
865Estimating Foot Pressure and Stability from Visual Input
866Image Recognition with Vision and Language Embeddings of VLMs
873Exploring Image Representation with Decoupled Classical Visual Descriptors
875Lost in Translation? Vocabulary Alignment for Source-Free Adaptation in Open-Vocabulary Semantic Segmentation
880Distortion-Aware Multi-Object Tracking via Virtual Plane Projection in Overhead Fisheye Cameras
887QWD-GAN: Quality-aware Wavelet-driven GAN for Unsupervised Medical Microscopy Images Denoising
896Llama Learns to Direct: DirectorLLM for Human-Centric Video Generation
898FaceCPT: Toward Cross-Modal Facial Representation Learning with Face-Caption Pre-Training
900Time-Scaling State-Space Models for Dense Video Captioning
902ITC-RWKV: Interactive Tissue–Cell Modeling with Recurrent Key-Value Aggregation for Histopathological Subtyping
903Toward Robust Audio-Visual Synchronization Detection in Egocentric Video with Sparse Synchronization Events
914Improving Human Motion Plausibility with Body Momentum
915DualDistill: A Unified Cross-Modal Knowledge Distillation Framework for Camera-Based BEV Representation
922FSLC: Fast Scoring with Learnable Coreset for Zero-shot Industrial Anomaly Detection
931Diffusion Transformer-to-Mamba Distillation for High-Resolution Image Generation
938Grad-CL: Source Free Domain Adaptation with Gradient Guided Feature Disalignment
940Semi-MoE: Mixture-of-Experts meets Semi-Supervised Histopathology Segmentation
943HEAL: Learning-Free Source Free Unsupervised Domain Adaptation for Cross-Modality Medical Image Segmentation
948Tracking Meets Large Multimodal Models for Driving Scenario Understanding
949Prompt Image to Watch and Hear: Multimodal Prompting for Parameter-Efficient Audio-Visual Learning
955Clean Sample Selection and Noisy Sample Rematching for Text-Based Pedestrian Retrieval
960A Hybrid Framework Bridging CNN and ViT based on Theory of Evidence for Diabetic Retinopathy Grading
962LaMamba-Diff: Linear-Time High-Fidelity Diffusion Models Based on Local Attention and Mamba
976Advancing Utility Pole and Sign Detection Through Deep Learning
981Ev4DGS: Novel-view Rendering of Non-Rigid Objects from Monocular Event Streams
986Visible Structure Retrieval for Lightweight Image-Based Relocalisation
987Spatiotemporal Event Spotting via 3D Heatmaps with Dynamically Shifted Gaussian Kernels
988Ink Enhancement for Ancient Bamboo Manuscripts Using Iterative Restoration-Degradation Adversarial Learning
989MedOpenSeg: Open-World Medical Segmentation with Memory-Augmented Transformers
992Dual Polarity Prompts with Stochastic Entropy Perturbation for Label Noise
999Lid-Lab-NeRF: Generating Temporally Consistent, Labelled LiDAR Point Clouds using Neural Radiance Fields
1004RP-SAM2: Refining Point Prompts for Stable Surgical Instrument Segmentation
1006Verifier Matters: Enhancing Inference-Time Scaling for Video Diffusion Models
1007Is Structural Awareness the Key to Event Camera Data Cleansing for Enhancing Veracity?
1013SALT: Parameter-Efficient Fine-Tuning via Singular Value Adaptation with Low-Rank Transformation
1015In-Model Merging for Enhancing the Robustness of Medical Imaging Classification Models
1024Quantifying Risk in Pedestrian Crowds Using Divergence Estimated from Flows of Head-Tracking Data
1029PrIINeR: Towards Prior-Informed Implicit Neural Representations for Accelerated MRI
1030Language-Guided Decision Override for Adaptive and Retraining-Free Video Anomaly Detection
1035Superpixel Anything: A general object-based framework for accurate yet regular superpixel segmentation
1036Benchmarking Vision Foundation Models for Input Monitoring in Autonomous Driving
1037Generative Data Augmentation for Object Point Cloud Segmentation
1051Conditional Prototype Learning for Few-Shot Object Detection
1058Atomizer: Generalizing to unseen modalities by breaking images down to a set of scalars
1060Learning Correlation-aware Aleatoric Uncertainty for 3D Hand Pose Estimation
1061IPGPhormer: Interpretable Pathology Graph-Transformer for Survival Analysis
1062Calibration-Aware Prompt Learning for Medical Vision-Language Models
10643D-WAG: Hierarchical Wavelet-Guided Autoregressive Generation for High-Fidelity 3D Shapes
1071Stabilizing Open-Set Test-Time Adaptation via Primary-Auxiliary Filtering and Knowledge-Integrated Prediction
1072Controllable Garment Generation with Multi-Modal Diffusion Guidance
1076CLFSeg: A Fuzzy-Logic based Solution for Boundary Clarity and Uncertainty Reduction in Medical Image Segmentation
1077Towards Sharper Object Boundaries in Self-Supervised Depth Estimation
1081ImProvShow: Multimodal Fusion for Image Provenance Summarization
1091CLIMB-3D: Class-Incremental Imbalanced 3D Instance Segmentation
1095Supervised Segmentation Model for Improved Detection of OSSN using Slit Lamp Images
1096LED: Light Enhanced Depth Estimation at Night
1109AKD-BNN: Adaptive Kernel Dynamic Bayesian Neural Networks for Enhanced Medical Image Segmentation with Uncertainty Estimation
1110Robust Human Registration with Body Part Segmentation on Noisy Point Clouds
1118Conformal Predictors for Efficient Video Text Spotting
1121MIAS-SAM: Medical Image Anomaly Segmentation without thresholding
1123Evaluating Self-Supervised Learning in Medical Imaging: A Systematic Investigation of Robustness, Generalizability, and Multi-Domain Impact
1127ALFred: An Active Learning Framework for Real-world Semi-supervised Anomaly Detection with Adaptive Thresholds
1132Isolated Channel Vision Transformers: From Single-Channel Pretraining to Multi-Channel Finetuning
1134TRUDI and TITUS: A Multi-Perspective Dataset and A Three-Stage Recognition System for Transportation Unit Identification
1135The Trauma THOMPSON Dataset for Real-World Emergency AI
1137ETTA: Efficient Test-Time Adaptation for Vision-Language Models through Dynamic Embedding Updates
1139RETRO: REthinking Tactile Representation Learning with Material PriOrs
1156Hierarchical Image-Guided 3D Point Cloud Segmentation in Industrial Scenes via Multi-View Bayesian Fusion
1161Evaluating Perceptual Distance Models by Fitting Binomial Distributions to Two-Alternative Forced Choice Data
1171Conflict-Aware Adversarial Training
1177Binarizing Severely Degraded Ancient Bamboo Slips: Dataset and Baseline
1180MO-SHW: Hierarchy-Aware Multi-Objective Optimization for Open-World Segmentation
1183Contrastive Point Feature Matching for Open-world Object Counting
1187Out-of-Distribution Detection from Small Training Sets using Bayesian Neural Network Classifiers
1196IoSR: End-to-End Intraoral Scans Repairing
1199AULUNet: An Adaptive Ultra-Lightweight U-Net Framework for Efficient Skin Lesion Segmentation in Resource-Constrained Environments
1209Language-Guided Reinforcement Learning for Hard Attention in Few-Shot Learning
1220Hair Strand Reconstruction based on 3D Gaussian Splatting
1224RASALoRE: Region Aware Spatial Attention with Location-based Random Embeddings for Weakly Supervised Anomaly Detection in Brain MRI Scans
1257End-to-End LiDAR optimization for point cloud registration