The 36th British Machine Vision Conference 2025: Proceedings

12	Part Segmentation and Motion Estimation for Articulated Objects with Dynamic 3D Gaussians Jun-Jee Chao (University of Minnesota); Qingyuan Jiang (University of Minnesota); Volkan Isler (University of Texas at Austin) PDF Poster Video (Right click to download)
18	Volumetric Temporal Texture for Smoke Stylization using Dynamic Radiance Fields Dongqing Wang (École Polytechnique Fédérale de Lausanne (EPFL)); Ehsan Pajouheshgar (École Polytechnique Fédérale de Lausanne (EPFL)); Yitao Xu (École Polytechnique Fédérale de Lausanne (EPFL)); Tong Zhang (École Polytechnique Fédérale de Lausanne (EPFL)); Sabine Süsstrunk (École Polytechnique Fédérale de Lausanne (EPFL)) PDF Poster Video (Right click to download)
22	Learning from Synthetic Data for Visual Grounding Ruozhen He (Rice University); Ziyan Yang (Rice University); Paola Cascante-Bonilla (Stony Brook University); Alexander C. Berg (University of California, Irvine); Vicente Ordonez (Rice University) PDF Poster Video (Right click to download)
24	MSMVD: Exploiting Multi-scale Image Features via Multi-scale BEV Features for Multi-view Pedestrian Detection Taiga Yamane (NTT Corporation); Satoshi Suzuki (NTT Corporation); Ryo Masumura (NTT Corporation); Shota Orihashi (NTT Corporation); Tomohiro Tanaka (NTT Corporation); Mana Ihori (NTT Corporation); Naoki Makishima (NTT Corporation); Naotaka Kawata (NTT Corporation) PDF Poster
26	Guiding a diffusion model with itself using sliding windows Nikolas Adaloglou (Heinrich Heine University); Tim Kaiser (Heinrich Heine University); Damir Iagudin (Heinrich Heine University); Markus Kollmann (Heinrich Heine University) PDF Poster Video (Right click to download)
28	Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition Chun-Hsiao Yeh (UC Berkeley); Ta-Ying Cheng (University of Oxford); He-Yen Hsieh (Harvard University); David Chuan-En Lin (Carnegie Mellon University); Yi Ma (University of Hong Kong); Andrew Markham (University of Oxford); Niki Trigoni (University of Oxford); H. T. Kung (Harvard University); Yubei Chen (UC Davis) PDF Poster Video (Right click to download)
29	SPARTAN: Spatiotemporal Pose-Aware Retrieval for Text-guided Autonomous Navigation Xiangyu Bai (Northeastern University); Sai Anish Sreeramagiri (Northeastern University); Sai Siddhartha Vivek Dhir Rangoju (Northeastern University); Bishoy Galoaa (Northeastern University); Eric C Mortin (DEVCOM Analysis Center (DAC), US Army); Sarah Ostadabbas (Northeastern University) PDF Poster Video (Right click to download)
37	Cross-Modal Scene Semantic Alignment for Image Complexity Assessment Yuqing Luo (Cardiff University); YIXIAO LI (Beihang University); Jiang Liu (Cardiff University); Jun Fu (Cardiff University); Hadi Amirpour (University of Klagenfurt); Guanghui Yue (Shenzhen University); Baoquan Zhao (SUN YAT-SEN UNIVERSITY); Padraig Corcoran (Cardiff University); Hantao Liu (Cardiff University); Wei Zhou (Cardiff University) PDF Poster Video (Right click to download)
38	GC-Font: Few-Shot Font Generation via Global Contextual Feature Modelling Weiran Chen (Soochow University); Guiqian Zhu (Soochow University); Ying Li (Soochow University); Yi Ji (Soochow University); Chunping Liu (Soochow University) PDF Poster Video (Right click to download)
39	DocAttentionRect: Attention-Guided Document Image Rectification Pooja Kumari (Indian Institute of Technology Madras); Sukhendu Das (Indian Institute of Technology Madras) PDF Poster Video (Right click to download)
43	Split Matching for Inductive Zero-shot Semantic Segmentation Jialei Chen (Nagoya University); Xu Zheng (The Hong Kong University of Science and Technology (Guangzhou) (HKUST (GZ))); Dongyue Li (Nagoya University); Chong Yi (Nagoya University); Seigo Ito (Nagoya University); Danda Pani Paudel (Artificial Intelligence and Technology (INSAIT), Sofia University); Luc Van Gool (Artificial Intelligence and Technology (INSAIT), Sofia University); Hiroshi Murase (Nagoya University); Daisuke Deguchi (Nagoya University) PDF Poster Video (Right click to download)
55	Piezoelectric Acoustic Sensing for Sitting Pose Classification Yuuki Shibuya (Tokyo University of Science); Go Irie (Tokyo University of Science) PDF Poster Video (Right click to download)
56	Pointly-Supervised Weak-Shot Semantic Segmentation via Dual Mapping Transfer Wenhui Jiang (Jiangxi University of Finance and Economics); Ruikang Luo (Jiangxi University of Finance and Economics); Zeyu Luo (Jiangxi University of Finance and Economics); Xiaowei Zhao (Sany Heavy Industry Co.); Junjie Chen (Jiangxi University of Finance and Economics); Yuming Fang (Jiangxi University of Finance and Economics) PDF Poster Video (Right click to download)
59	CMAMRNet: A Contextual Mask-Aware Network Enhancing Mural Restoration Through Comprehensive Mask Guidance Yingtie Lei (University of Macau); Fanghai Yi (Guangdong University of Technology); Yihang Dong (University of Chinese Academy of Sciences); Weihuang Liu (University of Macau); Xiaofeng Zhang (Shanghai Jiao Tong University); Zimeng Li (Shenzhen Polytechnic University); Chi-Man Pun (University of Macau); Xuhang Chen (University of Macau) PDF Poster Video (Right click to download)
66	ST-GDance: Long-Term and Collision-Free Group Choreography from Music Jing Xu (Monash University); Weiqiang Wang (Monash University); Cunjian Chen (Monash University); Jun Liu (Lancaster University); Qiuhong Ke (Monash University) PDF Poster Video (Right click to download)
69	TACTFL: Temporal Contrastive Training for Multi-modal Federated Learning with Similarity-guided Model Aggregation Guanxiong Sun (University of Bristol); Majid Mirmehdi (University of Bristol); Zahraa S. Abdallah (University of Bristol); Raúl Santos-Rodriguez (University of Bristol); Ian James Craddock (University of Bristol); Telmo M Silva Filho (University of Bristol) PDF Poster
77	Training-Free Synthetic Data Generation with Dual IP-Adapter Guidance Luc Boudier (École Polytechnique); Loris Manganelli (École Polytechnique); Eleftherios Tsonis (École Polytechnique); Nicolas Dufour (École Polytechnique, École des Ponts); Vicky Kalogeiton (Ecole polytechnique) PDF Poster Video (Right click to download)
81	SSNeRF: Sparse View Semi-supervised Neural Radiance Fields with Augmentation Xiao Cao (National University of Singapore); Beibei Lin (National University of Singapore); Bo Wang (University of Mississippi); Zhiyong Huang (National University of Singapore); Robby T. Tan (National University of Singapore) PDF Poster Video (Right click to download)
83	RPD-Diff: Region-Adaptive Physics-Guided Diffusion Model for Visibility Enhancement under Dense and Non-Uniform Haze Ruicheng Zhang (SUN YAT-SEN UNIVERSITY); Puxin Yan (SUN YAT-SEN UNIVERSITY); Zeyu Zhang (The Australian National University); Yicheng Chang (SUN YAT-SEN UNIVERSITY); Hongyi Chen (SUN YAT-SEN UNIVERSITY); Zhi Jin (SUN YAT-SEN UNIVERSITY) PDF Poster Video (Right click to download)
84	Continual Vision-and-Language Navigation SeongJun Jeong (Seoul National University); Gi-Cheon Kang (Seoul National University); Seongho Choi (Seoul National University); Joochan Kim (Korea Institute of Science and Technology); Byoung-Tak Zhang (Seoul National University) PDF Poster Video (Right click to download)
88	CRCE: Coreference-Retention Concept Erasure in Text-to-Image Diffusion Models Yuyang Xue (University of Edinburgh); Edward Moroshko (University of Edinburgh); Feng Chen (University of Edinburgh); Jingyu Sun (University of Edinburgh); Steven G. McDonagh (University of Edinburgh); Sotos Tsaftaris (University of Edinburgh) PDF Poster Video (Right click to download)
102	STAIN: Smooth Tile-Aware Instance Normalisation for Virtual Staining David Armstrong (The Queen's University Belfast); Iain B Styles (The Queen's University Belfast); Niall McLaughlin (The Queen's University Belfast) PDF Poster Video (Right click to download)
113	Prompt-Based Exemplar Super-Compression and Regeneration for Class-Incremental Learning Ruxiao Duan (Yale University); Jieneng Chen (Johns Hopkins University); Adam Kortylewski (University of Freiburg); Alan Yuille (Johns Hopkins University); Yaoyao Liu (University of Illinois Urbana-Champaign) PDF Poster Video (Right click to download)
117	RAPrivacy: a Readable Anonymizer for Privacy Preserving Action Recognition ZiZhen Wang (National Yang Ming Chiao Tung University); Yen-Lung Chu (National Yang Ming Chiao Tung University); Pei-Chun Tsai (National Yang Ming Chiao Tung University); Kuan-Wen Chen (National Yang Ming Chiao Tung University) PDF Poster Video (Right click to download)
121	BOTM: Echocardiography Segmentation via Bi-directional Optimal Token Matching Zhihua Liu (University of Leicester); Lei Tong (AstraZeneca); Xilin He (The Chinese University of Hong Kong); Che Liu (Imperial College London); Rossella Arcucci (Imperial College London); Chen Jin (Astrazeneca); Huiyu Zhou (University of Leicester) PDF Poster Video (Right click to download)
122	Distribution-guided Generative Replay with Semantic Prompts for Class-Incremental Chest X-ray Diagnosis Jayant Mahawar (Indian Institue of Technology Jodhpur); Devi Prasad Maharathy (Indian Institute of Technology Jodhpur); Angshuman Paul (Indian Institute of Technology Jodhpur) PDF Poster Video (Right click to download)
124	Mask2Act: Predictive Multi-Object Tracking as Video Pre-Training for Robot Manipulation Junbo Zhang (Tsinghua University); Kaisheng Ma (Tsinghua University) PDF Poster Video (Right click to download)
126	CLAIR: CLIP-Aided Weakly Supervised Zero-Shot Cross-Domain Image Retrieval Chor Boon Tan (National University of Singapore); Conghui Hu (National University of Singapore); Gim Hee Lee (National University of Singapore) PDF Poster Video (Right click to download)
128	Uncertainty Diffusion: Parameter-Efficient Depth Refinement via Uncertainty-Guided Diffusion Models Jeng-Huo Tzeng (National Yang Ming Chiao Tung University); Chuan-Yuan Huang (National Yang Ming Chiao Tung University); Kuan-Wen Chen (National Yang Ming Chiao Tung University) PDF Poster Video (Right click to download)
131	Dual-Stream Attention with Multi-Modal Queries for Object Detection in Transportation Applications Noreen Anwar (LITIV, Polytechnique Montréal); Guillaume-Alexandre Bilodeau (LITIV, Polytechnique Montréal); Wassim Bouachir (Université du Québec (TELUQ)) PDF Poster Video (Right click to download)
133	M$^2$StyleGS: Multi-Modality 3D Style Transfer with Gaussian Splatting Xingyu Miao (Durham University); Xueqi Qiu (Durham University); Haoran Duan (Tsinghua University); Yawen Huang (Tencent Jarvis Lab); Xian Wu (Tencent Jarvis Lab); Jingjing Deng (University of Bristol); Yang Long (Durham University) PDF Poster Video (Right click to download)
136	Lang4D: Weakly Supervised Learning of 4D Language Splatting Mana Masuda (CyberAgent, Keio University); Taiki Sekii (CyberAgent) PDF Poster Video (Right click to download)
138	SemanticControl: A Training-Free Approach for Handling Loosely Aligned Visual Conditions in ControlNet Woosung Joung (Korea University); Daewon Chae (University of Michigan - Ann Arbor); Jinkyu Kim (Korea University) PDF Poster Video (Right click to download)
147	Spatial-Frequency Domain Aggregation for Visual Place Recognition Chaoqun Wang (South China Normal University); Shaobo Min (Tencent) PDF Poster Video (Right click to download)
150	WTNet: A Weather Transfer Network for Domain-Adaptive All-In-One Adverse Weather Image Restoration Si-Yu Huang (National Yang Ming Chiao Tung University); Fu-Jen Tsai (National Tsing Hua University); Chia-Wen Lin (National Tsing Hua University); Yen-Yu Lin (National Yang Ming Chiao Tung University) PDF Poster Video (Right click to download)
154	Seed-to-Seed: Unpaired Image Translation in Diffusion Seed Space Or Greenberg (General Motors R&D, The Hebrew University of Jerusalem); Eran Kishon (General Motors R&D); Dani Lischinski (The Hebrew University of Jerusalem) PDF Poster
159	Identity-Motion Trade-offs in Text-to-Video Generation Yuval Atzmon (NVIDIA Research); Rinon Gal (NVIDIA Research); Yoad Tewel (NVIDIA Research); Yoni Kasten (NVIDIA Research); Gal Chechik (NVIDIA Research) PDF Poster
165	Four eyes see more than two: Dataset Distillation with Mixture-of-Experts Jia-Jiun Yao (National Yang Ming Chiao Tung University); Sheng-Feng Yu (National Yang Ming Chiao Tung University, Macronix International Co., Ltd.); Wei-Chen Chiu (National Yang Ming Chiao Tung University) PDF Poster Video (Right click to download)
168	PSScreen: Partially Supervised Multiple Retinal Disease Screening Boyi Zheng (University of Oulu); Qing Liu (University of Oulu) PDF Poster Video (Right click to download)
173	CellMamba: Adaptive Mamba for Accurate and Efficient Cell Detection Ruochen Liu (University of Liverpool); Yi.Tian (National University of Singapore); Jiahao Wang (Xi'an Jiaotong-Liverpool University); Hongbin Liu (Xi'an Jiaotong-Liverpool University); Xianxu Hou (Xi'an Jiaotong-Liverpool University); Jingxin Liu (Xi'an Jiaotong-Liverpool University) PDF Poster Video (Right click to download)
181	LuKAN: A Kolmogorov-Arnold Network Framework for 3D Human Motion Prediction Md Zahidul Hasan (Concordia University); Abdessamad Ben Hamza (Concordia University); Nizar Bouguila (Concordia University) PDF Poster Video (Right click to download)
182	eXtended Multimodal Composite Association Score (xMCAS): A Gender Inclusive Approach to Measurement of Bias in Text-To-Image Diffusion Models Abhishek Mandal (Dublin City University); Susan Leavy (University College Dublin); Suzanne Little (Dublin City University) PDF Poster Video (Right click to download)
184	Graph Similarity Learning of Floor Plans Casper van Engelenburg (Delft University of Technology); Jan van Gemert (Delft University of Technology); Seyran Khademi (Delft University of Technology) PDF Poster
195	Proto-FG3D: Prototype-based Interpretable Fine-Grained 3D Shape Classification Shuxian Ma (University of Jinan); Zihao Dong (University of Jinan); Runmin Cong (Shandong University); Sam Tak-Wu Kwong (Lingnan University); Xiuli Shao (Nankai University) PDF Poster Video (Right click to download)
211	Efficient Image Restoration via Latent Consistency Flow Matching Elad Cohen (Sony Semiconductor Israel); Idan Achituve (Sony Semiconductor Israel); Idit Diamant (Sony Semiconductor Israel); Arnon Netzer (Sony Semiconductor Israel); Hai Victor Habi (Sony Semiconductor Israel) PDF Poster Video (Right click to download)
218	Bridging Visual-Textual Modalities: Weakly Supervised Histopathology Segmentation Sicong Gao (University of New South Wales); Matthew AB Baker (University of New South Wales); Maurice Pagnucco (University of New South Wales); Zhiwei Gao (Hebei Key Laboratory of Electromagnetic Environmental Effects and Information Processing); Yang Song (University of New South Wales) PDF Poster Video (Right click to download)
220	LiDAR MOT-DETR: A LiDAR-based Two-Stage Transformer for 3D Multiple Object Tracking Martha Teiko Teye (University of Wuppertal); Ori Maoz (Aptiv); Matthias Rottmann (Universität Osnabrück) PDF Poster Video (Right click to download)
228	Dual-Expert Collaborative Network for Fake News Detection with External Knowledge Integration Wenxin Luo (Beijing Jiaotong University); Zhu Teng (Beijing Jiaotong Univeristy); Wei Zhang (Beijing Jiaotong University); Baopeng Zhang (Beijing Jiaotong Univeristy) PDF Poster Video (Right click to download)
232	Unsupervised Video Continual Learning via Non-Parametric Deep Embedded Clustering Nattapong Kurpukdee (University of York); Adrian G. Bors (University of York) PDF Poster Video (Right click to download)
238	Capture and Reconstruct 3D Clothed Human from Images Onat Vuran (ETH Zürich); Hsuan-I Ho (ETH Zürich) PDF Poster Video (Right click to download)
239	REACT: Real-time Efficiency and Accuracy Compromise for Tradeoffs in Scene Graph Generation Maëlic Neau (Umeå University); Paulo Eduardo Santos (PrioriAnalytica); Anne-Gwenn Bosser (Bretagne INP); Akihiro Sugimoto (National Institute of Informatics); Cedric Buche (CNRS IRL 2010 CROSSING, IMT Atlantique) PDF Poster Video (Right click to download)
257	S$^2$V2V: Training-Free Video-to-Video with Sparse Points and Motion Guidance Xinyu Zhang (University of Auckland); Zicheng Duan (The University of Adelaide); Dong Gong (University of New South Wales); Lingqiao Liu (University of Adelaide) PDF Poster Video (Right click to download)
259	Revisiting Entropy Minimization for Long-Sequence Continual Test-Time Adaptation WeiQin Chuah (Royal Melbourne Institute of Technology); Ruwan Tennakoon (Royal Melbourne Institute of Technology); Alireza Bab-Hadiashar (Royal Melbourne Institute of Technology) PDF Poster Video (Right click to download)
261	ADIR: Adaptive Diffusion for Image Reconstruction Shady Abu-Hussein (Tel Aviv University); Tom Tirer (Tel Aviv University); Raja Giryes (Bar Ilan University) PDF Poster Video (Right click to download)
265	Enhancing Visual Tracking by Leveraging High-frequency Information within Event Signals Yuheng Jiang (University of Science and Technology of China); Hebei Li (University of Science and Technology of China); Dachun Kai (University of Science and Technology of China); Yansong Peng (University of Science and Technology of China); Jiahui Yuan (University of Science and Technology of China); Peilin Xiao (University of Science and Technology of China); Yueyi Zhang (MiroMind); Xiaoyan Sun (University of Science and Technology of China) PDF Poster Video (Right click to download)
270	FaceGCD: Generalized Face Discovery via Dynamic Prefix Generation Yunseok Oh (Inha University, AutoLabs, Inc.); Dong-Wan Choi (Inha University) PDF Poster Video (Right click to download)
272	B-RIGHT: Benchmark Re-evaluation for Integrity in Generalized Human-Object Interaction Testing Yoojin Jang (Ulsan National Institute of Science and Technology); Junsu Kim (Ulsan National Institute of Science and Technology); Ha Yeon Kim (Ulsan National Institute of Science and Technology); Eun-Ki Lee (Hanyang University); Eun-Sol Kim (Hanyang University); Seungryul Baek (Ulsan National Institute of Science and Technology); Jaejun Yoo (Ulsan National Institute of Science and Technology) PDF Poster Video (Right click to download)
276	Learning Event-guided Exposure-agnostic Video Frame Interpolation via Adaptive Feature Blending Junsik Jung (KAIST); Yoonki Cho (KAIST); Woo Jae Kim (KAIST); Lin Wang (Nanyang Technological University); Sung-Eui Yoon (KAIST) PDF Poster Video (Right click to download)
277	CFFlow: An Optical Flow Estimation Hinging on Cross-Frequency Attention Mengfei Wang (Shanghai Institute of Microsystem and Information Technology); Dongchen Zhu (Shanghai Institute of Microsystem and Information Technology); Lei Wang (Shanghai Institute of Microsystem and Information Technology); Jiamao Li (Shanghai Institute of Microsystem and Information Technology) PDF Poster Video (Right click to download)
281	CLIP meets DINO for Tuning Zero-Shot Classifier using Unlabeled Image Collections Mohamed Fazli Mohamed Imam (Mohamed bin Zayed University of Artificial Intelligence); Rufael Fekadu Marew (Mohamed bin Zayed University of Artificial Intelligence); Jameel Hassan Abdul Samadh (Johns Hopkins University); Mustansar Fiaz (IBM Research); Alham Fikri Aji (Mohamed bin Zayed University of Artificial Intelligence); Hisham Cholakkal (Mohamed bin Zayed University of Artificial Intelligence) PDF Poster Video (Right click to download)
283	AegisRF: Adversarial Perturbations Guided with Sensitivity for Protecting Intellectual Property of Neural Radiance Fields Woo Jae Kim (Korea Advanced Institute of Science and Technology (KAIST)); Kyu Beom Han (Korea Advanced Institute of Science and Technology (KAIST)); Yoonki Cho (Korea Advanced Institute of Science and Technology (KAIST)); Youngju Na (Korea Advanced Institute of Science and Technology (KAIST)); Junsik Jung (Korea Advanced Institute of Science and Technology (KAIST)); Sooel Son (Korea Advanced Institute of Science and Technology (KAIST)); Sung-Eui Yoon (Korea Advanced Institute of Science and Technology (KAIST)) PDF Poster Video (Right click to download)
284	Multimodal Feature Collaboration and Fusion for Fine-Grained Action Recognition Xinyu Bian (Beijing University of Posts and Telecommunications); Dongliang Chang (Beijing University of Posts and Telecommunications); Yuqi Yang (Beijing University of Posts and Telecommunications); Lei Chen (Tsinghua University); Zhanyu Ma (Beijing University of Post and Telecommunication) PDF Poster Video (Right click to download)
288	Benchmarking Microsaccade Recognition with Event Cameras: A Novel Dataset and Evaluation Waseem Shariff (University of Galway); Timothy Hanley (University of Galway); Maciej Stec (University of Galway); Hossein Javidnia (University of Dublin); Peter Corcoran (University of Galway) PDF Poster Video (Right click to download)
289	CHIP: A multi-sensor dataset for 6D pose estimation of chairs in industrial settings Mattia Nardon (Bruno Kessler Foundation); Mikel Mujika Agirre (Ikerlan); Ander González Tomé (Ikerlan); Daniel Sedano Algarabel (Ikerlan); Josep Rueda Collell (ikerlan); Ana Paola Caro (Andreu World); Andrea Caraffa (Bruno Kessler Foundation); Fabio Poiesi (Bruno Kessler Foundation); Paul Ian Chippendale (Bruno Kessler Foundation); Davide Boscaini (Bruno Kessler Foundation) PDF Poster Video (Right click to download)
291	Detection Transformers Under the Knife: A Neuroscience-Inspired Approach to Ablations Nils Hütten (Bergische Universität Wuppertal); Florian Hölken (Bergische Universität Wuppertal); Hasan Tercan (Bergische Universität Wuppertal); Tobias Meisen (Bergische Universität Wuppertal) PDF Poster
292	SVAC: Scaling Is All You Need For Referring Video Object Segmentation Li Zhang (Columbia University); Haoxiang Gao (Carnegie Mellon University); Zhihao Zhang (Columbia University); Luoxiao Huang (New York University); Tao Zhang (Wuhan University) PDF Poster Video (Right click to download)
294	Task Progressive Curriculum Learning for Robust Visual Question Answering Ahmed Akl (Griffith University); Abdelwahed Khamis (CSIRO); Zhe Wang (Griffith University); Ali Cheraghian (CSIRO); Sara Khalifa (Queensland University of Technology); Kewen Wang (Griffith University) PDF Poster Video (Right click to download)
310	PME3D: An Adaptive and Efficient Multi-modal Feature Extraction Plug-in for 3D Object Detection TianyiYu (University of Glasgow); Lei Yu (Beihang University) PDF Poster Video (Right click to download)
313	OctreeNCA: Single-Pass 184 MP Segmentation on Consumer Hardware Nick Lemke (Technische Universität Darmstadt); John Kalkhof (Technical University of Darmstadt); Niklas Babendererde (Technische Universität Darmstadt); Anirban Mukhopadhyay (Technical University of Darmstadt) PDF Poster Video (Right click to download)
320	Extreme Model Compression with Structured Sparsity at Low Precision Dan Liu (McGill University); Nikita Dvornik (Palona AI); Xue Liu (Mohamed bin Zayed University of Artificial Intelligence) PDF Poster Video (Right click to download)
328	DepthHMR: Leveraging Depth Around Humans for Multi-Human Mesh Generation Nikhil Sharma (University of Illinois Chicago); Jiachen Tao (University of Illinois Chicago); Junyi Wu (University of Illinois Chicago); Yan Yan (University of Illinois Chicago) PDF Poster
330	FaceCrafter: Identity-Conditional Diffusion with Disentangled Control over Facial Pose, Expression, and Emotion Kazuaki Mishima (Institute of Science Tokyo); Antoni Bigata Casademunt (Imperial College London); Stavros Petridis (Imperial College London); Maja Pantic (Imperial College London); Kenji Suzuki (Institute of Science Tokyo) PDF Poster Video (Right click to download)
333	Fast Self-Supervised depth and mask aware Association for Multi-Object Tracking Milad Khanchi (Concordia University); Maria Amer (Concordia University); Charalambos Poullis (Concordia University) PDF Poster Video (Right click to download)
334	DAOVI: Distortion-Aware Omnidirectional Video Inpainting Ryosuke Seshimo (Keio University); Mariko Isogawa (Keio University) PDF Poster Video (Right click to download)
338	GLip: A Global-Local Integrated Progressive Framework for Robust Visual Speech Recognition Tianyue Wang (University of the Chinese Academy of Sciences); Shuang Yang (University of Chinese Academy of Sciences); Shiguang Shan (University of Chinese Academy of Sciences); Xilin CHEN (University of Chinese Academy of Sciences) PDF Poster Video (Right click to download)
339	Events Meet Dynamic Mode Decomposition: Capturing the Spatiotemporal Dynamics of Moving Objects Zhouning Du (AISIN CORPORATION); Israr Ulhaq (AISIN CORPORATION); Thanh Thi Huyen Phan (AISIN CORPORATION); Yuichiro Yoshimura (AISIN CORPORATION); Jigyasa Chand (AISIN CORPORATION); Truong Vinh Truong Duy (AISIN CORPORATION) PDF Poster Video (Right click to download)
342	Flatness-aware Curriculum Learning via Adversarial Difficulty Hiroaki Aizawa (Hiroshima University); Yoshikazu Hayashi (Gifu University) PDF Poster
346	Coarse Attribute Prediction with Task Agnostic Distillation for Real World Clothes Changing ReID Priyank Pathak (University of Central Florida); Yogesh S Rawat (University of Central Florida) PDF Poster Video (Right click to download)
348	PosBridge: Multi-View Positional Embedding Transplant for Identity-Aware Image Editing PEILIN XIONG (The University of Electro-Communications); Junwen Chen (The University of Electro-Communications); HONGHUI YUAN (The University of Electro-Communications); Keiji Yanai (The University of Electro-Communications) PDF Poster Video (Right click to download)
354	Dual-Branch Network via Multiple Illumination-Aware Representation Learning for Steel Surface Defect Classification Yong Seok Oh (Dongguk University); Min Geol Kim (Sogang University); Bogyeong Kim (Dongguk University); Jun Young Kim (Dongguk University); Hyeongseob Jo (Dongguk University); Jae Hyeon Park (Dongguk University); Gyoomin Lee (Dongguk University); Sung In Cho (Sogang University) PDF Poster Video (Right click to download)
357	What Can We Learn from Harry Potter? An Exploratory Study of Visual Representation Learning from Atypical Videos Qiyue Sun (University of Birmingham); Qiming Huang (University of Birmingham); Yang Yang (Shandong University); Hongjun Wang (Shandong University); Jianbo Jiao (University of Birmingham) PDF Poster
358	Pose-Robust Calibration Strategy for Point-of-Gaze Estimation on Mobile Phones Yujie Zhao (Institute of Computing Technology, Chinese Academy of Sciences); Jiabei Zeng (Institute of Computing Technology, Chinese Academy of Sciences); Shiguang Shan (Institute of Computing Technology, Chinese Academy of Sciences) PDF Poster Video (Right click to download)
359	SteerPose: Simultaneous Extrinsic Camera Calibration and Matching from Articulation Sang-Eun Lee (Kyoto University); Ko Nishino (Kyoto University); Shohei Nobuhara (Kyoto Institute of Technology) PDF Poster
364	Learning from Silence and Noise for Visual Sound Source Localization Xavier Juanola (Universitat Pompeu Fabra); Giovana Morais (New York University); Magdalena Fuentes (New York University); Gloria Haro (Universitat Pompeu Fabra) PDF Poster Video (Right click to download)
366	HuGeDiff: 3D Human Generation via Diffusion with Gaussian Splatting Maksym Ivashechkin (University of Surrey); Oscar Mendez (University of Surrey); Richard Bowden (University of Surrey) PDF Poster Video (Right click to download)
367	RUSplatting: Robust 3D Gaussian Splatting for Sparse-View Underwater Scene Reconstruction Zhuodong Jiang (University of Bristol); Haoran Wang (University of Bristol); Guoxi Huang (University of Bristol); Brett Seymour (National Park Service); Nantheera Anantrasirichai (University of Bristol) PDF Poster
370	RGB-Event Fusion for Robust Lane Detection Jingtao Dong (Beijing Institute of Technology); Hao Zhuang (Beijing Institute of Technology); Hao Yang (Beijing Institute of Technology); Liyuan Pan (Beijing Institute of Technology) PDF Poster Video (Right click to download)
373	Back To The Drawing Board: Rethinking Scene-Level Sketch-Based Image Retrieval Emil Demić (University of Ljubljana); Luka Čehovin Zajc (University of Ljubljana) PDF Poster Video (Right click to download)
376	Multi-Rationale Explainable Object Recognition via Contrastive Conditional Inference Ali Rasekh (L3S Research Center); Sepehr Kazemi Ranjbar (Independent Researcher); Simon Gottschalk (L3S Research Center) PDF Poster Video (Right click to download)
381	Permutation-Invariant Polar Harmonic Pooling for Point-based Neural Networks Jaspreet Singh Maan (University of Lincoln); Grzegorz Cielniak (University of Lincoln) PDF Poster Video (Right click to download)
391	Depth Inconsistency-based spatial-channel attention gate for Mirror Segmentation Ritsuki Kurohiji (Wakayama University); Hirotaka Hachiya (Wakayama University) PDF Poster Video (Right click to download)
392	UMM: A Unified Multi-Modal Model for Low-Level Vision Tasks with Dual-Driven Prompting Ziqi Luo ( South University of Science and Technology); Jinxiang Lai (The Hong Kong University of Science and Technology); Ruitao Chen (Southern University of Science and Technology); Jinyu Yang (tapall.ai); Bin-Bin Gao (Tencent); Qiang Nie (The Hong Kong University of Science and Technology); Jun Liu (Tencent YouTu Lab); Jinfan Wang (Southern University of Science and Technology); Feng Zheng (Southern University of Science and Technology) PDF Poster Video (Right click to download)
398	Knowledge Distillation via Cross Supervising with Attention for Remote Sensing Object Detection KefanZhan (Xiangtan University); An Luo (Xiangtan University); Yunpeng Zeng (Xiangtan University); Jiaxin Li (Xiangtan University); Yuan Zhang (Xiangtan University); Kai Hu (Xiangtan University) PDF Poster Video (Right click to download)
399	3D Shape Reconstruction from Autonomous Driving Radars Samah Hussein (École Polytechnique Fédérale de Lausanne (EPFL)); Junfeng Guan (École Polytechnique Fédérale de Lausanne (EPFL), Bosch Research); Swathi Shree Narashiman (École Polytechnique Fédérale de Lausanne (EPFL)); Saurabh Gupta (University of Illinois Urbana-Champaign); Haitham Al Hassanieh (École Polytechnique Fédérale de Lausanne (EPFL)) PDF Poster Video (Right click to download)
402	Log NeRF: Comparing Spaces for Learning Radiance Fields Sihe Chen (Northeastern University); Luv Verma (Northeastern University); Bruce A. Maxwell (Northeastern University) PDF Poster Video (Right click to download)
404	Exploring Histogram-based Color Constancy David R. Treadwell IV (Northeastern University); Yunxuan Rao (Northeastern University); Daniel Y. Bi (Northeastern University); Bruce A. Maxwell (Northeastern University) PDF Poster Video (Right click to download)
405	HVLO-YOLO: An Ultra-Lightweight Detection Model for High-voltage Line Obstacles Weichao Pan (Shandong Jianzhu University); Xu Wang (Shandong Jianzhu University); Chengze Lv (Shandong Jianzhu University); Zicheng Lin (Shandong Jianzhu University); Gongrui Wang (Shandong Jianzhu University); Xuening Zhang (Harbin Institute of Technology); Yi Sun (University of Ulster); Xingbo Liu (Shandong Jianzhu University) PDF Poster Video (Right click to download)
408	Multimodal Hate Detection Using Dual-Stream Graph Neural Networks Jiangbei Yue (University of Exeter); Shuonan Yang (University of Exeter); Tailin Chen (University of Exeter); Jianbo Jiao (University of Birmingham); Zeyu Fu (University of Exeter) PDF Poster Video (Right click to download)
416	Beyond Softmax: Dual-Branch Sigmoid Architecture for Accurate Class Activation Maps Yoojin Oh (Ewha Womans University); Junhyug Noh (Ewha Womans University) PDF Poster Video (Right click to download)
423	JDATT: A Joint Distillation Framework for Atmospheric Turbulence Mitigation and Target Detection Zhiming Liu (University of Bristo); Paul Hill (University of Bristol); Nantheera Anantrasirichai (University of Bristol) PDF Poster
427	Interactive Occlusion Boundary Estimation through Exploitation of Synthetic Data Lintao XU (Université Gustave Eiffel); Chaohui Wang (Université Gustave Eiffel) PDF Poster Video (Right click to download)
444	Answering from Sure to Uncertain: Uncertainty-Aware Curriculum Learning for Video Question Answering Haopeng Li (The University of Melbourne); Mohammed Bennamoun (University of Western Australia); Jun Liu (Lancaster University); Hossein Rahmani (Lancaster University); Qiuhong Ke (Monash University) PDF Poster Video (Right click to download)
445	Frequency-Temporal Feature Integration for Compressed Video Action Recognition Zhou Jiang wan (Beijing University of Posts and Telecommunications); Yue Ming (Beijing University of Posts and Telecommunications) PDF Poster Video (Right click to download)
453	HalfMix Augmentation and Regularized Dual-Path Learning for Cross-Domain Gaze Estimation Jiuk Hong (Kyungpook National University); Heechul Jung (Kyungpook National University) PDF Poster Video (Right click to download)
457	A Novel Local Focusing Mechanism for Deepfake Detection Generalization Mingliang Li (Jiangxi Normal University); Hanxi Li (Jiangxi Normal University); Lin Yuanbo Wu (The University of Warwick); Changhong Liu (Jiangxi Normal University) PDF Poster Video (Right click to download)
458	Intra-Modal Divergence-Weighted Distillation for Vision-Language Models Addad Youva (Université de Caen Basse Normandie); Alexis Lechervy (Université de Caen Basse Normandie); Frédéric Jurie (Université de Caen Normandie) PDF Poster Video (Right click to download)
461	FSF3A: Federated Spatial Feature Alignment and Adaptive Aggregation for Heterogeneous Brain Tumor Segmentation Peketi Divya (Indian Institute of Technology Hyderabad); C Krishna Mohan (Indian Institute of Technology Hyderabad); Sobhan Babu (Indian Institute of Technology Hyderabad); Sumanth Yenduri (Sam Houston State University) PDF Poster Video (Right click to download)
478	Bézier Curve-Based Stroke Extraction for Handwritten Characters Yuxuan TENG (Tokyo University of Science); Takuya Matsuzaki (Tokyo University of Science) PDF Poster Video (Right click to download)
479	CleverDistiller: Simple and Spatially Consistent Cross-modal Distillation Hariprasath Govindarajan (Linköping University, Qualcomm Inc); Maciej Wozniak (KTH Royal Institute of Technology); Marvin Klingner (Qualcomm Inc); Camille Maurice (Qualcomm Inc); B Ravi Kiran (Qualcomm Inc); Senthil Yogamani (Qualcomm Inc) PDF Poster Video (Right click to download)
486	FreqSelect: Frequency-Aware fMRI-to-Image Reconstruction Junliang Ye (Australian National University); Lei Wang (Griffith University); Md Zakir Hossain (Curtin University) PDF Poster
493	Audio-Visual Separation with Hierarchical Fusion and Representation Alignment Han Hu (University of Birmingham); Dongheng Lin (University of Birmingham); Qiming Huang (University of Birmingham); Yuqi Hou (University of Birmingham); Hyung Jin Chang (University of Birmingham); Jianbo Jiao (University of Birmingham) PDF Poster
495	An Explorative Study on Abstract Images and Visual Representations Learned from Them Haotian LI (University of Birmingham); Jianbo Jiao (University of Birmingham) PDF Poster Video (Right click to download)
499	One target to align them all: LiDAR, RGB and event cameras extrinsic calibration for Autonomous Driving Andrea Bertogalli (Politecnico di Milano); Giacomo Boracchi (Politecnico di Milano); Luca Magri (Politecnico di Milano) PDF Poster Video (Right click to download)
500	UFD-KD: Unified Frequency Decoupled Knowledge Distillation Lu Sihan (Beijing Institute of Technology); Yang Zheng (Institute of automation, Chinese Academy of Sciences); Jie Liu (Beijing University of Posts and Telecommunications); Zhenghao Xi (Shanghai University of Engineering Science) PDF Poster Video (Right click to download)
502	DefectGPT: Towards Multi-Class Defect Detection with Limited Electrical Samples Zhuoyi Lin (Institute for Infocomm Research, Agency for Science, Technology and Research (ASTAR)); Kaixin Xu (Institute for Infocomm Research, Agency for Science, Technology and Research (ASTAR)); Aye Phyu Phyu Aung (Institute for Infocomm Research, Agency for Science, Technology and Research (ASTAR)); Wen Qiu (Advanced Micro Devices (Singapore) Pte Ltd); Bernice Zee (Advanced Micro Devices (Singapore) Pte Ltd); Jiann Min Chin (Advanced Micro Devices (Singapore) Pte Ltd); Senthilnath Jayavelu (Institute for Infocomm Research, Agency for Science, Technology and Research (ASTAR)) PDF Poster Video (Right click to download)
507	Robust and Label-efficient Deep Waste Detection Hassan Abid (Mohamed bin Zayed University of Artificial Intelligence); Khan Muhammad (Sung Kyun Kwan University); Muhammad Haris Khan (Mohamed Bin Zayed University of Artificial Intelligence) PDF Poster Video (Right click to download)
509	Dynamic Convolution and Graph-Coupled Attention for Cross-Subject EEG-Vision Decoding Tianyu Zhang (Durham University); FAN WAN (Tongfang Knowledge Network Digital Technology Co., Ltd. China National Nuclear Corporation); Kaili Sun (Durham University); Xingyu Miao (Durham University); Yueming Sun (University of Durham); Minye Shao (Durham University); Yang Long (Durham University) PDF Poster Video (Right click to download)
511	LieMorph: Transformer-based Image Registration Using Flows on Lie Groups Johannes Bostelmann (Universität zu Lübeck); Jan Lellmann (Universität zu Lübeck) PDF Poster Video (Right click to download)
516	Making Rotation Averaging Fast and Robust with Anisotropic Coordinate Descent Yaroslava Lochman (Chalmers University of Technology); Carl Olsson (Lund University); Christopher Zach (Chalmers University of Technology) PDF Poster Video (Right click to download)
517	Catching the Unknown with Limited Data: Bi-Directional Prompt Tuning in CLIP for Few-Shot Open-Set Adaptation Moloud Abdar (The University of Queensland); Md Mehedi Hasan (Deakin University); Biplab Banerjee (Indian Institute of Technology Bombay); Abbas Khosravi (Deakin University); Pietro Lio (University of Cambridge) PDF Poster Video (Right click to download)
528	OmniSegNet: Towards Scalable, Efficient & Universal Medical Image Segmentation Soma Dasgupta (Tata Consultancy Services); Swarnava Dey (Tata Consultancy Services); Arijit Mukherjee (Tata Consultancy Services); Arpan Pal (Tata Consultancy Services) PDF Poster Video (Right click to download)
540	OptSplat: Recurrent Optimization for Generalizable Reconstruction and Novel View Renderings Vemburaj Chockalingam Yadav (German Research Center for Artificial Intelligence); Alain Pagani (German Research Center for Artificial Intelligence); Didier Stricker (German Research Center for Artificial Intelligence, RPTU) PDF Poster Video (Right click to download)
543	Temporally Compressed 3D Gaussian Splatting for Dynamic Scenes Saqib Javed (CVLab, EPFL); Ahmad Jarrar Khan (CVLab, EPFL); Corentin Dumery (CVLab, EPFL); Chen Zhao (CVLab, EPFL); Mathieu Salzmann (CVLab, EPFL, Swiss Data Science Center) PDF Poster Video (Right click to download)
546	Multi-Method Ensemble for Out-of-Distribution Detection Lucas Rakotoarivony (Thales, cortAIx Labs) PDF Poster Video (Right click to download)
548	Pandora: Articulated 3D Scene Graphs from Egocentric Vision Alan Yu (Massachusetts Institute of Technology); Yun Chang (Massachusetts Institute of Technology); Christopher Xie ( Meta Reality Labs); Luca Carlone (Massachusetts Institute of Technology) PDF Poster Video (Right click to download)
558	EZIGen: Enhancing zero-shot personalized image generation with precise subject encoding and decoupled guidance Zicheng Duan (The University of Adelaide); Yuxuan Ding (Qualcomm AI Research); Chenhui Gou (Monash University); Ziqin Zhou (The University of Adelaide); Ethan Smith (Leonardo.AI); Lingqiao Liu (The University of Adelaide) PDF Poster
566	OpenHuman4D: Open-Vocabulary 4D Human Parsing Keito Suzuki (University of California, San Diego); Bang Du (University of California, San Diego); Runfa Li (University of California, San Diego); Kunyao Chen (Qualcomm Inc); Lei Wang (Qualcomm Inc); Peng Liu (Qualcomm Inc); Ning Bi (Qualcomm Inc); Truong Nguyen (University of California, San Diego) PDF Poster Video (Right click to download)
567	Zero-Shot Anomaly Detection with Dual-Branch Prompt Selection Zihan Wang (McGill University); Samira Ebrahimi Kahou (University of Calgary); Narges Armanfard (McGill University) PDF Poster Video (Right click to download)
577	Zero-Shot CFC: Fast Real-World Image Denoising based on Cross-Frequency Consistency Yanlin Jiang (Beijing University of Technology); Yuchen Liu (Beijing University of Technology); Mingren Liu (Alibaba Cloud) PDF Poster Video (Right click to download)
588	CoT-SD: Chain-of-Thought Semantic Denoising Yanlin Jiang (Beijing University of Technology); Yuchen Liu (Beijing University of Technology); Mingren Liu (Alibaba Cloud) PDF Poster Video (Right click to download)
593	PanoHair: Detailed Hair Strand Synthesis on Volumetric Heads Shashikant Verma (Indian Institute of Technology Gandhinagar); Shanmuganathan Raman (Indian Institute of Technology Gandhinagar) PDF Poster Video (Right click to download)
594	Video Dataset Condensation with Diffusion Models Zhe Li (FAU Erlangen-Nürnberg); Hadrien Reynaud (Imperial College London); Mischa Dombrowski (FAU Erlangen-Nürnberg); Sarah Cechnicka (Imperial College London); Franciskus Xaverius Erick (FAU Erlangen-Nürnberg); Bernhard Kainz (FAU Erlangen-Nürnberg, Imperial College London) PDF Poster Video (Right click to download)
596	TopoMortar: A Dataset to Evaluate Topology Accuracy in Image Segmentation Juan Miguel Valverde (Technical University of Denmark); Motoya Koga (Sojo University); Nijihiko Otsuka (Sojo University); Anders Dahl (Technical University of Denmark) PDF Poster Video (Right click to download)
602	Dynamic Try-On: Taming Video Virtual Try-on with Dynamic Attention Mechanism Jun Zheng (SUN YAT-SEN UNIVERSITY); Jing Wang (SUN YAT-SEN UNIVERSITY); Fuwei Zhao (ByteDance Inc.); xujie zhang (SUN YAT-SEN UNIVERSITY); Xiaodan Liang (SUN YAT-SEN UNIVERSITY) PDF Poster Video (Right click to download)
603	Dual-Stream Adapters for Open-Set Segmentation in Driving Scenes Shyam Nandan Rai (Politecnico di Torino); Massimiliano Mancini (University of Trento); Barbara Caputo (Politecnico di Torino); Carlo Masone (Politecnico di Torino) PDF Poster Video (Right click to download)
610	HERO-VQL: Hierarchical, Egocentric and Robust Visual Query Localization Joohyun Chang (Kyung Hee University); Soyeon Hong (Kyung Hee University); Hyogun Lee (Kyung Hee University); Seong Jong Ha (AI R&D Division, CJ Group); Dongho Lee (AI R&D Division, CJ Group); Seong Tae Kim (Kyung Hee University); Jinwoo Choi (Kyung Hee University) PDF Poster Video (Right click to download)
614	PerSense: Training-Free Personalized Instance Segmentation in Dense Images Muhammad Ibraheem Siddiqui (Mohamed bin Zayed University of Artificial Intelligence); Muhammad Umer Sheikh (Mohamed bin Zayed University of Artificial Intelligence); Hassan Abid (Mohamed bin Zayed University of Artificial Intelligence); Muhammad Haris Khan (Mohamed Bin Zayed University of Artificial Intelligence) PDF Poster Video (Right click to download)
626	Beyond Gloss: A Hand-Centric Framework for Gloss-Free Sign Language Translation Sobhan Asasi (University of Surrey); Mohamed Ilyes Lakhal (University of Surrey); Ozge Mercanoglu Sincan (University of Surrey); Richard Bowden (University of Surrey) PDF Poster
627	TopoDiT-3D: Topology-Aware Diffusion Transformer with Bottleneck Structure for 3D Point Cloud Generation ZechaoGuan (Southeast University); Feng Yan (Meituan Company); Shuai Du (Southeast University); Lin Ma (Meituan Company); Qingshan Liu (Southeast University) PDF Poster Video (Right click to download)
628	MuDG: Taming Multi-modal Diffusion with Gaussian Splatting for Urban Scene Reconstruction Yingshuang Zou (Tsinghua University); Yikang Ding (Megvii Technology); Chuanrui Zhang (Tsinghua University); Jiazhe Guo (Tsinghua University); Bohan Li (Shanghai Jiaotong University); Xiaoyang Lyu (University of Hong Kong); Feiyang Tan (Mach Drive); Xiaojuan Qi (University of Hong Kong); Haoqian Wang (Tsinghua University) PDF Poster Video (Right click to download)
630	Incremental Multi-Scene Modeling via Continual Neural Graphics Primitives Prajwal Singh (IIT Gandhinagar); Ashish Tiwari (IIT Gandhinagar); Gautam Vashishtha (IIT Gandhinagar); Shanmuganathan Raman (IIT Gandhinagar) PDF Poster Video (Right click to download)
632	Segmentation Assisted Incremental Test Time Adaptation in an Open World Manogna Sreenivas (Indian Institute of Science); Soma Biswas (Indian Institute of Science) PDF Poster Video (Right click to download)
644	Solving Zero-Shot 3D Visual Grounding as Constraint Satisfaction Problems Qihao Yuan (University of Groningen); Kailai Li (University of Groningen); Jiaming Zhang (Karlsruhe Institute of Technology) PDF Poster Video (Right click to download)
646	Towards Data-Efficient Medical Imaging: A Generative and Semi-Supervised Framework Mosong Ma (Imperial College London); Tania Stathaki (Imperial College London); Michalis Lazarou (University of Surrey) PDF Poster Video (Right click to download)
649	Towards Open-Vocabulary Multimodal 3D Object Detection with Attributes Xinhao Xiang (University of California, Davis); Kuan-Chuan Peng (Mitsubishi Electric Research Laboratories (MERL)); Suhas Lohit (Mitsubishi Electric Research Laboratories (MERL)); Michael J. Jones (Mitsubishi Electric Research Laboratories (MERL)); Jiawei Zhang (University of California, Davis) PDF Poster Video (Right click to download)
650	Prompt-Informed Reinforcement Learning for Visual Coverage Path Planning Venkat Margapuri (Villanova University) PDF Poster Video (Right click to download)
656	3D Curvix: From Multiview 2D Edges to 3D Curve Segments Qiwu Zhang (Brown University); Chiang-Heng Chien (Brown University); Ricardo Fabbri (Universidade do Estado do Rio de Janeiro); Benjamin Kimia (Brown University) PDF Poster Video (Right click to download)
661	Leveraging Sparsity for Efficient Inference of High-Resolution Vision Foundation Models Xin Xu (University of Illinois Urbana-Champaign); Jason Kuen (Adobe); Brian L Price (Adobe); Kangning Liu (Adobe); Zijun Wei (Adobe); Yu-Xiong Wang (University of Illinois Urbana-Champaign) PDF Poster Video (Right click to download)
664	Canonical Makeup Transfer Xinyu Lin (CUHK-Shenzhen); Kun Zhou (Shenzhen University); Xiaoguang Han (CUHK-Shenzhen); Jiangbo Lu (SmartMore Corporation) PDF Poster Video (Right click to download)
666	Mitigating Hallucinations in Multimodal LLMs via Object-aware Preference Optimization Alberto Compagnoni (University of Modena and Reggio Emilia); Davide Caffagni (University of Modena and Reggio Emilia); Nicholas Moratelli (University of Modena and Reggio Emilia); Lorenzo Baraldi (University of Modena and Reggio Emilia ); Marcella Cornia (University of Modena and Reggio Emilia); Rita Cucchiara (University of Modena and Reggio Emilia) PDF Poster Video (Right click to download)
668	MLoRQ: Bridging Low-Rank and Quantization for Transformer Compression Ofir Gordon (Sony Semiconductor Israel); Ariel Lapid (Sony Semiconductor Israel); Elad Cohen (Sony Semiconductor Israel); Yarden Yagil (Sony Semiconductor Israel); Arnon Netzer (Sony Semiconductor Israel); Hai Victor Habi (Sony Semiconductor Israel) PDF Poster Video (Right click to download)
675	Jack of many Faces: A Step Towards Facial Expression and Physiological State Analysis with a Single Network Abdullah Tariq (Edith Cowan University); Martin Masek (Edith Cowan University); R Muhammad Atif Azad (Birmingham City University); Syed Zulqarnain Gilani (Edith Cowan University) PDF Poster Video (Right click to download)
680	On the Role of Individual Differences in Current Approaches to Computational Image Aesthetics Li-Wei Chen (KU Leuven); Ombretta Strafforello (KU Leuven); Anne-Sofie Maerten (KU Leuven); Tinne Tuytelaars (KU Leuven); Johan Wagemans (KU Leuven) PDF Poster
690	MonoGSDF: Exploring Monocular Geometric Cues for Gaussian Splatting-Guided Implicit Surface Reconstruction Kunyi Li (Technical University of Munich); Michael Niemeyer (Google); Zeyu Chen (Tsinghua University); Nassir Navab (Technical University of Munich); Federico Tombari (Google) PDF Poster Video (Right click to download)
694	Occam’s LGS: An Efficient Approach for Language Gaussian Splatting Jiahuan Cheng (Johns Hopkins University); Jan-Nico Zaech (INSAIT, Sofia University); Luc Van Gool (INSAIT, Sofia University); Danda Pani Paudel (INSAIT, Sofia University) PDF Poster
698	SAMWave: Adapting Segment Anything Model to difficult tasks Saurabh Yadav (IIIT Delhi); Avi Gupta (IIIT Delhi); Koteswar Rao Jerripothula (IIT Kanpur) PDF Poster Video (Right click to download)
700	Q-Align: Alleviating Attention Leakage in Zero-Shot Appearance Transfer via Query-Query Alignment Namu Kim (KT Corporation); Wonbin Kweon (University of Illinois Urbana-Champaign); Minsoo Kim (Pohang University of Science and Technology); Hwanjo Yu (Pohang University of Science and Technology) PDF Poster Video (Right click to download)
701	Category-level Text-to-Image Retrieval Improved: Bridging the Domain Gap with Diffusion Models and Vision Encoders Faizan Farooq Khan (King Abdullah University of Science and Technology); Vladan Stojnić (Czech Technical University in Prague); Zakaria Laskar (Czech Technical University in Prague); Mohamed Elhoseiny (King Abdullah University of Science and Technology); Giorgos Tolias (Czech Technical University of Prague) PDF Poster Video (Right click to download)
704	JOG3R: Towards 3D-Consistent Video Generators Chun-Hao Paul Huang (Adobe Systems); Niloy J. Mitra (Adobe Systems); Hyeonho Jeong (Adobe Systems); Jae Shin Yoon (Adobe Systems); Duygu Ceylan (Adobe Systems) PDF Poster Video (Right click to download)
705	ALSA: Anchors in Logit Space for Out-of-Distribution Accuracy Estimation Chenzhi Liu (The University of Queensland); Mahsa Baktashmotlagh (The University of Queensland); Yanran Tang (The University of Queensland); Zi Huang (University of Queensland); Ruihong Qiu (The University of Queensland) PDF Poster Video (Right click to download)
707	Catch Your Concepts: A Flexible Concept Locator for Interpretable Visual Recognition Qiyang Wan (Institute of Computing Technology, Chinese Academy of Sciences (CAS)); Ruiping Wang (Institute of Computing Technology, Chinese Academy of Sciences (CAS)); Chengzhi Gao (Institute of Computing Technology, Chinese Academy of Sciences (CAS)); Xilin CHEN (Institute of Computing Technology, Chinese Academy of Sciences (CAS)) PDF Poster Video (Right click to download)
709	TAPM-Net: Trajectory-Aware Perturbation Modeling for Infrared Small Target Detection Hongyang Xie (The University of Warwick); Hongyang He (The University of Warwick); Victor Sanchez (The University of Warwick) PDF Poster
716	Size-aware Contrastive Imitation Learning for Language-conditioned Multi-task Robotic Manipulation Jiakai Huang (South China Normal University); Weiping Zheng (South China Normal University) PDF Poster Video (Right click to download)
717	From Open Vocabulary to Open World: Teaching Vision Language Models to Detect Novel Objects Zizhao Li (University of Melbourne); Zhengkang Xiang (University of Melbourne); Joseph West (University of Melbourne); Kourosh Khoshelham (University of Melbourne) PDF Poster Video (Right click to download)
718	PADS: Plug-and-Play 3D Human Pose Analysis via Diffusion Generative Modeling Haorui Ji (Australian National University); Hongdong Li (Australian National University) PDF Poster
721	Gromov Wasserstein Optimal Transport for Semantic Correspondences Francis Snelgar (Australian National University); Stephen Gould (Australian National University); Ming Xu (Australian National University); Liang Zheng (Australian National University); Akshay Asthana (Seeing Machines) PDF Poster Video (Right click to download)
722	Unsupervised Multimodal Deepfake Detection Through Explicit Intra-Modal and Cross-Modal Inconsistency Discovery Mulin Tian (University of Southern California); Mahyar Khayatkhoei (University of Southern California); Joe Mathai (University of Southern California); Wael AbdAlmageed (Clemson University) PDF Poster Video (Right click to download)
732	TryOffDiff: Virtual-Try-Off via High-Fidelity Garment Reconstruction using Diffusion Models Riza Velioglu (Universität Bielefeld); Petra Bevandić (Universität Bielefeld); Robin Chan (Technische Universität Berlin); Barbara Hammer (Universität Bielefeld) PDF Poster Video (Right click to download)
736	EWMBench: Evaluating Scene, Motion, and Semantic Quality in Embodied World Models Yue Hu (AgiBot, Harbin Institute of Technology); Siyuan Huang (Shanghai Jiao Tong University); Yue Liao (National University of Singapore); Shengcong Chen (Agibot); Pengfei Zhou (Agibot); Liliang Chen (AgiBot); Guanghui Ren (AgiBot); Maoqing Yao (Agibot) PDF Poster Video (Right click to download)
737	Self-Intersection-Aware 3D Human Motion Generation Using an Efficient Human Sphere Proxy Pascal Herrmann (Bosch Research); Maarten Bieshaar (Bosch Research); Dennis Mack (Bosch Research); Paul Robert Herzog (Bosch Research); Juergen Gall (University of Bonn) PDF Poster Video (Right click to download)
745	Mapping like a Skeptic: Probabilistic BEV Projection for Online HD Mapping Fatih Erdoğan (Koç University); Merve Rabia Barin (Koç University); Fatma Güney (Koç University) PDF Poster Video (Right click to download)
751	TPA: Temporal Prompt Alignment for Fetal Congenital Heart Defect Classification Darya Taratynova (Mohamed bin Zayed University of Artificial Intelligence); Alya Almsouti (Mohamed bin Zayed University of Artificial Intelligence); Beknur Kalmakhanbet (Mohamed bin Zayed University of Artificial Intelligence); Numan Saeed (Mohamed bin Zayed University of Artificial Intelligence); Mohammad Yaqub (Mohamed bin Zayed University of Artificial Intelligence) PDF Poster Video (Right click to download)
753	Boosting Camera Motion Control for Video Diffusion Transformers Soon Yau Cheong (University of Surrey); Duygu Ceylan (Adobe Research); Armin Mustafa (University of Surrey); Andrew Gilbert (University of Surrey); Chun-Hao Paul Huang (Adobe Research) PDF Poster Video (Right click to download)
755	Audio-Guided Visual Editing with Complex Multi-Modal Prompts Hyeonyu Kim (MAUM AI Inc.); Seokhoon Jeong (Ulsan National Institute of Science and Technology); Seonghee Han (Ulsan National Institute of Science and Technology); Chanhyuk Choi (Ulsan National Institute of Science and Technology); Taehwan Kim (Ulsan National Institute of Science and Technology) PDF Poster Video (Right click to download)
762	Learning a Neural Association Network for Self-supervised Multi-Object Tracking Shuai Li (University of Bonn); Michael Burke (Monash University); Subramanian Ramamoorthy (University of Edinburgh); Juergen Gall (University of Bonn) PDF Poster Video (Right click to download)
766	Specify and Edit: Overcoming Ambiguity in Text-Based Image Editing Ekaterina Iakovleva (Télécom Paris); Fabio Pizzati (MBZUAI); Philip Torr (University of Oxford); Stéphane Lathuilière (INRIA) PDF Poster Video (Right click to download)
768	Interpretable Text-Guided Image Clustering via Iterative Search Bingchen Zhao (University of Edinburgh); Oisin Mac Aodha (University of Edinburgh) PDF Poster
770	DTFSal: Audio-Visual Dynamic Token Fusion for Video Saliency Prediction Kiana Hooshanfar (University of Tehran); Alireza Hosseini (University of Tehran); Mona Ahmadian (University of Surrey); Ahmad Kalhor (University of Tehran); Babak N Araabi (University of Tehran) PDF Poster Video (Right click to download)
782	Is Safety Checker Still Safe? A Study on the Covert NSFW Text Xin Li (Shanghai Fudan Microelectronics Group CO.,LTD.); Kai Chen (Shanghai Fudan Microelectronics Group CO.,LTD.); XUE YANG (Shanghai Fudan Microelectronics Group CO.,LTD.); Weijun Shan (Shanghai Fudan Microelectronics Group CO.,LTD.); Jun Yu (Shanghai Fudan Microelectronics Group CO.,LTD.); Qing Li (Shanghai Fudan Microelectronics Group CO.,LTD.) PDF Poster
784	UDT : Unsupervised Discovery of Transformations between Fine-Grained Classes in Diffusion Models Youngjae Choi (Soongsil University); Hyunsuh Koh (Soongsil University); Hojae Jeong (Soongsil University); ByungKwan Chae (Soongsil University); Sungyong Park (Soongsil University); Heewon Kim (Soongsil University) PDF Poster Video (Right click to download)
786	TAG: A Simple Yet Effective Temporal-Aware Approach for Zero-Shot Video Temporal Grounding Jin-Seop Lee (Sungkyunkwan University); SungJoon Lee (Sungkyunkwan University); Jaehan Ahn (Sungkyunkwan University); YunSeok Choi (Sungkyunkwan University); Jee-Hyong Lee (Sungkyunkwan University) PDF Poster Video (Right click to download)
787	${C}^{3}$-GS: Learning Context-aware, Cross-dimension, Cross-scale Feature for Generalizable Gaussian Splatting Yuxi Hu (Graz University of Technology); Jun Zhang (Graz University of Technology); Kuangyi Chen (Graz University of Technology); Zhe Zhang (Peking University); Friedrich Fraundorfer (Graz University of Technology) PDF Poster Video (Right click to download)
788	Vision Backbone Efficient Selection for Image Classification in Low-Data Regimes Joris Guerin (IRD); Shray Bansal (Georgia Institute of Technology); Amirreza Shaban (Field AI); Paulo Mann (Federal University of Rio de Janeiro); Harshvardhan Gazula (Massachusetts Institute of Technology) PDF Poster Video (Right click to download)
811	DEAD: Data-Efficient Audiovisual Dubbing using Neural Rendering Priors Jack Saunders (University of Bath); Vinay Namboodiri (University of Bath) PDF Poster Video (Right click to download)
824	A Unified Framework for High-Frame-Rate HDR Video Synthesis Hue Nguyen (York University); Trevor Dalton Canham (York University); Michael S. Brown (York University) PDF Poster Video (Right click to download)
825	Modular Embedding Recomposition for Incremental Learning Aniello Panariello (University of Modena and Reggio Emilia); Emanuele Frascaroli (University of Modena and Reggio Emilia); Pietro Buzzega (University of Modena and Reggio Emilia); Lorenzo Bonicelli (University of Modena and Reggio Emilia); Angelo Porrello (University of Modena and Reggio Emilia); Simone Calderara (University of Modena and Reggio Emilia) PDF Poster Video (Right click to download)
827	MonoTracker: Monocular RGB-Only 6D Tracking of Unknown Object Zilong Deng (ETH Zürich); Shaochang Tan (Universität Zürich); Zuria Bauer (ETH Zürich); Daniel Barath (ETH Zürich); Marc Pollefeys (Microsoft) PDF Poster Video (Right click to download)
828	Leveraging Modality Tags for Enhanced Cross-Modal Video Retrieval Adriano Fragomeni (University of Bristol); Dima Damen (University of Bristol); Michael Wray (University of Bristol) PDF Poster Video (Right click to download)
831	LOGen: Toward LiDAR Object Generation by Point Diffusion Ellington Kirby (Valeo.ai); Mickael Chen (Valeo.ai); Renaud Marlet (Valeo.ai); Nermin Samet (Valeo.ai) PDF Poster Video (Right click to download)
832	SIMULDITEX: Single Image Multiscale & Lightweight Diffusion for Texture Modelling Pierrick Chatillon (Université de Caen Normandie); Julien Rabin (Université de Caen Normandie); David Tschumperlé (Université de Caen Normandie) PDF Poster Video (Right click to download)
835	Cloud-Stereo: A Dataset and Benchmark for Reconstructing Atmospheric Clouds from Stereo Images Jacob Lin (University of Oxford); Edward Gryspeerdt (Imperial College London); Ronald Clark (University of Oxford) PDF Poster Video (Right click to download)
850	C-SWAP: Explainability-Aware Structured Pruning for Efficient Neural Networks Compression Baptiste Bauvin (Thales); Loïc Baret (Thales); Ola Ahmad (Thales) PDF Poster Video (Right click to download)
851	Asymmetric Event-Image Stereo with Temporal Feature Gating and Iterative Structure-Detail Refinement Hao Zhuang (Beijing Institute of Technology); Yan Yang (Australian National University); Liyuan Pan (Beijing Institute of Technology) PDF Poster Video (Right click to download)
852	LoFT: LoRA-fused Training Dataset Generation with Few-shot Guidance Jae Myung Kim (University of Tuebingen); Stephan Alaniz (Télécom Paris); Cordelia Schmid (Inria, Ecole normale supérieure); Zeynep Akata (Technische Universität München) PDF Poster Video (Right click to download)
857	Lost in Time: A New Temporal Benchmark for VideoLLMs Daniel Cores (University of Santiago de Compostela); Michael Dorkenwald (University of Amsterdam); Manuel Mucientes (Universidad de Santiago de Compostela); Cees G. M. Snoek (University of Amsterdam); Yuki M Asano (University of Technology Nuremberg) PDF Poster Video (Right click to download)
859	Improving Multimodal Distillation for 3D Semantic Segmentation under Domain Shift Björn Michele (Université de Bretagne Sud); Alexandre Boulch (valeo.ai); Gilles Puy (valeo.ai); Tuan-Hung VU (valeo.ai); Renaud Marlet (valeo.ai); Nicolas Courty (IRISA) PDF Poster Video (Right click to download)
864	Geometry-Aware Diffusion Models for Multiview Scene Inpainting Ahmad Salimi (York University); Tristan Ty Aumentado-Armstrong (York University); Marcus A Brubaker (Google DeepMind); Konstantinos G. Derpanis (York University) PDF Poster Video (Right click to download)
865	FootFormer: Estimating Stability from Visual Input Keaton Yukio Kraiger (Pennsylvania State University); Jingjing Li (Pennsylvania State University); Skanda Bharadwaj (Pennsylvania State University); Jesse Scott (Scientific Applications & Research Associates (SARA), Inc.); Robert T. Collins (Pennsylvania State University); Yanxi Liu (Pennsylvania State University) PDF Poster
866	Image Recognition with Vision and Language Embeddings of VLMs Illia Volkov (Czech Technical University of Prague); Nikita Kisel (Czech Technical Univeresity in Prague); Klara Janouskova (Czech Technical Univeresity in Prague); Jiri Matas (Czech Technical University in Prague) PDF Poster
873	Exploring Image Representation with Decoupled Classical Visual Descriptors Chenyuan Qu (University of Birmingham); Hao Chen (University of Cambridge); Jianbo Jiao (University of Birmingham) PDF Poster Video (Right click to download)
875	Lost in Translation? Vocabulary Alignment for Source-Free Adaptation in Open-Vocabulary Semantic Segmentation Silvio Mazzucco (The Good AI Lab); Carl Persson (The Good AI Lab); Mattia Segu (The Good AI Lab); Pier Luigi Dovesi (The Good AI Lab); Federico Tombari (Google); Luc Van Gool (INSAIT, Sofia University); Matteo Poggi (University of Bologna) PDF Poster Video (Right click to download)
880	Distortion-Aware Multi-Object Tracking via Virtual Plane Projection in Overhead Fisheye Cameras Panithi Vanasirikul (OxygenAI); Piyanon Charoenpoonpanich (OxygenAI); Ekapol Chuangsuwanich (Chulalongkorn University) PDF Poster Video (Right click to download)
887	QWD-GAN: Quality-aware Wavelet-driven GAN for Unsupervised Medical Microscopy Images Denoising Qijun Yang (The University of Manchester); Yating Huang (The University of Manchester); Lintao Xiang (The University of Manchester); Hujun Yin (The University of Manchester) PDF Poster Video (Right click to download)
896	Llama Learns to Direct: DirectorLLM for Human-Centric Video Generation Kunpeng Song (The State University of New Jersey); Tingbo Hou (GenAI at Meta); Zecheng He (GenAI at Meta); Haoyu Ma (GenAI at Meta); Jialiang Wang (GenAI at Meta); Animesh Sinha (GenAI at Meta); Sam Tsai (GenAI at Meta); Yaqiao Luo (GenAI at Meta); Xiaoliang Dai (GenAI at Meta); Li Chen (GenAI at Meta); Xide Xia (GenAI at Meta); Peizhao Zhang (GenAI at Meta); Peter Vajda (GenAI at Meta); Ahmed M. Elgammal (The State University of New Jersey); Felix Juefei-Xu (GenAI at Meta) PDF Poster Video (Right click to download)
898	FaceCPT: Toward Cross-Modal Facial Representation Learning with Face-Caption Pre-Training Md Mahedi Hasan (West Virginia University); Shoaib Meraj Sami (West Virginia University); Nasser Nasrabadi (West Virginia University); Jeremy M. Dawson (West Virginia University) PDF Poster Video (Right click to download)
900	Time-Scaling State-Space Models for Dense Video Captioning AJ Piergiovanni (Google Deepmind); Ganesh Satish Mallya (Google Deepmind); Dahun Kim (Google Deepmind); Anelia Angelova (Google Deepmind) PDF Poster
902	ITC-RWKV: Interactive Tissue–Cell Modeling with Recurrent Key-Value Aggregation for Histopathological Subtyping Yating Huang (The University of Manchester); Qijun Yang (The University of Manchester); Lintao Xiang (The University of Manchester); Hujun Yin (The University of Manchester) PDF Poster Video (Right click to download)
903	Toward Robust Audio-Visual Synchronization Detection in Egocentric Video with Sparse Synchronization Events Jordan Voas (The University of Texas at Austin); Wei-Cheng Tseng (The University of Texas at Austin); Benoit Vallade (Amazon Prime Video); Alex Mackin (Amazon Prime Video); David Higham (Amazon Prime Video); David Harwath (The University of Texas at Austin) PDF Poster Video (Right click to download)
914	Improving Human Motion Plausibility with Body Momentum Ha Linh Nguyen (National University of Singapore); Tze Ho Elden Tse (National University of Singapore); Angela Yao (National University of Singapore) PDF Poster Video (Right click to download)
915	DualDistill: A Unified Cross-Modal Knowledge Distillation Framework for Camera-Based BEV Representation Gaeun Kim (Seoul National University of Science and Technology); Daeil Han (Seoul National University of Science and Technology); Yeong Jun Koh (Chungnam National University); Hanul Kim (Seoul National University of Science and Technology) PDF Poster Video (Right click to download)
922	FSLC: Fast Scoring with Learnable Coreset for Zero-shot Industrial Anomaly Detection Songtao Ni (Shanghai Jiaotong University); Yuxin Li (Shanghai Jiaotong University); Xu Zhao (Shanghai Jiao Tong University) PDF Poster Video (Right click to download)
931	Diffusion Transformer-to-Mamba Distillation for High-Resolution Image Generation Yuan Yao (University of Rochester); Yicong Hong (Adobe Research); Difan Liu (Adobe Research); Long Mai (Adobe Research); Feng Liu (Adobe Research); Jiebo Luo (University of Rochester) PDF Poster Video (Right click to download)
938	Grad-CL: Source Free Domain Adaptation with Gradient Guided Feature Disalignment Rini Smita Thakur (Indian Institute of Science Education and Research Bhopal); Rajeev Ranjan Dwivedi (Indian Institute of Science Education and Research Bhopal); Vinod K. Kurmi (Indian Institute of Science Education and Research Bhopal) PDF Poster Video (Right click to download)
940	Semi-MoE: Mixture-of-Experts meets Semi-Supervised Histopathology Segmentation Nguyen Lan Vi Vu (Ho Chi Minh City University of Technology); Thanh-Huy Nguyen (Carnegie Mellon University); Thien Nguyen (Posts and Telecommunications Institute of Technology); Daisuke Kihara (Purdue University); Tianyang Wang (University of Alabama at Birmingham); Xingjian Li (Carnegie Mellon University); Min Xu (Carnegie Mellon University) PDF Poster Video (Right click to download)
943	HEAL: Learning-Free Source Free Unsupervised Domain Adaptation for Cross-Modality Medical Image Segmentation Yulong Shi (Northeastern University); Jiapeng Li (Northeastern University); Lin Qi (Northeastern University) PDF Poster
948	Tracking Meets Large Multimodal Models for Driving Scenario Understanding Ayesha Ishaq (Mohamed bin Zayed University of Artificial Intelligence); Jean Lahoud (Mohamed bin Zayed University of Artificial Intelligence); Fahad Shahbaz Khan (Mohamed bin Zayed University of Artificial Intelligence); Salman Khan (Mohamed bin Zayed University of Artificial Intelligence); Hisham Cholakkal (Mohamed bin Zayed University of Artificial Intelligence); Rao Muhammad Anwer (Mohamed bin Zayed University of Artificial Intelligence) PDF Poster Video (Right click to download)
949	Prompt Image to Watch and Hear: Multimodal Prompting for Parameter-Efficient Audio-Visual Learning Kai Wang (University of Toronto); Shentong Mo (Carnegie Mellon University); Yapeng Tian (University of Texas at Dallas); Dimitrios Hatzinakos (University of Toronto) PDF Poster Video (Right click to download)
955	Clean Sample Selection and Noisy Sample Rematching for Text-Based Pedestrian Retrieval Daiqiang Li (Sichuan University); Weicheng Zhang (Sichuan University); yuanyuan wu (Chengdu University of Technology); Honggang Chen (Sichuan University) PDF Poster Video (Right click to download)
960	A Hybrid Framework Bridging CNN and ViT based on Theory of Evidence for Diabetic Retinopathy Grading Junlai Qiu (Hainan University); Yunzhu Chen (Guangxi Polytechnic of Construction); Hao Zheng (Tencent Jarvis Lab); Yawen Huang (Tencent Jarvis Lab); Yuexiang Li (Guangxi Medical University) PDF Poster
962	LaMamba-Diff: Linear-Time High-Fidelity Diffusion Models Based on Local Attention and Mamba Yunxiang Fu (University of Hong Kong); Chaoqi Chen (Shenzhen University); Yizhou Yu (The University of Hong Kong) PDF Poster Video (Right click to download)
976	Advancing Utility Pole and Sign Detection Through Deep Learning Carl Dickinson (University of Strathclyde); Gaetano Di Caterina (University of Strathclyde) PDF Poster Video (Right click to download)
981	Ev4DGS: Novel-view Rendering of Non-Rigid Objects from Monocular Event Streams Takuya Nakabayashi (Keio University); Navami Kairanda (Max Planck Institute for Informatics); Hideo Saito (Keio University); Vladislav Golyanik (Max Planck Institute for Informatics) PDF Poster Video (Right click to download)
986	Visible Structure Retrieval for Lightweight Image-Based Relocalisation Fereidoon Zangeneh (KTH Royal Institute of Technology); Leonard Bruns (KTH Royal Institute of Technology); Amit Dekel (Univrses AB); Alessandro Pieropan (Univrses AB); Patric Jensfelt (KTH Royal Institute of Technology) PDF Poster Video (Right click to download)
987	Spatiotemporal Event Spotting via 3D Heatmaps with Dynamically Shifted Gaussian Kernels Ankhzaya Jamsrandorj (Korea Institute of Science and Technology); VANYI CHAO (University of Science and Technology); Hoang Quoc Nguyen (Korea Institute of Science and Technology); Yin May Oo (University of Science and Technology); Muhammad Amrulloh Robbani (University of Science and Technology); Yewon Hwang (Korea Institute of Science and Technology); Kyung-Ryoul Mun (Korea Institute of Science and Technology); Jinwook Kim (Korea Institute of Science and Technology) PDF Poster Video (Right click to download)
988	Ink Enhancement for Ancient Bamboo Manuscripts Using Iterative Restoration-Degradation Adversarial Learning Chongsheng ZHANG (Henan University, Ludwig-Maximilians-Universität München); Junchao Ma (Henan Univeristy); Wenhao Zhang (Henan Univeristy); Kamel Aouaidjia (Henan Univeristy); Qilong Li (Henan Univeristy); Gaojuan Fan (Henan Univeristy); Christian Heumann (Ludwig-Maximilians-Universität München) PDF Poster Video (Right click to download)
989	MedOpenSeg: Open-World Medical Segmentation with Memory-Augmented Transformers Luisa Vargas (Eurecom); Eleonora Poeta (Politecnico di Torino); Tania Cerquitelli (Politecnico di Torino); Elena Baralis (Politecnico di Torino); Maria A Zuluaga (Eurecom) PDF Poster Video (Right click to download)
992	Dual Polarity Prompts with Stochastic Entropy Perturbation for Label Noise Changhui Hu (Universitat de Barcelona); Bhalaji Nagarajan (Barcelona Supercomputing Center); Ricardo Marques (Universitat Pompeu Fabra); Petia Radeva Ivanova (Universitat de Barcelona) PDF Poster Video (Right click to download)
999	Lid-Lab-NeRF: Generating Temporally Consistent, Labelled LiDAR Point Clouds using Neural Radiance Fields Shrestha Srivastava (Indian Institute of Science Education and Research Bhopal); Vaibhav Kumar (Indian Institute of Science Education and Research Bhopal) PDF Poster Video (Right click to download)
1004	RP-SAM2: Refining Point Prompts for Stable Surgical Instrument Segmentation Nuren Zhaksylyk (Mohamed bin Zayed University of Artificial Intelligence); Ibrahim Almakky (Mohamed bin Zayed University of Artificial Intelligence); Jay Nitin Paranjape (Johns Hopkins University); S. Swaroop Vedula (Johns Hopkins University); Shameema Sikder (Johns Hopkins University); Vishal M. Patel (Johns Hopkins University); Mohammad Yaqub (Mohamed bin Zayed University of Artificial Intelligence) PDF Poster Video (Right click to download)
1006	Verifier Matters: Enhancing Inference-Time Scaling for Video Diffusion Models Lorenzo Baraldi (University of Pisa); Davide Bucciarelli (University of Pisa); Zifan Zeng (Technische Universität München); Chongzhe Zhang (Technische Universität Berlin); Qunli Zhang (Huawei Technologies Ltd.); Marcella Cornia (University of Modena and Reggio Emilia); Lorenzo Baraldi (University of Modena and Reggio Emilia); Feng Liu (Huawei Technologies Ltd.); Zheng Hu (Huawei Technologies Ltd.); Rita Cucchiara (University of Modena and Reggio Emilia) PDF Poster Video (Right click to download)
1007	Is Structural Awareness the Key to Event Camera Data Cleansing for Enhancing Veracity? Haiyu Li (The University of Sheffield); Charith Abhayaratne (The University of Sheffield) PDF Poster Video (Right click to download)
1013	SALT: Parameter-Efficient Fine-Tuning via Singular Value Adaptation with Low-Rank Transformation Abdelrahman Elsayed (Mohamed bin Zayed University of Artificial Intelligence); Sarim Hashmi (Mohamed bin Zayed University of Artificial Intelligence); Mohammed Elseiagy (Mohamed bin Zayed University of Artificial Intelligence); Hu Wang (Mohamed bin Zayed University of Artificial Intelligence); Mohammad Yaqub (Mohamed bin Zayed University of Artificial Intelligence); Ibrahim Almakky (Mohamed bin Zayed University of Artificial Intelligence) PDF Poster Video (Right click to download)
1015	In-Model Merging for Enhancing the Robustness of Medical Imaging Classification Models Hu Wang (Mohamed bin Zayed University of Artificial Intelligence); Ibrahim Almakky (Mohamed bin Zayed University of Artificial Intelligence); Congbo Ma (New York University); Numan Saeed (Mohamed bin Zayed University of Artificial Intelligence); Mohammad Yaqub (Mohamed bin Zayed University of Artificial Intelligence) PDF Poster Video (Right click to download)
1024	Quantifying Risk in Pedestrian Crowds Using Divergence Estimated from Flows of Head-Tracking Data Haruto Nakayama (University of Tsukuba); Masaki Onishi (AIST) PDF Poster Video (Right click to download)
1029	PrIINeR: Towards Prior-Informed Implicit Neural Representations for Accelerated MRI Ziad Al-Haj Hemidi (Universität zu Lübeck); Eytan Kats (Universität zu Lübeck); Mattias P. Heinrich (Universität zu Lübeck) PDF Poster Video (Right click to download)
1030	Language-Guided Decision Override for Adaptive and Retraining-Free Video Anomaly Detection Ryo Moriyama (Aoyama Gakuin University); Shin Suzuki (Aoyama Gakuin University); Naoshi Kaneko (Tokyo Denki University); Kazuhiko Sumi (Aoyama Gakuin University) PDF Poster Video (Right click to download)
1035	Superpixel Anything: A general object-based framework for accurate yet regular superpixel segmentation Julien Walther (University of Bordeaux); Rémi Giraud (University of Bordeaux); Michaël Clément (University of Bordeaux) PDF Poster Video (Right click to download)
1036	Benchmarking Vision Foundation Models for Input Monitoring in Autonomous Driving Mert Keser (Continental AG, Technische Universität München); Halil Ibrahim Orhan (Technische Universität München); Niki Amini-Naieni (University of Oxford); Gesina Schwalbe (Universität zu Lübeck); Alois C. Knoll (Technical University Munich); Matthias Rottmann (Universität Osnabrück) PDF Poster Video (Right click to download)
1037	Generative Data Augmentation for Object Point Cloud Segmentation Dekai Zhu (Technische Universität München); Stefan Gavranovic (Siemens AG); Flavien Boussuge (Siemens AG); Benjamin Busam (Technische Universität München); Slobodan Ilic (Technische Universität München) PDF Poster Video (Right click to download)
1051	Conditional Prototype Learning for Few-Shot Object Detection Zhenwei He (Chongqing University of Technology); Xinye Liao (Chongqing University of Technology); Xin Feng (Chongqing University of Technology) PDF Poster Video (Right click to download)
1058	Atomizer: Generalizing to unseen modalities by breaking images down to a set of scalars Hugo Riffaud de Turckheim (INRIA); Diego Marcos (INRIA); Roberto Interdonato (CIRAD); Sylvain Lobry (Université de Montpellier) PDF Poster Video (Right click to download)
1060	Learning Correlation-aware Aleatoric Uncertainty for 3D Hand Pose Estimation Lee Chae-Yeon (Pohang University of Science and Technology); Nam Hyeon-Woo (Pohang University of Science and Technology); Tae-Hyun Oh (Korea Advanced Institute of Science and Technology) PDF Poster Video (Right click to download)
1061	IPGPhormer: Interpretable Pathology Graph-Transformer for Survival Analysis Guo Tang (Harbin Institute of Technology Shenzhen); Songhan Jiang (Harbin Institute of Technology Shenzhen); Jinpeng Lu (University of Science and Technology of China); Linghan Cai (Harbin Institute of Technology Shenzhen); Yongbing Zhang (Harbin Institute of Technology Shenzhen) PDF Poster Video (Right click to download)
1062	Calibration-Aware Prompt Learning for Medical Vision-Language Models Abhishek Basu (Mohamed bin Zayed University of Artificial Intelligence); Fahad Shamshad (Mohamed bin Zayed University of Artificial Intelligence); Ashshak Sharifdeen (Mohamed bin Zayed University of Artificial Intelligence); Karthik Nandakumar (Michigan State University); Muhammad Haris Khan (Mohamed Bin Zayed University of Artificial Intelligence) PDF Poster Video (Right click to download)
1064	3D-WAG: Hierarchical Wavelet-Guided Autoregressive Generation for High-Fidelity 3D Shapes Tejaswini Medi (Universität Mannheim); Arianna Rampini (Autodesk); Pradyumna Reddy (Autodesk); Pradeep Kumar Jayaraman (Autodesk); Margret Keuper (Universität Mannheim) PDF Poster Video (Right click to download)
1071	Stabilizing Open-Set Test-Time Adaptation via Primary-Auxiliary Filtering and Knowledge-Integrated Prediction Byung-Joon Lee (Sungkyunkwan University); Jin-Seop Lee (Sungkyunkwan University); Jee-Hyong Lee (Sungkyunkwan University) PDF Poster Video (Right click to download)
1072	Controllable Garment Generation with Multi-Modal Diffusion Guidance Sanhita Pathak (IIT Delhi); Vinay Kaushik (IIIT Sonepat); Brejesh Lall (IIT Delhi) PDF Poster Video (Right click to download)
1076	CLFSeg: A Fuzzy-Logic based Solution for Boundary Clarity and Uncertainty Reduction in Medical Image Segmentation Anshul Kaushal (Panjab University); Kunal Jangid (Indian Institute of Science Education and Research Bhopal); Vinod K. Kurmi (Indian Institute of Science Education and Research Bhopal) PDF Poster Video (Right click to download)
1077	Towards Sharper Object Boundaries in Self-Supervised Depth Estimation Aurélien Cecille (LIRIS); Stefan Duffner (LIRIS); Franck Davoine (LIRIS); Rémi Agier (Visual Behavior); Thibault Neveu (Visual Behavior) PDF Poster Video (Right click to download)
1081	ImProvShow: Multimodal Fusion for Image Provenance Summarization Alexander Black (University of Surrey); Jing Shi (Adobe Research); Yifei Fan (Adobe Research); John Collomosse (Adobe Research) PDF Poster Video (Right click to download)
1091	CLIMB-3D: Class-Incremental Imbalanced 3D Instance Segmentation Vishal Thengane (University of Surrey); Jean Lahoud (Mohamed bin Zayed University of Artificial Intelligence); Hisham Cholakkal (Mohamed bin Zayed University of Artificial Intelligence); Rao Muhammad Anwer (Mohamed bin Zayed University of Artificial Intelligence); Lu Yin (University of Surrey); Xiatian Zhu (University of Surrey); Salman Khan (Mohamed bin Zayed University of Artificial Intelligence) PDF Poster Video (Right click to download)
1095	Supervised Segmentation Model for Improved Detection of OSSN using Slit Lamp Images Kajal Singh (Indian Institute of Technology Mandi); Shagun Bhatt (Indian Institute of Technology Mandi); Ramkailash Gujar (Dr Shroff’s Charity Eye Hospital ); Arnav Bhavsar (Indian Institute of Technology Mandi); Dinesh Singh (Indian Institute of Technology Mandi) PDF Poster Video (Right click to download)
1096	LED: Light Enhanced Depth Estimation at Night Simon de Moreau (Mines Paris - PSL University); Yasser Almehio (Valeo AI); Andrei Bursuc (Valeo AI); Hafid EL IDRISSI (Valeo AI); Bogdan Stanciulescu (Mines Paris - PSL University); Fabien Moutarde (Mines Paris - PSL University) PDF Poster Video (Right click to download)
1109	AKD-BNN: Adaptive Kernel Dynamic Bayesian Neural Networks for Enhanced Medical Image Segmentation with Uncertainty Estimation Qianying He (City University of Macau); Xuan Liu (City University of Macau); Wenjian Liu (City University of Macau) PDF Poster Video (Right click to download)
1110	Robust Human Registration with Body Part Segmentation on Noisy Point Clouds Kai Lascheit (ETH Zurich); Francis Engelmann (Stanford University); Daniel Barath (ETH Zurich); Marc Pollefeys (Microsoft); Leonidas Guibas (Stanford University) PDF Poster Video (Right click to download)
1118	Conformal Predictors for Efficient Video Text Spotting Amor Ben Tanfous (Terminal Industries); Sankha Subhra Mukherjee (Terminal Industries); Neil M. Robertson (Terminal Industries) PDF Poster
1121	MIAS-SAM: Medical Image Anomaly Segmentation without thresholding Marco Colussi (University of Milan); Dragan Ahmetovic (University of Milan); Sergio Mascetti (University of Milan) PDF Poster Video (Right click to download)
1123	Evaluating Self-Supervised Learning in Medical Imaging: A Systematic Investigation of Robustness, Generalizability, and Multi-Domain Impact Valay Bundele (University of Tuebingen); Karahan Sarıtaş (University of Tuebingen); Bora Kargi (University of Tuebingen); Oğuz Ata Çal (University of Tuebingen); Kıvanç Tezören (University of Tuebingen); Zohreh Ghaderi (University of Tuebingen); Hendrik Lensch (University of Tuebingen) PDF Poster Video (Right click to download)
1127	ALFred: An Active Learning Framework for Real-world Semi-supervised Anomaly Detection with Adaptive Thresholds Shanle Yao (University of North Carolina at Charlotte); Ghazal Alinezhad Noghre (University of North Carolina at Charlotte); Armin Danesh Pazho (University of North Carolina at Charlotte); Hamed Tabkhivayghan (University of North Carolina at Charlotte) PDF Poster Video (Right click to download)
1132	Isolated Channel Vision Transformers: From Single-Channel Pretraining to Multi-Channel Finetuning Wenyi Lian (Uppsala University); Patrick Micke (Uppsala University); Joakim Lindblad (Uppsala University); Nataša Sladoje (Uppsala University) PDF Poster Video (Right click to download)
1134	TRUDI and TITUS: A Multi-Perspective Dataset and A Three-Stage Recognition System for Transportation Unit Identification Emre Gülsoylu (Universität Hamburg); André Peter Kelm (Universität Hamburg); Lennart Bengtson (Universität Hamburg); Matthias Hirsch (Universität Hamburg); Christian Wilms (Universität Hamburg); Tim Rolff (Universität Hamburg); Janick Edinger (University of Hamburg); Simone Frintrop (Universität Hamburg) PDF Poster Video (Right click to download)
1135	The Trauma THOMPSON Dataset for Real-World Emergency AI Yupeng Zhuo (Purdue University); Eddie Zhang (Purdue University); Xiangchen Yu (Purdue University); Aditya Pachpande (Purdue University); Andrew Wallace Kirkpatrick (University of Calgary); Kyle Couperus (The Geneva Foundation); Jessica Mckee (University of Calgary); Juan Wachs (Purdue University) PDF Poster Video (Right click to download)
1137	ETTA: Efficient Test-Time Adaptation for Vision-Language Models through Dynamic Embedding Updates Hamidreza Dastmalchi (York University); Aijun An (York University); Ali Cheraghian (CSIRO) PDF Poster Video (Right click to download)
1156	Hierarchical Image-Guided 3D Point Cloud Segmentation in Industrial Scenes via Multi-View Bayesian Fusion Yu Zhu (Tohoku University); Naoya Chiba (Osaka University); Koichi Hashimoto (Tohoku University) PDF Poster Video (Right click to download)
1161	Evaluating Perceptual Distance Models by Fitting Binomial Distributions to Two-Alternative Forced Choice Data Alexander Hepburn (University of Bristol); Raúl Santos-Rodriguez (University of Bristol); Javier Portilla (Consejo Superior de Investigaciones Científicas (CSIC)) PDF Poster Video (Right click to download)
1171	Conflict-Aware Adversarial Training Zhiyu Xue (UC Santa Barbara); Haohan Wang (University of Illinois at Urbana-Champaign); Yao Qin (UC Santa Barbara); Ramtin Pedarsani (UC Santa Barbara) PDF Poster Video (Right click to download)
1177	Binarizing Severely Degraded Ancient Bamboo Slips: Dataset and Baseline Chongsheng Zhang (Henan University, Ludwig-Maximilians-Universität München); Wanwan Fu (Henan University); Qilong Li (Henan Univeristy); SAMRA ZAFAR (Henan Univeristy); Zhanshuo Zhang (Henan Univeristy); Qiyan Li (Henan Univeristy); Gaojuan Fan (Henan Univeristy); Christian Heumann (Ludwig-Maximilians-Universität München) PDF Poster Video (Right click to download)
1180	MO-SHW: Hierarchy-Aware Multi-Objective Optimization for Open-World Segmentation Erico M. Pereira (Universidade Federal de Minas Gerais); Frederico Gadelha Guimarães (Universidade Federal de Minas Gerais); Jefersson A Dos Santos (University of Sheffield) PDF Poster Video (Right click to download)
1183	Contrastive Point Feature Matching for Open-world Object Counting Ngo Xuan Cuong (University of Arkansas) PDF Poster Video (Right click to download)
1187	Out-of-Distribution Detection from Small Training Sets using Bayesian Neural Network Classifiers Kevin Raina (University of Ottawa); Tanya Schmah (University of Ottawa) PDF Poster Video (Right click to download)
1196	IoSR: End-to-End Intraoral Scans Repairing Manel Farhat (Digital Research Center of Sfax Laboratory of Signals, Systems, Artificial Intelligence and Networks Technopark of Sfax); Achraf Ben-hamadou (Digital Research Center of Sfax Laboratory of Signals, Systems, Artificial Intelligence and Networks Technopark of Sfax); Ahmed Rekik (Digital Research Center of Sfax Laboratory of Signals, Systems, Artificial Intelligence and Networks Technopark of Sfax); Ons Abida (Digital Research Center of Sfax Laboratory of Signals, Systems, Artificial Intelligence and Networks Technopark of Sfax); Oussama Smaoui (Biotech Dental Group) PDF Poster Video (Right click to download)
1199	AULUNet: An Adaptive Ultra-Lightweight U-Net Framework for Efficient Skin Lesion Segmentation in Resource-Constrained Environments Md Maklachur Rahman (Texas A&M University); Soon Ki Jung (Kyungpook National University); Tracy Hammond (Texas A&M University) PDF Poster Video (Right click to download)
1209	Language-Guided Reinforcement Learning for Hard Attention in Few-Shot Learning Bahareh Nikpour (McGill University); Narges Armanfard (McGill University) PDF Poster Video (Right click to download)
1220	Hair Strand Reconstruction based on 3D Gaussian Splatting Yimin Pan (Technical University of Munich); Matthias Nießner (Technical University of Munich); Tobias Kirschstein (Technical University of Munich) PDF Poster Video (Right click to download)
1224	RASALoRE: Region Aware Spatial Attention with Location-based Random Embeddings for Weakly Supervised Anomaly Detection in Brain MRI Scans Bheeshm Sharma (IIT Bombay); Karthikeyan Jaganathan (IIT Bombay); Balamurugan Palaniappan (IIT Bombay) PDF Poster Video (Right click to download)
1257	End-to-End LiDAR optimization for point cloud registration Siddhant Katyan (Université Laval); Marc-Andre Gardner (Bentley Systems); Jean-Francois Lalonde (Université Laval) PDF Poster Video (Right click to download)

If there are any mistakes on this page, please do not hesitate to contact bmvc@bmvc2025.org