12
Part Segmentation and Motion Estimation for Articulated Objects with Dynamic 3D Gaussians
Jun-Jee Chao (University of Minnesota); Qingyuan Jiang (University of Minnesota); Volkan Isler (University of Texas at Austin)
PDF Poster Video (Right click to download) 

18
Volumetric Temporal Texture for Smoke Stylization using Dynamic Radiance Fields
Dongqing Wang (École Polytechnique Fédérale de Lausanne (EPFL)); Ehsan Pajouheshgar (École Polytechnique Fédérale de Lausanne (EPFL)); Yitao Xu (École Polytechnique Fédérale de Lausanne (EPFL)); Tong Zhang (École Polytechnique Fédérale de Lausanne (EPFL)); Sabine Süsstrunk (École Polytechnique Fédérale de Lausanne (EPFL))
PDF Poster Video (Right click to download) 

22
Learning from Synthetic Data for Visual Grounding
Ruozhen He (Rice University); Ziyan Yang (Rice University); Paola Cascante-Bonilla (Stony Brook University); Alexander C. Berg (University of California, Irvine); Vicente Ordonez (Rice University)
PDF Poster Video (Right click to download) 

24
MSMVD: Exploiting Multi-scale Image Features via Multi-scale BEV Features for Multi-view Pedestrian Detection
Taiga Yamane (NTT Corporation); Satoshi Suzuki (NTT Corporation); Ryo Masumura (NTT Corporation); Shota Orihashi (NTT Corporation); Tomohiro Tanaka (NTT Corporation); Mana Ihori (NTT Corporation); Naoki Makishima (NTT Corporation); Naotaka Kawata (NTT Corporation)
PDF Poster 

26
Guiding a diffusion model with itself using sliding windows
Nikolas Adaloglou (Heinrich Heine University); Tim Kaiser (Heinrich Heine University); Damir Iagudin (Heinrich Heine University); Markus Kollmann (Heinrich Heine University)
PDF Poster Video (Right click to download) 

28
Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition
Chun-Hsiao Yeh (UC Berkeley); Ta-Ying Cheng (University of Oxford); He-Yen Hsieh (Harvard University); David Chuan-En Lin (Carnegie Mellon University); Yi Ma (University of Hong Kong); Andrew Markham (University of Oxford); Niki Trigoni (University of Oxford); H. T. Kung (Harvard University); Yubei Chen (UC Davis)
PDF Poster Video (Right click to download) 

29
SPARTAN: Spatiotemporal Pose-Aware Retrieval for Text-guided Autonomous Navigation
Xiangyu Bai (Northeastern University); Sai Anish Sreeramagiri (Northeastern University); Sai Siddhartha Vivek Dhir Rangoju (Northeastern University); Bishoy Galoaa (Northeastern University); Eric C Mortin (DEVCOM Analysis Center (DAC), US Army); Sarah Ostadabbas (Northeastern University)
PDF Poster Video (Right click to download) 

37
Cross-Modal Scene Semantic Alignment for Image Complexity Assessment
Yuqing Luo (Cardiff University); Yixiao Li (Beihang University); Jiang Liu (Cardiff University); Jun Fu (Cardiff University); Hadi Amirpour (University of Klagenfurt); Guanghui Yue (Shenzhen University); Baoquan Zhao (Sun Yat-sen University); Padraig Corcoran (Cardiff University); Hantao Liu (Cardiff University); Wei Zhou (Cardiff University)
PDF Poster Video (Right click to download) 

38
GC-Font: Few-Shot Font Generation via Global Contextual Feature Modelling
Weiran Chen (Soochow University); Guiqian Zhu (Soochow University); Ying Li (Soochow University); Yi Ji (Soochow University); Chunping Liu (Soochow University)
PDF Poster Video (Right click to download) 

39
DocAttentionRect: Attention-Guided Document Image Rectification
Pooja Kumari (Indian Institute of Technology Madras); Sukhendu Das (Indian Institute of Technology Madras)
PDF Poster Video (Right click to download) 

43
Split Matching for Inductive Zero-shot Semantic Segmentation
Jialei Chen (Nagoya University); Xu Zheng (The Hong Kong University of Science and Technology (Guangzhou) (HKUST (GZ))); Dongyue Li (Nagoya University); Chong Yi (Nagoya University); Seigo Ito (Nagoya University); Danda Pani Paudel (Artificial Intelligence and Technology (INSAIT), Sofia University); Luc Van Gool (Artificial Intelligence and Technology (INSAIT), Sofia University); Hiroshi Murase (Nagoya University); Daisuke Deguchi (Nagoya University)
PDF Poster Video (Right click to download) 

55
Piezoelectric Acoustic Sensing for Sitting Pose Classification
Yuuki Shibuya (Tokyo University of Science); Go Irie (Tokyo University of Science)
PDF Poster Video (Right click to download) 

56
Pointly-Supervised Weak-Shot Semantic Segmentation via Dual Mapping Transfer
Wenhui Jiang (Jiangxi University of Finance and Economics); Ruikang Luo (Jiangxi University of Finance and Economics); Zeyu Luo (Jiangxi University of Finance and Economics); Xiaowei Zhao (Sany Heavy Industry Co.); Junjie Chen (Jiangxi University of Finance and Economics); Yuming Fang (Jiangxi University of Finance and Economics)
PDF Poster Video (Right click to download) 

59
CMAMRNet: A Contextual Mask-Aware Network Enhancing Mural Restoration Through Comprehensive Mask Guidance
Yingtie Lei (University of Macau); Fanghai Yi (Guangdong University of Technology); Yihang Dong (University of Chinese Academy of Sciences); Weihuang Liu (University of Macau); Xiaofeng Zhang (Shanghai Jiao Tong University); Zimeng Li (Shenzhen Polytechnic University); Chi-Man Pun (University of Macau); Xuhang Chen (University of Macau)
PDF Poster Video (Right click to download) 

66
ST-GDance: Long-Term and Collision-Free Group Choreography from Music
Jing Xu (Monash University); Weiqiang Wang (Monash University); Cunjian Chen (Monash University); Jun Liu (Lancaster University); Qiuhong Ke (Monash University)
PDF Poster Video (Right click to download) 

69
TACTFL: Temporal Contrastive Training for Multi-modal Federated Learning with Similarity-guided Model Aggregation
Guanxiong Sun (University of Bristol); Majid Mirmehdi (University of Bristol); Zahraa S. Abdallah (University of Bristol); Raúl Santos-Rodriguez (University of Bristol); Ian James Craddock (University of Bristol); Telmo M Silva Filho (University of Bristol)
PDF Poster 

77
Training-Free Synthetic Data Generation with Dual IP-Adapter Guidance
Luc Boudier (École Polytechnique); Loris Manganelli (École Polytechnique); Eleftherios Tsonis (École Polytechnique); Nicolas Dufour (École Polytechnique, École des Ponts); Vicky Kalogeiton (École Polytechnique)
PDF Poster Video (Right click to download) 

81
SSNeRF: Sparse View Semi-supervised Neural Radiance Fields with Augmentation
Xiao Cao (National University of Singapore); Beibei Lin (National University of Singapore); Bo Wang (University of Mississippi); Zhiyong Huang (National University of Singapore); Robby T. Tan (National University of Singapore)
PDF Poster Video (Right click to download) 

83
RPD-Diff: Region-Adaptive Physics-Guided Diffusion Model for Visibility Enhancement under Dense and Non-Uniform Haze
Ruicheng Zhang (Sun Yat-sen University); Puxin Yan (Sun Yat-sen University); Zeyu Zhang (The Australian National University); Yicheng Chang (Sun Yat-sen University); Hongyi Chen (Sun Yat-sen University); Zhi Jin (Sun Yat-sen University)
PDF Poster Video (Right click to download) 

84
Continual Vision-and-Language Navigation
SeongJun Jeong (Seoul National University); Gi-Cheon Kang (Seoul National University); Seongho Choi (Seoul National University); Joochan Kim (Korea Institute of Science and Technology); Byoung-Tak Zhang (Seoul National University)
PDF Poster Video (Right click to download) 

88
CRCE: Coreference-Retention Concept Erasure in Text-to-Image Diffusion Models
Yuyang Xue (University of Edinburgh); Edward Moroshko (University of Edinburgh); Feng Chen (University of Edinburgh); Jingyu Sun (University of Edinburgh); Steven G. McDonagh (University of Edinburgh); Sotos Tsaftaris (University of Edinburgh)
PDF Poster Video (Right click to download) 

102
STAIN: Smooth Tile-Aware Instance Normalisation for Virtual Staining
David Armstrong (The Queen's University Belfast); Iain B Styles (The Queen's University Belfast); Niall McLaughlin (The Queen's University Belfast)
PDF Poster Video (Right click to download) 

113
Prompt-Based Exemplar Super-Compression and Regeneration for Class-Incremental Learning
Ruxiao Duan (Yale University); Jieneng Chen (Johns Hopkins University); Adam Kortylewski (University of Freiburg); Alan Yuille (Johns Hopkins University); Yaoyao Liu (University of Illinois Urbana-Champaign)
PDF Poster Video (Right click to download) 

117
RAPrivacy: a Readable Anonymizer for Privacy Preserving Action Recognition
ZiZhen Wang (National Yang Ming Chiao Tung University); Yen-Lung Chu (National Yang Ming Chiao Tung University); Pei-Chun Tsai (National Yang Ming Chiao Tung University); Kuan-Wen Chen (National Yang Ming Chiao Tung University)
PDF Poster Video (Right click to download) 

121
BOTM: Echocardiography Segmentation via Bi-directional Optimal Token Matching
Zhihua Liu (University of Leicester); Lei Tong (AstraZeneca); Xilin He (The Chinese University of Hong Kong); Che Liu (Imperial College London); Rossella Arcucci (Imperial College London); Chen Jin (AstraZeneca); Huiyu Zhou (University of Leicester)
PDF Poster Video (Right click to download) 

122
Distribution-guided Generative Replay with Semantic Prompts for Class-Incremental Chest X-ray Diagnosis
Jayant Mahawar (Indian Institute of Technology Jodhpur); Devi Prasad Maharathy (Indian Institute of Technology Jodhpur); Angshuman Paul (Indian Institute of Technology Jodhpur)
PDF Poster Video (Right click to download) 

124
Mask2Act: Predictive Multi-Object Tracking as Video Pre-Training for Robot Manipulation
Junbo Zhang (Tsinghua University); Kaisheng Ma (Tsinghua University)
PDF Poster Video (Right click to download) 

126
CLAIR: CLIP-Aided Weakly Supervised Zero-Shot Cross-Domain Image Retrieval
Chor Boon Tan (National University of Singapore); Conghui Hu (National University of Singapore); Gim Hee Lee (National University of Singapore)
PDF Poster Video (Right click to download) 

128
Uncertainty Diffusion: Parameter-Efficient Depth Refinement via Uncertainty-Guided Diffusion Models
Jeng-Huo Tzeng (National Yang Ming Chiao Tung University); Chuan-Yuan Huang (National Yang Ming Chiao Tung University); Kuan-Wen Chen (National Yang Ming Chiao Tung University)
PDF Poster Video (Right click to download) 

131
Dual-Stream Attention with Multi-Modal Queries for Object Detection in Transportation Applications
Noreen Anwar (LITIV, Polytechnique Montréal); Guillaume-Alexandre Bilodeau (LITIV, Polytechnique Montréal); Wassim Bouachir (Université du Québec (TELUQ))
PDF Poster Video (Right click to download) 

133
M$^2$StyleGS: Multi-Modality 3D Style Transfer with Gaussian Splatting
Xingyu Miao (Durham University); Xueqi Qiu (Durham University); Haoran Duan (Tsinghua University); Yawen Huang (Tencent Jarvis Lab); Xian Wu (Tencent Jarvis Lab); Jingjing Deng (University of Bristol); Yang Long (Durham University)
PDF Poster Video (Right click to download) 

136
Lang4D: Weakly Supervised Learning of 4D Language Splatting
Mana Masuda (CyberAgent, Keio University); Taiki Sekii (CyberAgent)
PDF Poster Video (Right click to download) 

138
SemanticControl: A Training-Free Approach for Handling Loosely Aligned Visual Conditions in ControlNet
Woosung Joung (Korea University); Daewon Chae (University of Michigan - Ann Arbor); Jinkyu Kim (Korea University)
PDF Poster Video (Right click to download) 

147
Spatial-Frequency Domain Aggregation for Visual Place Recognition
Chaoqun Wang (South China Normal University); Shaobo Min (Tencent)
PDF Poster Video (Right click to download) 

150
WTNet: A Weather Transfer Network for Domain-Adaptive All-In-One Adverse Weather Image Restoration
Si-Yu Huang (National Yang Ming Chiao Tung University); Fu-Jen Tsai (National Tsing Hua University); Chia-Wen Lin (National Tsing Hua University); Yen-Yu Lin (National Yang Ming Chiao Tung University)
PDF Poster Video (Right click to download) 

154
Seed-to-Seed: Unpaired Image Translation in Diffusion Seed Space
Or Greenberg (General Motors R&D, The Hebrew University of Jerusalem); Eran Kishon (General Motors R&D); Dani Lischinski (The Hebrew University of Jerusalem)
PDF Poster 

159
Identity-Motion Trade-offs in Text-to-Video Generation
Yuval Atzmon (NVIDIA Research); Rinon Gal (NVIDIA Research); Yoad Tewel (NVIDIA Research); Yoni Kasten (NVIDIA Research); Gal Chechik (NVIDIA Research)
PDF Poster 

165
Four eyes see more than two: Dataset Distillation with Mixture-of-Experts
Jia-Jiun Yao (National Yang Ming Chiao Tung University); Sheng-Feng Yu (National Yang Ming Chiao Tung University, Macronix International Co., Ltd.); Wei-Chen Chiu (National Yang Ming Chiao Tung University)
PDF Poster Video (Right click to download) 

168
PSScreen: Partially Supervised Multiple Retinal Disease Screening
Boyi Zheng (University of Oulu); Qing Liu (University of Oulu)
PDF Poster Video (Right click to download) 

173
CellMamba: Adaptive Mamba for Accurate and Efficient Cell Detection
Ruochen Liu (University of Liverpool); Yi Tian (National University of Singapore); Jiahao Wang (Xi'an Jiaotong-Liverpool University); Hongbin Liu (Xi'an Jiaotong-Liverpool University); Xianxu Hou (Xi'an Jiaotong-Liverpool University); Jingxin Liu (Xi'an Jiaotong-Liverpool University)
PDF Poster Video (Right click to download) 

181
LuKAN: A Kolmogorov-Arnold Network Framework for 3D Human Motion Prediction
Md Zahidul Hasan (Concordia University); Abdessamad Ben Hamza (Concordia University); Nizar Bouguila (Concordia University)
PDF Poster Video (Right click to download) 

182
eXtended Multimodal Composite Association Score (xMCAS): A Gender Inclusive Approach to Measurement of Bias in Text-To-Image Diffusion Models
Abhishek Mandal (Dublin City University); Susan Leavy (University College Dublin); Suzanne Little (Dublin City University)
PDF Poster Video (Right click to download) 

184
Graph Similarity Learning of Floor Plans
Casper van Engelenburg (Delft University of Technology); Jan van Gemert (Delft University of Technology); Seyran Khademi (Delft University of Technology)
PDF Poster 

195
Proto-FG3D: Prototype-based Interpretable Fine-Grained 3D Shape Classification
Shuxian Ma (University of Jinan); Zihao Dong (University of Jinan); Runmin Cong (Shandong University); Sam Tak-Wu Kwong (Lingnan University); Xiuli Shao (Nankai University)
PDF Poster Video (Right click to download) 

211
Efficient Image Restoration via Latent Consistency Flow Matching
Elad Cohen (Sony Semiconductor Israel); Idan Achituve (Sony Semiconductor Israel); Idit Diamant (Sony Semiconductor Israel); Arnon Netzer (Sony Semiconductor Israel); Hai Victor Habi (Sony Semiconductor Israel)
PDF Poster Video (Right click to download) 

218
Bridging Visual-Textual Modalities: Weakly Supervised Histopathology Segmentation
Sicong Gao (University of New South Wales); Matthew AB Baker (University of New South Wales); Maurice Pagnucco (University of New South Wales); Zhiwei Gao (Hebei Key Laboratory of Electromagnetic Environmental Effects and Information Processing); Yang Song (University of New South Wales)
PDF Poster Video (Right click to download) 

220
LiDAR MOT-DETR: A LiDAR-based Two-Stage Transformer for 3D Multiple Object Tracking
Martha Teiko Teye (University of Wuppertal); Ori Maoz (Aptiv); Matthias Rottmann (Universität Osnabrück)
PDF Poster Video (Right click to download) 

228
Dual-Expert Collaborative Network for Fake News Detection with External Knowledge Integration
Wenxin Luo (Beijing Jiaotong University); Zhu Teng (Beijing Jiaotong University); Wei Zhang (Beijing Jiaotong University); Baopeng Zhang (Beijing Jiaotong University)
PDF Poster Video (Right click to download) 

232
Unsupervised Video Continual Learning via Non-Parametric Deep Embedded Clustering
Nattapong Kurpukdee (University of York); Adrian G. Bors (University of York)
PDF Poster Video (Right click to download) 

238
Capture and Reconstruct 3D Clothed Human from Images
Onat Vuran (ETH Zürich); Hsuan-I Ho (ETH Zürich)
PDF Poster Video (Right click to download) 

239
REACT: Real-time Efficiency and Accuracy Compromise for Tradeoffs in Scene Graph Generation
Maëlic Neau (Umeå University); Paulo Eduardo Santos (PrioriAnalytica); Anne-Gwenn Bosser (Bretagne INP); Akihiro Sugimoto (National Institute of Informatics); Cedric Buche (CNRS IRL 2010 CROSSING, IMT Atlantique)
PDF Poster Video (Right click to download) 

257
S$^2$V2V: Training-Free Video-to-Video with Sparse Points and Motion Guidance
Xinyu Zhang (University of Auckland); Zicheng Duan (The University of Adelaide); Dong Gong (University of New South Wales); Lingqiao Liu (University of Adelaide)
PDF Poster Video (Right click to download) 

259
Revisiting Entropy Minimization for Long-Sequence Continual Test-Time Adaptation
WeiQin Chuah (Royal Melbourne Institute of Technology); Ruwan Tennakoon (Royal Melbourne Institute of Technology); Alireza Bab-Hadiashar (Royal Melbourne Institute of Technology)
PDF Poster Video (Right click to download) 

261
ADIR: Adaptive Diffusion for Image Reconstruction
Shady Abu-Hussein (Tel Aviv University); Tom Tirer (Tel Aviv University); Raja Giryes (Bar Ilan University)
PDF Poster Video (Right click to download) 

265
Enhancing Visual Tracking by Leveraging High-frequency Information within Event Signals
Yuheng Jiang (University of Science and Technology of China); Hebei Li (University of Science and Technology of China); Dachun Kai (University of Science and Technology of China); Yansong Peng (University of Science and Technology of China); Jiahui Yuan (University of Science and Technology of China); Peilin Xiao (University of Science and Technology of China); Yueyi Zhang (MiroMind); Xiaoyan Sun (University of Science and Technology of China)
PDF Poster Video (Right click to download) 

270
FaceGCD: Generalized Face Discovery via Dynamic Prefix Generation
Yunseok Oh (Inha University, AutoLabs, Inc.); Dong-Wan Choi (Inha University)
PDF Poster Video (Right click to download) 

272
B-RIGHT: Benchmark Re-evaluation for Integrity in Generalized Human-Object Interaction Testing
Yoojin Jang (Ulsan National Institute of Science and Technology); Junsu Kim (Ulsan National Institute of Science and Technology); Ha Yeon Kim (Ulsan National Institute of Science and Technology); Eun-Ki Lee (Hanyang University); Eun-Sol Kim (Hanyang University); Seungryul Baek (Ulsan National Institute of Science and Technology); Jaejun Yoo (Ulsan National Institute of Science and Technology)
PDF Poster Video (Right click to download) 

276
Learning Event-guided Exposure-agnostic Video Frame Interpolation via Adaptive Feature Blending
Junsik Jung (KAIST); Yoonki Cho (KAIST); Woo Jae Kim (KAIST); Lin Wang (Nanyang Technological University); Sung-Eui Yoon (KAIST)
PDF Poster Video (Right click to download) 

277
CFFlow: An Optical Flow Estimation Hinging on Cross-Frequency Attention
Mengfei Wang (Shanghai Institute of Microsystem and Information Technology); Dongchen Zhu (Shanghai Institute of Microsystem and Information Technology); Lei Wang (Shanghai Institute of Microsystem and Information Technology); Jiamao Li (Shanghai Institute of Microsystem and Information Technology)
PDF Poster Video (Right click to download) 

281
CLIP meets DINO for Tuning Zero-Shot Classifier using Unlabeled Image Collections
Mohamed Fazli Mohamed Imam (Mohamed bin Zayed University of Artificial Intelligence); Rufael Fekadu Marew (Mohamed bin Zayed University of Artificial Intelligence); Jameel Hassan Abdul Samadh (Johns Hopkins University); Mustansar Fiaz (IBM Research); Alham Fikri Aji (Mohamed bin Zayed University of Artificial Intelligence); Hisham Cholakkal (Mohamed bin Zayed University of Artificial Intelligence)
PDF Poster Video (Right click to download) 

283
AegisRF: Adversarial Perturbations Guided with Sensitivity for Protecting Intellectual Property of Neural Radiance Fields
Woo Jae Kim (Korea Advanced Institute of Science and Technology (KAIST)); Kyu Beom Han (Korea Advanced Institute of Science and Technology (KAIST)); Yoonki Cho (Korea Advanced Institute of Science and Technology (KAIST)); Youngju Na (Korea Advanced Institute of Science and Technology (KAIST)); Junsik Jung (Korea Advanced Institute of Science and Technology (KAIST)); Sooel Son (Korea Advanced Institute of Science and Technology (KAIST)); Sung-Eui Yoon (Korea Advanced Institute of Science and Technology (KAIST))
PDF Poster Video (Right click to download) 

284
Multimodal Feature Collaboration and Fusion for Fine-Grained Action Recognition
Xinyu Bian (Beijing University of Posts and Telecommunications); Dongliang Chang (Beijing University of Posts and Telecommunications); Yuqi Yang (Beijing University of Posts and Telecommunications); Lei Chen (Tsinghua University); Zhanyu Ma (Beijing University of Posts and Telecommunications)
PDF Poster Video (Right click to download) 

288
Benchmarking Microsaccade Recognition with Event Cameras: A Novel Dataset and Evaluation
Waseem Shariff (University of Galway); Timothy Hanley (University of Galway); Maciej Stec (University of Galway); Hossein Javidnia (University of Dublin); Peter Corcoran (University of Galway)
PDF Poster Video (Right click to download) 

289
CHIP: A multi-sensor dataset for 6D pose estimation of chairs in industrial settings
Mattia Nardon (Bruno Kessler Foundation); Mikel Mujika Agirre (Ikerlan); Ander González Tomé (Ikerlan); Daniel Sedano Algarabel (Ikerlan); Josep Rueda Collell (Ikerlan); Ana Paola Caro (Andreu World); Andrea Caraffa (Bruno Kessler Foundation); Fabio Poiesi (Bruno Kessler Foundation); Paul Ian Chippendale (Bruno Kessler Foundation); Davide Boscaini (Bruno Kessler Foundation)
PDF Poster Video (Right click to download) 

291
Detection Transformers Under the Knife: A Neuroscience-Inspired Approach to Ablations
Nils Hütten (Bergische Universität Wuppertal); Florian Hölken (Bergische Universität Wuppertal); Hasan Tercan (Bergische Universität Wuppertal); Tobias Meisen (Bergische Universität Wuppertal)
PDF Poster 

292
SVAC: Scaling Is All You Need For Referring Video Object Segmentation
Li Zhang (Columbia University); Haoxiang Gao (Carnegie Mellon University); Zhihao Zhang (Columbia University); Luoxiao Huang (New York University); Tao Zhang (Wuhan University)
PDF Poster Video (Right click to download) 

294
Task Progressive Curriculum Learning for Robust Visual Question Answering
Ahmed Akl (Griffith University); Abdelwahed Khamis (CSIRO); Zhe Wang (Griffith University); Ali Cheraghian (CSIRO); Sara Khalifa (Queensland University of Technology); Kewen Wang (Griffith University)
PDF Poster Video (Right click to download) 

310
PME3D: An Adaptive and Efficient Multi-modal Feature Extraction Plug-in for 3D Object Detection
Tianyi Yu (University of Glasgow); Lei Yu (Beihang University)
PDF Poster Video (Right click to download) 

313
OctreeNCA: Single-Pass 184 MP Segmentation on Consumer Hardware
Nick Lemke (Technische Universität Darmstadt); John Kalkhof (Technical University of Darmstadt); Niklas Babendererde (Technische Universität Darmstadt); Anirban Mukhopadhyay (Technical University of Darmstadt)
PDF Poster Video (Right click to download) 

320
Extreme Model Compression with Structured Sparsity at Low Precision
Dan Liu (McGill University); Nikita Dvornik (Palona AI); Xue Liu (Mohamed bin Zayed University of Artificial Intelligence)
PDF Poster Video (Right click to download) 

328
DepthHMR: Leveraging Depth Around Humans for Multi-Human Mesh Generation
Nikhil Sharma (University of Illinois Chicago); Jiachen Tao (University of Illinois Chicago); Junyi Wu (University of Illinois Chicago); Yan Yan (University of Illinois Chicago)
PDF Poster 

330
FaceCrafter: Identity-Conditional Diffusion with Disentangled Control over Facial Pose, Expression, and Emotion
Kazuaki Mishima (Institute of Science Tokyo); Antoni Bigata Casademunt (Imperial College London); Stavros Petridis (Imperial College London); Maja Pantic (Imperial College London); Kenji Suzuki (Institute of Science Tokyo)
PDF Poster Video (Right click to download) 

333
Fast Self-Supervised depth and mask aware Association for Multi-Object Tracking
Milad Khanchi (Concordia University); Maria Amer (Concordia University); Charalambos Poullis (Concordia University)
PDF Poster Video (Right click to download) 

334
DAOVI: Distortion-Aware Omnidirectional Video Inpainting
Ryosuke Seshimo (Keio University); Mariko Isogawa (Keio University)
PDF Poster Video (Right click to download) 

338
GLip: A Global-Local Integrated Progressive Framework for Robust Visual Speech Recognition
Tianyue Wang (University of Chinese Academy of Sciences); Shuang Yang (University of Chinese Academy of Sciences); Shiguang Shan (University of Chinese Academy of Sciences); Xilin Chen (University of Chinese Academy of Sciences)
PDF Poster Video (Right click to download) 

339
Events Meet Dynamic Mode Decomposition: Capturing the Spatiotemporal Dynamics of Moving Objects
Zhouning Du (AISIN CORPORATION); Israr Ulhaq (AISIN CORPORATION); Thanh Thi Huyen Phan (AISIN CORPORATION); Yuichiro Yoshimura (AISIN CORPORATION); Jigyasa Chand (AISIN CORPORATION); Truong Vinh Truong Duy (AISIN CORPORATION)
PDF Poster Video (Right click to download) 

342
Flatness-aware Curriculum Learning via Adversarial Difficulty
Hiroaki Aizawa (Hiroshima University); Yoshikazu Hayashi (Gifu University)
PDF Poster 

346
Coarse Attribute Prediction with Task Agnostic Distillation for Real World Clothes Changing ReID
Priyank Pathak (University of Central Florida); Yogesh S Rawat (University of Central Florida)
PDF Poster Video (Right click to download) 

348
PosBridge: Multi-View Positional Embedding Transplant for Identity-Aware Image Editing
Peilin Xiong (The University of Electro-Communications); Junwen Chen (The University of Electro-Communications); Honghui Yuan (The University of Electro-Communications); Keiji Yanai (The University of Electro-Communications)
PDF Poster Video (Right click to download) 

354
Dual-Branch Network via Multiple Illumination-Aware Representation Learning for Steel Surface Defect Classification
Yong Seok Oh (Dongguk University); Min Geol Kim (Sogang University); Bogyeong Kim (Dongguk University); Jun Young Kim (Dongguk University); Hyeongseob Jo (Dongguk University); Jae Hyeon Park (Dongguk University); Gyoomin Lee (Dongguk University); Sung In Cho (Sogang University)
PDF Poster Video (Right click to download) 

357
What Can We Learn from Harry Potter? An Exploratory Study of Visual Representation Learning from Atypical Videos
Qiyue Sun (University of Birmingham); Qiming Huang (University of Birmingham); Yang Yang (Shandong University); Hongjun Wang (Shandong University); Jianbo Jiao (University of Birmingham)
PDF Poster 

358
Pose-Robust Calibration Strategy for Point-of-Gaze Estimation on Mobile Phones
Yujie Zhao (Institute of Computing Technology, Chinese Academy of Sciences); Jiabei Zeng (Institute of Computing Technology, Chinese Academy of Sciences); Shiguang Shan (Institute of Computing Technology, Chinese Academy of Sciences)
PDF Poster Video (Right click to download) 

359
SteerPose: Simultaneous Extrinsic Camera Calibration and Matching from Articulation
Sang-Eun Lee (Kyoto University); Ko Nishino (Kyoto University); Shohei Nobuhara (Kyoto Institute of Technology)
PDF Poster 

364
Learning from Silence and Noise for Visual Sound Source Localization
Xavier Juanola (Universitat Pompeu Fabra); Giovana Morais (New York University); Magdalena Fuentes (New York University); Gloria Haro (Universitat Pompeu Fabra)
PDF Poster Video (Right click to download) 

366
HuGeDiff: 3D Human Generation via Diffusion with Gaussian Splatting
Maksym Ivashechkin (University of Surrey); Oscar Mendez (University of Surrey); Richard Bowden (University of Surrey)
PDF Poster Video (Right click to download) 

367
RUSplatting: Robust 3D Gaussian Splatting for Sparse-View Underwater Scene Reconstruction
Zhuodong Jiang (University of Bristol); Haoran Wang (University of Bristol); Guoxi Huang (University of Bristol); Brett Seymour (National Park Service); Nantheera Anantrasirichai (University of Bristol)
PDF Poster 

370
RGB-Event Fusion for Robust Lane Detection
Jingtao Dong (Beijing Institute of Technology); Hao Zhuang (Beijing Institute of Technology); Hao Yang (Beijing Institute of Technology); Liyuan Pan (Beijing Institute of Technology)
PDF Poster Video (Right click to download) 

373
Back To The Drawing Board: Rethinking Scene-Level Sketch-Based Image Retrieval
Emil Demić (University of Ljubljana); Luka Čehovin Zajc (University of Ljubljana)
PDF Poster Video (Right click to download) 

376
Multi-Rationale Explainable Object Recognition via Contrastive Conditional Inference
Ali Rasekh (L3S Research Center); Sepehr Kazemi Ranjbar (Independent Researcher); Simon Gottschalk (L3S Research Center)
PDF Poster Video (Right click to download) 

381
Permutation-Invariant Polar Harmonic Pooling for Point-based Neural Networks
Jaspreet Singh Maan (University of Lincoln); Grzegorz Cielniak (University of Lincoln)
PDF Poster Video (Right click to download) 

391
Depth Inconsistency-based spatial-channel attention gate for Mirror Segmentation
Ritsuki Kurohiji (Wakayama University); Hirotaka Hachiya (Wakayama University)
PDF Poster Video (Right click to download) 

392
UMM: A Unified Multi-Modal Model for Low-Level Vision Tasks with Dual-Driven Prompting
Ziqi Luo (Southern University of Science and Technology); Jinxiang Lai (The Hong Kong University of Science and Technology); Ruitao Chen (Southern University of Science and Technology); Jinyu Yang (tapall.ai); Bin-Bin Gao (Tencent); Qiang Nie (The Hong Kong University of Science and Technology); Jun Liu (Tencent YouTu Lab); Jinfan Wang (Southern University of Science and Technology); Feng Zheng (Southern University of Science and Technology)
PDF Poster Video (Right click to download) 

398
Knowledge Distillation via Cross Supervising with Attention for Remote Sensing Object Detection
Kefan Zhan (Xiangtan University); An Luo (Xiangtan University); Yunpeng Zeng (Xiangtan University); Jiaxin Li (Xiangtan University); Yuan Zhang (Xiangtan University); Kai Hu (Xiangtan University)
PDF Poster Video (Right click to download) 

399
3D Shape Reconstruction from Autonomous Driving Radars
Samah Hussein (École Polytechnique Fédérale de Lausanne (EPFL)); Junfeng Guan (École Polytechnique Fédérale de Lausanne (EPFL), Bosch Research); Swathi Shree Narashiman (École Polytechnique Fédérale de Lausanne (EPFL)); Saurabh Gupta (University of Illinois Urbana-Champaign); Haitham Al Hassanieh (École Polytechnique Fédérale de Lausanne (EPFL))
PDF Poster Video (Right click to download) 

402
Log NeRF: Comparing Spaces for Learning Radiance Fields
Sihe Chen (Northeastern University); Luv Verma (Northeastern University); Bruce A. Maxwell (Northeastern University)
PDF Poster Video (Right click to download) 

404
Exploring Histogram-based Color Constancy
David R. Treadwell IV (Northeastern University); Yunxuan Rao (Northeastern University); Daniel Y. Bi (Northeastern University); Bruce A. Maxwell (Northeastern University)
PDF Poster Video (Right click to download) 

405
HVLO-YOLO: An Ultra-Lightweight Detection Model for High-voltage Line Obstacles
Weichao Pan (Shandong Jianzhu University); Xu Wang (Shandong Jianzhu University); Chengze Lv (Shandong Jianzhu University); Zicheng Lin (Shandong Jianzhu University); Gongrui Wang (Shandong Jianzhu University); Xuening Zhang (Harbin Institute of Technology); Yi Sun (University of Ulster); Xingbo Liu (Shandong Jianzhu University)
PDF Poster Video (Right click to download) 

408
Multimodal Hate Detection Using Dual-Stream Graph Neural Networks
Jiangbei Yue (University of Exeter); Shuonan Yang (University of Exeter); Tailin Chen (University of Exeter); Jianbo Jiao (University of Birmingham); Zeyu Fu (University of Exeter)
PDF Poster Video (Right click to download) 

416
Beyond Softmax: Dual-Branch Sigmoid Architecture for Accurate Class Activation Maps
Yoojin Oh (Ewha Womans University); Junhyug Noh (Ewha Womans University)
PDF Poster Video (Right click to download) 

423
JDATT: A Joint Distillation Framework for Atmospheric Turbulence Mitigation and Target Detection
Zhiming Liu (University of Bristol); Paul Hill (University of Bristol); Nantheera Anantrasirichai (University of Bristol)
PDF Poster 

427
Interactive Occlusion Boundary Estimation through Exploitation of Synthetic Data
Lintao XU (Université Gustave Eiffel); Chaohui Wang (Université Gustave Eiffel)
PDF Poster Video (Right click to download) 

444
Answering from Sure to Uncertain: Uncertainty-Aware Curriculum Learning for Video Question Answering
Haopeng Li (The University of Melbourne); Mohammed Bennamoun (University of Western Australia); Jun Liu (Lancaster University); Hossein Rahmani (Lancaster University); Qiuhong Ke (Monash University)
PDF Poster Video (Right click to download) 

445
Frequency-Temporal Feature Integration for Compressed Video Action Recognition
Zhou Jiang wan (Beijing University of Posts and Telecommunications); Yue Ming (Beijing University of Posts and Telecommunications)
PDF Poster Video (Right click to download) 

453
HalfMix Augmentation and Regularized Dual-Path Learning for Cross-Domain Gaze Estimation
Jiuk Hong (Kyungpook National University); Heechul Jung (Kyungpook National University)
PDF Poster Video (Right click to download) 

457
A Novel Local Focusing Mechanism for Deepfake Detection Generalization
Mingliang Li (Jiangxi Normal University); Hanxi Li (Jiangxi Normal University); Lin Yuanbo Wu (The University of Warwick); Changhong Liu (Jiangxi Normal University)
PDF Poster Video (Right click to download) 

458
Intra-Modal Divergence-Weighted Distillation for Vision-Language Models
Addad Youva (Université de Caen Basse Normandie); Alexis Lechervy (Université de Caen Basse Normandie); Frédéric Jurie (Université de Caen Normandie)
PDF Poster Video (Right click to download) 

461
FSF3A: Federated Spatial Feature Alignment and Adaptive Aggregation for Heterogeneous Brain Tumor Segmentation
Peketi Divya (Indian Institute of Technology Hyderabad); C Krishna Mohan (Indian Institute of Technology Hyderabad); Sobhan Babu (Indian Institute of Technology Hyderabad); Sumanth Yenduri (Sam Houston State University)
PDF Poster Video (Right click to download) 

478
Bézier Curve-Based Stroke Extraction for Handwritten Characters
Yuxuan TENG (Tokyo University of Science); Takuya Matsuzaki (Tokyo University of Science)
PDF Poster Video (Right click to download) 

479
CleverDistiller: Simple and Spatially Consistent Cross-modal Distillation
Hariprasath Govindarajan (Linköping University, Qualcomm Inc); Maciej Wozniak (KTH Royal Institute of Technology); Marvin Klingner (Qualcomm Inc); Camille Maurice (Qualcomm Inc); B Ravi Kiran (Qualcomm Inc); Senthil Yogamani (Qualcomm Inc)
PDF Poster Video (Right click to download) 

486
FreqSelect: Frequency-Aware fMRI-to-Image Reconstruction
Junliang Ye (Australian National University); Lei Wang (Griffith University); Md Zakir Hossain (Curtin University)
PDF Poster 

493
Audio-Visual Separation with Hierarchical Fusion and Representation Alignment
Han Hu (University of Birmingham); Dongheng Lin (University of Birmingham); Qiming Huang (University of Birmingham); Yuqi Hou (University of Birmingham); Hyung Jin Chang (University of Birmingham); Jianbo Jiao (University of Birmingham)
PDF Poster 

495
An Explorative Study on Abstract Images and Visual Representations Learned from Them
Haotian LI (University of Birmingham); Jianbo Jiao (University of Birmingham)
PDF Poster Video (Right click to download) 

499
One target to align them all: LiDAR, RGB and event cameras extrinsic calibration for Autonomous Driving
Andrea Bertogalli (Politecnico di Milano); Giacomo Boracchi (Politecnico di Milano); Luca Magri (Politecnico di Milano)
PDF Poster Video (Right click to download) 

500
UFD-KD: Unified Frequency Decoupled Knowledge Distillation
Lu Sihan (Beijing Institute of Technology); Yang Zheng (Institute of Automation, Chinese Academy of Sciences); Jie Liu (Beijing University of Posts and Telecommunications); Zhenghao Xi (Shanghai University of Engineering Science)
PDF Poster Video (Right click to download) 

502
DefectGPT: Towards Multi-Class Defect Detection with Limited Electrical Samples
Zhuoyi Lin (Institute for Infocomm Research, Agency for Science, Technology and Research (A*STAR)); Kaixin Xu (Institute for Infocomm Research, Agency for Science, Technology and Research (A*STAR)); Aye Phyu Phyu Aung (Institute for Infocomm Research, Agency for Science, Technology and Research (A*STAR)); Wen Qiu (Advanced Micro Devices (Singapore) Pte Ltd); Bernice Zee (Advanced Micro Devices (Singapore) Pte Ltd); Jiann Min Chin (Advanced Micro Devices (Singapore) Pte Ltd); Senthilnath Jayavelu (Institute for Infocomm Research, Agency for Science, Technology and Research (A*STAR))
PDF Poster Video (Right click to download) 

507
Robust and Label-efficient Deep Waste Detection
Hassan Abid (Mohamed bin Zayed University of Artificial Intelligence); Khan Muhammad (Sung Kyun Kwan University); Muhammad Haris Khan (Mohamed bin Zayed University of Artificial Intelligence)
PDF Poster Video (Right click to download) 

509
Dynamic Convolution and Graph-Coupled Attention for Cross-Subject EEG-Vision Decoding
Tianyu Zhang (Durham University); FAN WAN (Tongfang Knowledge Network Digital Technology Co., Ltd. China National Nuclear Corporation); Kaili Sun (Durham University); Xingyu Miao (Durham University); Yueming Sun (University of Durham); Minye Shao (Durham University); Yang Long (Durham University)
PDF Poster Video (Right click to download) 

511
LieMorph: Transformer-based Image Registration Using Flows on Lie Groups
Johannes Bostelmann (Universität zu Lübeck); Jan Lellmann (Universität zu Lübeck)
PDF Poster Video (Right click to download) 

516
Making Rotation Averaging Fast and Robust with Anisotropic Coordinate Descent
Yaroslava Lochman (Chalmers University of Technology); Carl Olsson (Lund University); Christopher Zach (Chalmers University of Technology)
PDF Poster Video (Right click to download) 

517
Catching the Unknown with Limited Data: Bi-Directional Prompt Tuning in CLIP for Few-Shot Open-Set Adaptation
Moloud Abdar (The University of Queensland); Md Mehedi Hasan (Deakin University); Biplab Banerjee (Indian Institute of Technology Bombay); Abbas Khosravi (Deakin University); Pietro Lio (University of Cambridge)
PDF Poster Video (Right click to download) 

528
OmniSegNet: Towards Scalable, Efficient & Universal Medical Image Segmentation
Soma Dasgupta (Tata Consultancy Services); Swarnava Dey (Tata Consultancy Services); Arijit Mukherjee (Tata Consultancy Services); Arpan Pal (Tata Consultancy Services)
PDF Poster Video (Right click to download) 

540
OptSplat: Recurrent Optimization for Generalizable Reconstruction and Novel View Renderings
Vemburaj Chockalingam Yadav (German Research Center for Artificial Intelligence); Alain Pagani (German Research Center for Artificial Intelligence); Didier Stricker (German Research Center for Artificial Intelligence, RPTU)
PDF Poster Video (Right click to download) 

543
Temporally Compressed 3D Gaussian Splatting for Dynamic Scenes
Saqib Javed (CVLab, EPFL); Ahmad Jarrar Khan (CVLab, EPFL); Corentin Dumery (CVLab, EPFL); Chen Zhao (CVLab, EPFL); Mathieu Salzmann (CVLab, EPFL, Swiss Data Science Center)
PDF Poster Video (Right click to download) 

546
Multi-Method Ensemble for Out-of-Distribution Detection
Lucas Rakotoarivony (Thales, cortAIx Labs)
PDF Poster Video (Right click to download) 

548
Pandora: Articulated 3D Scene Graphs from Egocentric Vision
Alan Yu (Massachusetts Institute of Technology); Yun Chang (Massachusetts Institute of Technology); Christopher Xie (Meta Reality Labs); Luca Carlone (Massachusetts Institute of Technology)
PDF Poster Video (Right click to download) 

558
EZIGen: Enhancing zero-shot personalized image generation with precise subject encoding and decoupled guidance
Zicheng Duan (The University of Adelaide); Yuxuan Ding (Qualcomm AI Research); Chenhui Gou (Monash University); Ziqin Zhou (The University of Adelaide); Ethan Smith (Leonardo.AI); Lingqiao Liu (The University of Adelaide)
PDF Poster 

566
OpenHuman4D: Open-Vocabulary 4D Human Parsing
Keito Suzuki (University of California, San Diego); Bang Du (University of California, San Diego); Runfa Li (University of California, San Diego); Kunyao Chen (Qualcomm Inc); Lei Wang (Qualcomm Inc); Peng Liu (Qualcomm Inc); Ning Bi (Qualcomm Inc); Truong Nguyen (University of California, San Diego)
PDF Poster Video (Right click to download) 

567
Zero-Shot Anomaly Detection with Dual-Branch Prompt Selection
Zihan Wang (McGill University); Samira Ebrahimi Kahou (University of Calgary); Narges Armanfard (McGill University)
PDF Poster Video (Right click to download) 

577
Zero-Shot CFC: Fast Real-World Image Denoising based on Cross-Frequency Consistency
Yanlin Jiang (Beijing University of Technology); Yuchen Liu (Beijing University of Technology); Mingren Liu (Alibaba Cloud)
PDF Poster Video (Right click to download) 

588
CoT-SD: Chain-of-Thought Semantic Denoising
Yanlin Jiang (Beijing University of Technology); Yuchen Liu (Beijing University of Technology); Mingren Liu (Alibaba Cloud)
PDF Poster Video (Right click to download) 

593
PanoHair: Detailed Hair Strand Synthesis on Volumetric Heads
Shashikant Verma (Indian Institute of Technology Gandhinagar); Shanmuganathan Raman (Indian Institute of Technology Gandhinagar)
PDF Poster Video (Right click to download) 

594
Video Dataset Condensation with Diffusion Models
Zhe Li (FAU Erlangen-Nürnberg); Hadrien Reynaud (Imperial College London); Mischa Dombrowski (FAU Erlangen-Nürnberg); Sarah Cechnicka (Imperial College London); Franciskus Xaverius Erick (FAU Erlangen-Nürnberg); Bernhard Kainz (FAU Erlangen-Nürnberg, Imperial College London)
PDF Poster Video (Right click to download) 

596
TopoMortar: A Dataset to Evaluate Topology Accuracy in Image Segmentation
Juan Miguel Valverde (Technical University of Denmark); Motoya Koga (Sojo University); Nijihiko Otsuka (Sojo University); Anders Dahl (Technical University of Denmark)
PDF Poster Video (Right click to download) 

602
Dynamic Try-On: Taming Video Virtual Try-on with Dynamic Attention Mechanism
Jun Zheng (Sun Yat-sen University); Jing Wang (Sun Yat-sen University); Fuwei Zhao (ByteDance Inc.); Xujie Zhang (Sun Yat-sen University); Xiaodan Liang (Sun Yat-sen University)
PDF Poster Video (Right click to download) 

603
Dual-Stream Adapters for Open-Set Segmentation in Driving Scenes
Shyam Nandan Rai (Politecnico di Torino); Massimiliano Mancini (University of Trento); Barbara Caputo (Politecnico di Torino); Carlo Masone (Politecnico di Torino)
PDF Poster Video (Right click to download) 

610
HERO-VQL: Hierarchical, Egocentric and Robust Visual Query Localization
Joohyun Chang (Kyung Hee University); Soyeon Hong (Kyung Hee University); Hyogun Lee (Kyung Hee University); Seong Jong Ha (AI R&D Division, CJ Group); Dongho Lee (AI R&D Division, CJ Group); Seong Tae Kim (Kyung Hee University); Jinwoo Choi (Kyung Hee University)
PDF Poster Video (Right click to download) 

614
PerSense: Training-Free Personalized Instance Segmentation in Dense Images
Muhammad Ibraheem Siddiqui (Mohamed bin Zayed University of Artificial Intelligence); Muhammad Umer Sheikh (Mohamed bin Zayed University of Artificial Intelligence); Hassan Abid (Mohamed bin Zayed University of Artificial Intelligence); Muhammad Haris Khan (Mohamed bin Zayed University of Artificial Intelligence)
PDF Poster Video (Right click to download) 

626
Beyond Gloss: A Hand-Centric Framework for Gloss-Free Sign Language Translation
Sobhan Asasi (University of Surrey); Mohamed Ilyes Lakhal (University of Surrey); Ozge Mercanoglu Sincan (University of Surrey); Richard Bowden (University of Surrey)
PDF Poster 

627
TopoDiT-3D: Topology-Aware Diffusion Transformer with Bottleneck Structure for 3D Point Cloud Generation
Zechao Guan (Southeast University); Feng Yan (Meituan Company); Shuai Du (Southeast University); Lin Ma (Meituan Company); Qingshan Liu (Southeast University)
PDF Poster Video (Right click to download) 

628
MuDG: Taming Multi-modal Diffusion with Gaussian Splatting for Urban Scene Reconstruction
Yingshuang Zou (Tsinghua University); Yikang Ding (Megvii Technology); Chuanrui Zhang (Tsinghua University); Jiazhe Guo (Tsinghua University); Bohan Li (Shanghai Jiaotong University); Xiaoyang Lyu (University of Hong Kong); Feiyang Tan (Mach Drive); Xiaojuan Qi (University of Hong Kong); Haoqian Wang (Tsinghua University)
PDF Poster Video (Right click to download) 

630
Incremental Multi-Scene Modeling via Continual Neural Graphics Primitives
Prajwal Singh (IIT Gandhinagar); Ashish Tiwari (IIT Gandhinagar); Gautam Vashishtha (IIT Gandhinagar); Shanmuganathan Raman (IIT Gandhinagar)
PDF Poster Video (Right click to download) 

632
Segmentation Assisted Incremental Test Time Adaptation in an Open World
Manogna Sreenivas (Indian Institute of Science); Soma Biswas (Indian Institute of Science)
PDF Poster Video (Right click to download) 

644
Solving Zero-Shot 3D Visual Grounding as Constraint Satisfaction Problems
Qihao Yuan (University of Groningen); Kailai Li (University of Groningen); Jiaming Zhang (Karlsruhe Institute of Technology)
PDF Poster Video (Right click to download) 

646
Towards Data-Efficient Medical Imaging: A Generative and Semi-Supervised Framework
Mosong Ma (Imperial College London); Tania Stathaki (Imperial College London); Michalis Lazarou (University of Surrey)
PDF Poster Video (Right click to download) 

649
Towards Open-Vocabulary Multimodal 3D Object Detection with Attributes
Xinhao Xiang (University of California, Davis); Kuan-Chuan Peng (Mitsubishi Electric Research Laboratories (MERL)); Suhas Lohit (Mitsubishi Electric Research Laboratories (MERL)); Michael J. Jones (Mitsubishi Electric Research Laboratories (MERL)); Jiawei Zhang (University of California, Davis)
PDF Poster Video (Right click to download) 

650
Prompt-Informed Reinforcement Learning for Visual Coverage Path Planning
Venkat Margapuri (Villanova University)
PDF Poster Video (Right click to download) 

656
3D Curvix: From Multiview 2D Edges to 3D Curve Segments
Qiwu Zhang (Brown University); Chiang-Heng Chien (Brown University); Ricardo Fabbri (Universidade do Estado do Rio de Janeiro); Benjamin Kimia (Brown University)
PDF Poster Video (Right click to download) 

661
Leveraging Sparsity for Efficient Inference of High-Resolution Vision Foundation Models
Xin Xu (University of Illinois Urbana-Champaign); Jason Kuen (Adobe); Brian L Price (Adobe); Kangning Liu (Adobe); Zijun Wei (Adobe); Yu-Xiong Wang (University of Illinois Urbana-Champaign)
PDF Poster Video (Right click to download) 

664
Canonical Makeup Transfer
Xinyu Lin (CUHK-Shenzhen); Kun Zhou (Shenzhen University); Xiaoguang Han (CUHK-Shenzhen); Jiangbo Lu (SmartMore Corporation)
PDF Poster Video (Right click to download) 

666
Mitigating Hallucinations in Multimodal LLMs via Object-aware Preference Optimization
Alberto Compagnoni (University of Modena and Reggio Emilia); Davide Caffagni (University of Modena and Reggio Emilia); Nicholas Moratelli (University of Modena and Reggio Emilia); Lorenzo Baraldi (University of Modena and Reggio Emilia); Marcella Cornia (University of Modena and Reggio Emilia); Rita Cucchiara (University of Modena and Reggio Emilia)
PDF Poster Video (Right click to download) 

668
MLoRQ: Bridging Low-Rank and Quantization for Transformer Compression
Ofir Gordon (Sony Semiconductor Israel); Ariel Lapid (Sony Semiconductor Israel); Elad Cohen (Sony Semiconductor Israel); Yarden Yagil (Sony Semiconductor Israel); Arnon Netzer (Sony Semiconductor Israel); Hai Victor Habi (Sony Semiconductor Israel)
PDF Poster Video (Right click to download) 

675
Jack of many Faces: A Step Towards Facial Expression and Physiological State Analysis with a Single Network
Abdullah Tariq (Edith Cowan University); Martin Masek (Edith Cowan University); R Muhammad Atif Azad (Birmingham City University); Syed Zulqarnain Gilani (Edith Cowan University)
PDF Poster Video (Right click to download) 

680
On the Role of Individual Differences in Current Approaches to Computational Image Aesthetics
Li-Wei Chen (KU Leuven); Ombretta Strafforello (KU Leuven); Anne-Sofie Maerten (KU Leuven); Tinne Tuytelaars (KU Leuven); Johan Wagemans (KU Leuven)
PDF Poster 

690
MonoGSDF: Exploring Monocular Geometric Cues for Gaussian Splatting-Guided Implicit Surface Reconstruction
Kunyi Li (Technical University of Munich); Michael Niemeyer (Google); Zeyu Chen (Tsinghua University); Nassir Navab (Technical University of Munich); Federico Tombari (Google)
PDF Poster Video (Right click to download) 

694
Occam’s LGS: An Efficient Approach for Language Gaussian Splatting
Jiahuan Cheng (Johns Hopkins University); Jan-Nico Zaech (INSAIT, Sofia University); Luc Van Gool (INSAIT, Sofia University); Danda Pani Paudel (INSAIT, Sofia University)
PDF Poster 

698
SAMWave: Adapting Segment Anything Model to difficult tasks
Saurabh Yadav (IIIT Delhi); Avi Gupta (IIIT Delhi); Koteswar Rao Jerripothula (IIT Kanpur)
PDF Poster Video (Right click to download) 

700
Q-Align: Alleviating Attention Leakage in Zero-Shot Appearance Transfer via Query-Query Alignment
Namu Kim (KT Corporation); Wonbin Kweon (University of Illinois Urbana-Champaign); Minsoo Kim (Pohang University of Science and Technology); Hwanjo Yu (Pohang University of Science and Technology)
PDF Poster Video (Right click to download) 

701
Category-level Text-to-Image Retrieval Improved: Bridging the Domain Gap with Diffusion Models and Vision Encoders
Faizan Farooq Khan (King Abdullah University of Science and Technology); Vladan Stojnić (Czech Technical University in Prague); Zakaria Laskar (Czech Technical University in Prague); Mohamed Elhoseiny (King Abdullah University of Science and Technology); Giorgos Tolias (Czech Technical University in Prague)
PDF Poster Video (Right click to download) 

704
JOG3R: Towards 3D-Consistent Video Generators
Chun-Hao Paul Huang (Adobe Systems); Niloy J. Mitra (Adobe Systems); Hyeonho Jeong (Adobe Systems); Jae Shin Yoon (Adobe Systems); Duygu Ceylan (Adobe Systems)
PDF Poster Video (Right click to download) 

705
ALSA: Anchors in Logit Space for Out-of-Distribution Accuracy Estimation
Chenzhi Liu (The University of Queensland); Mahsa Baktashmotlagh (The University of Queensland); Yanran Tang (The University of Queensland); Zi Huang (University of Queensland); Ruihong Qiu (The University of Queensland)
PDF Poster Video (Right click to download) 

707
Catch Your Concepts: A Flexible Concept Locator for Interpretable Visual Recognition
Qiyang Wan (Institute of Computing Technology, Chinese Academy of Sciences (CAS)); Ruiping Wang (Institute of Computing Technology, Chinese Academy of Sciences (CAS)); Chengzhi Gao (Institute of Computing Technology, Chinese Academy of Sciences (CAS)); Xilin CHEN (Institute of Computing Technology, Chinese Academy of Sciences (CAS))
PDF Poster Video (Right click to download) 

709
TAPM-Net: Trajectory-Aware Perturbation Modeling for Infrared Small Target Detection
Hongyang Xie (The University of Warwick); Hongyang He (The University of Warwick); Victor Sanchez (The University of Warwick)
PDF Poster 

716
Size-aware Contrastive Imitation Learning for Language-conditioned Multi-task Robotic Manipulation
Jiakai Huang (South China Normal University); Weiping Zheng (South China Normal University)
PDF Poster Video (Right click to download) 

717
From Open Vocabulary to Open World: Teaching Vision Language Models to Detect Novel Objects
Zizhao Li (University of Melbourne); Zhengkang Xiang (University of Melbourne); Joseph West (University of Melbourne); Kourosh Khoshelham (University of Melbourne)
PDF Poster Video (Right click to download) 

718
PADS: Plug-and-Play 3D Human Pose Analysis via Diffusion Generative Modeling
Haorui Ji (Australian National University); Hongdong Li (Australian National University)
PDF Poster 

721
Gromov Wasserstein Optimal Transport for Semantic Correspondences
Francis Snelgar (Australian National University); Stephen Gould (Australian National University); Ming Xu (Australian National University); Liang Zheng (Australian National University); Akshay Asthana (Seeing Machines)
PDF Poster Video (Right click to download) 

722
Unsupervised Multimodal Deepfake Detection Through Explicit Intra-Modal and Cross-Modal Inconsistency Discovery
Mulin Tian (University of Southern California); Mahyar Khayatkhoei (University of Southern California); Joe Mathai (University of Southern California); Wael AbdAlmageed (Clemson University)
PDF Poster Video (Right click to download) 

732
TryOffDiff: Virtual-Try-Off via High-Fidelity Garment Reconstruction using Diffusion Models
Riza Velioglu (Universität Bielefeld); Petra Bevandić (Universität Bielefeld); Robin Chan (Technische Universität Berlin); Barbara Hammer (Universität Bielefeld)
PDF Poster Video (Right click to download) 

736
EWMBench: Evaluating Scene, Motion, and Semantic Quality in Embodied World Models
Yue Hu (AgiBot, Harbin Institute of Technology); Siyuan Huang (Shanghai Jiao Tong University); Yue Liao (National University of Singapore); Shengcong Chen (AgiBot); Pengfei Zhou (AgiBot); Liliang Chen (AgiBot); Guanghui Ren (AgiBot); Maoqing Yao (AgiBot)
PDF Poster Video (Right click to download) 

737
Self-Intersection-Aware 3D Human Motion Generation Using an Efficient Human Sphere Proxy
Pascal Herrmann (Bosch Research); Maarten Bieshaar (Bosch Research); Dennis Mack (Bosch Research); Paul Robert Herzog (Bosch Research); Juergen Gall (University of Bonn)
PDF Poster Video (Right click to download) 

745
Mapping like a Skeptic: Probabilistic BEV Projection for Online HD Mapping
Fatih Erdoğan (Koç University); Merve Rabia Barin (Koç University); Fatma Güney (Koç University)
PDF Poster Video (Right click to download) 

751
TPA: Temporal Prompt Alignment for Fetal Congenital Heart Defect Classification
Darya Taratynova (Mohamed bin Zayed University of Artificial Intelligence); Alya Almsouti (Mohamed bin Zayed University of Artificial Intelligence); Beknur Kalmakhanbet (Mohamed bin Zayed University of Artificial Intelligence); Numan Saeed (Mohamed bin Zayed University of Artificial Intelligence); Mohammad Yaqub (Mohamed bin Zayed University of Artificial Intelligence)
PDF Poster Video (Right click to download) 

753
Boosting Camera Motion Control for Video Diffusion Transformers
Soon Yau Cheong (University of Surrey); Duygu Ceylan (Adobe Research); Armin Mustafa (University of Surrey); Andrew Gilbert (University of Surrey); Chun-Hao Paul Huang (Adobe Research)
PDF Poster Video (Right click to download) 

755
Audio-Guided Visual Editing with Complex Multi-Modal Prompts
Hyeonyu Kim (MAUM AI Inc.); Seokhoon Jeong (Ulsan National Institute of Science and Technology); Seonghee Han (Ulsan National Institute of Science and Technology); Chanhyuk Choi (Ulsan National Institute of Science and Technology); Taehwan Kim (Ulsan National Institute of Science and Technology)
PDF Poster Video (Right click to download) 

762
Learning a Neural Association Network for Self-supervised Multi-Object Tracking
Shuai Li (University of Bonn); Michael Burke (Monash University); Subramanian Ramamoorthy (University of Edinburgh); Juergen Gall (University of Bonn)
PDF Poster Video (Right click to download) 

766
Specify and Edit: Overcoming Ambiguity in Text-Based Image Editing
Ekaterina Iakovleva (Télécom Paris); Fabio Pizzati (MBZUAI); Philip Torr (University of Oxford); Stéphane Lathuilière (INRIA)
PDF Poster Video (Right click to download) 

768
Interpretable Text-Guided Image Clustering via Iterative Search
Bingchen Zhao (University of Edinburgh); Oisin Mac Aodha (University of Edinburgh)
PDF Poster 

770
DTFSal: Audio-Visual Dynamic Token Fusion for Video Saliency Prediction
Kiana Hooshanfar (University of Tehran); Alireza Hosseini (University of Tehran); Mona Ahmadian (University of Surrey); Ahmad Kalhor (University of Tehran); Babak N Araabi (University of Tehran)
PDF Poster Video (Right click to download) 

782
Is Safety Checker Still Safe? A Study on the Covert NSFW Text
Xin Li (Shanghai Fudan Microelectronics Group CO.,LTD.); Kai Chen (Shanghai Fudan Microelectronics Group CO.,LTD.); XUE YANG (Shanghai Fudan Microelectronics Group CO.,LTD.); Weijun Shan (Shanghai Fudan Microelectronics Group CO.,LTD.); Jun Yu (Shanghai Fudan Microelectronics Group CO.,LTD.); Qing Li (Shanghai Fudan Microelectronics Group CO.,LTD.)
PDF Poster 

784
UDT: Unsupervised Discovery of Transformations between Fine-Grained Classes in Diffusion Models
Youngjae Choi (Soongsil University); Hyunsuh Koh (Soongsil University); Hojae Jeong (Soongsil University); ByungKwan Chae (Soongsil University); Sungyong Park (Soongsil University); Heewon Kim (Soongsil University)
PDF Poster Video (Right click to download) 

786
TAG: A Simple Yet Effective Temporal-Aware Approach for Zero-Shot Video Temporal Grounding
Jin-Seop Lee (Sungkyunkwan University); SungJoon Lee (Sungkyunkwan University); Jaehan Ahn (Sungkyunkwan University); YunSeok Choi (Sungkyunkwan University); Jee-Hyong Lee (Sungkyunkwan University)
PDF Poster Video (Right click to download) 

787
C³-GS: Learning Context-aware, Cross-dimension, Cross-scale Feature for Generalizable Gaussian Splatting
Yuxi Hu (Graz University of Technology); Jun Zhang (Graz University of Technology); Kuangyi Chen (Graz University of Technology); Zhe Zhang (Peking University); Friedrich Fraundorfer (Graz University of Technology)
PDF Poster Video (Right click to download) 

788
Vision Backbone Efficient Selection for Image Classification in Low-Data Regimes
Joris Guerin (IRD); Shray Bansal (Georgia Institute of Technology); Amirreza Shaban (Field AI); Paulo Mann (Federal University of Rio de Janeiro); Harshvardhan Gazula (Massachusetts Institute of Technology)
PDF Poster Video (Right click to download) 

811
DEAD: Data-Efficient Audiovisual Dubbing using Neural Rendering Priors
Jack Saunders (University of Bath); Vinay Namboodiri (University of Bath)
PDF Poster Video (Right click to download) 

824
A Unified Framework for High-Frame-Rate HDR Video Synthesis
Hue Nguyen (York University); Trevor Dalton Canham (York University); Michael S. Brown (York University)
PDF Poster Video (Right click to download) 

825
Modular Embedding Recomposition for Incremental Learning
Aniello Panariello (University of Modena and Reggio Emilia); Emanuele Frascaroli (University of Modena and Reggio Emilia); Pietro Buzzega (University of Modena and Reggio Emilia); Lorenzo Bonicelli (University of Modena and Reggio Emilia); Angelo Porrello (University of Modena and Reggio Emilia); Simone Calderara (University of Modena and Reggio Emilia)
PDF Poster Video (Right click to download) 

827
MonoTracker: Monocular RGB-Only 6D Tracking of Unknown Object
Zilong Deng (ETH Zürich); Shaochang Tan (Universität Zürich); Zuria Bauer (ETH Zürich); Daniel Barath (ETH Zürich); Marc Pollefeys (Microsoft)
PDF Poster Video (Right click to download) 

828
Leveraging Modality Tags for Enhanced Cross-Modal Video Retrieval
Adriano Fragomeni (University of Bristol); Dima Damen (University of Bristol); Michael Wray (University of Bristol)
PDF Poster Video (Right click to download) 

831
LOGen: Toward LiDAR Object Generation by Point Diffusion
Ellington Kirby (Valeo.ai); Mickael Chen (Valeo.ai); Renaud Marlet (Valeo.ai); Nermin Samet (Valeo.ai)
PDF Poster Video (Right click to download) 

832
SIMULDITEX: Single Image Multiscale & Lightweight Diffusion for Texture Modelling
Pierrick Chatillon (Université de Caen Normandie); Julien Rabin (Université de Caen Normandie); David Tschumperlé (Université de Caen Normandie)
PDF Poster Video (Right click to download) 

835
Cloud-Stereo: A Dataset and Benchmark for Reconstructing Atmospheric Clouds from Stereo Images
Jacob Lin (University of Oxford); Edward Gryspeerdt (Imperial College London); Ronald Clark (University of Oxford)
PDF Poster Video (Right click to download) 

850
C-SWAP: Explainability-Aware Structured Pruning for Efficient Neural Networks Compression
Baptiste Bauvin (Thales); Loïc Baret (Thales); Ola Ahmad (Thales)
PDF Poster Video (Right click to download) 

851
Asymmetric Event-Image Stereo with Temporal Feature Gating and Iterative Structure-Detail Refinement
Hao Zhuang (Beijing Institute of Technology); Yan Yang (Australian National University); Liyuan Pan (Beijing Institute of Technology)
PDF Poster Video (Right click to download) 

852
LoFT: LoRA-fused Training Dataset Generation with Few-shot Guidance
Jae Myung Kim (University of Tuebingen); Stephan Alaniz (Télécom Paris); Cordelia Schmid (Inria, Ecole normale supérieure); Zeynep Akata (Technische Universität München)
PDF Poster Video (Right click to download) 

857
Lost in Time: A New Temporal Benchmark for VideoLLMs
Daniel Cores (University of Santiago de Compostela); Michael Dorkenwald (University of Amsterdam); Manuel Mucientes (Universidad de Santiago de Compostela); Cees G. M. Snoek (University of Amsterdam); Yuki M Asano (University of Technology Nuremberg)
PDF Poster Video (Right click to download) 

859
Improving Multimodal Distillation for 3D Semantic Segmentation under Domain Shift
Björn Michele (Université de Bretagne Sud); Alexandre Boulch (valeo.ai); Gilles Puy (valeo.ai); Tuan-Hung VU (valeo.ai); Renaud Marlet (valeo.ai); Nicolas Courty (IRISA)
PDF Poster Video (Right click to download) 

864
Geometry-Aware Diffusion Models for Multiview Scene Inpainting
Ahmad Salimi (York University); Tristan Ty Aumentado-Armstrong (York University); Marcus A Brubaker (Google DeepMind); Konstantinos G. Derpanis (York University)
PDF Poster Video (Right click to download) 

865
FootFormer: Estimating Stability from Visual Input
Keaton Yukio Kraiger (Pennsylvania State University); Jingjing Li (Pennsylvania State University); Skanda Bharadwaj (Pennsylvania State University); Jesse Scott (Scientific Applications & Research Associates (SARA), Inc.); Robert T. Collins (Pennsylvania State University); Yanxi Liu (Pennsylvania State University)
PDF Poster 

866
Image Recognition with Vision and Language Embeddings of VLMs
Illia Volkov (Czech Technical University in Prague); Nikita Kisel (Czech Technical University in Prague); Klara Janouskova (Czech Technical University in Prague); Jiri Matas (Czech Technical University in Prague)
PDF Poster 

873
Exploring Image Representation with Decoupled Classical Visual Descriptors
Chenyuan Qu (University of Birmingham); Hao Chen (University of Cambridge); Jianbo Jiao (University of Birmingham)
PDF Poster Video (Right click to download) 

875
Lost in Translation? Vocabulary Alignment for Source-Free Adaptation in Open-Vocabulary Semantic Segmentation
Silvio Mazzucco (The Good AI Lab); Carl Persson (The Good AI Lab); Mattia Segu (The Good AI Lab); Pier Luigi Dovesi (The Good AI Lab); Federico Tombari (Google); Luc Van Gool (INSAIT, Sofia University); Matteo Poggi (University of Bologna)
PDF Poster Video (Right click to download) 

880
Distortion-Aware Multi-Object Tracking via Virtual Plane Projection in Overhead Fisheye Cameras
Panithi Vanasirikul (OxygenAI); Piyanon Charoenpoonpanich (OxygenAI); Ekapol Chuangsuwanich (Chulalongkorn University)
PDF Poster Video (Right click to download) 

887
QWD-GAN: Quality-aware Wavelet-driven GAN for Unsupervised Medical Microscopy Images Denoising
Qijun Yang (The University of Manchester); Yating Huang (The University of Manchester); Lintao Xiang (The University of Manchester); Hujun Yin (The University of Manchester)
PDF Poster Video (Right click to download) 

896
Llama Learns to Direct: DirectorLLM for Human-Centric Video Generation
Kunpeng Song (The State University of New Jersey); Tingbo Hou (GenAI at Meta); Zecheng He (GenAI at Meta); Haoyu Ma (GenAI at Meta); Jialiang Wang (GenAI at Meta); Animesh Sinha (GenAI at Meta); Sam Tsai (GenAI at Meta); Yaqiao Luo (GenAI at Meta); Xiaoliang Dai (GenAI at Meta); Li Chen (GenAI at Meta); Xide Xia (GenAI at Meta); Peizhao Zhang (GenAI at Meta); Peter Vajda (GenAI at Meta); Ahmed M. Elgammal (The State University of New Jersey); Felix Juefei-Xu (GenAI at Meta)
PDF Poster Video (Right click to download) 

898
FaceCPT: Toward Cross-Modal Facial Representation Learning with Face-Caption Pre-Training
Md Mahedi Hasan (West Virginia University); Shoaib Meraj Sami (West Virginia University); Nasser Nasrabadi (West Virginia University); Jeremy M. Dawson (West Virginia University)
PDF Poster Video (Right click to download) 

900
Time-Scaling State-Space Models for Dense Video Captioning
AJ Piergiovanni (Google DeepMind); Ganesh Satish Mallya (Google DeepMind); Dahun Kim (Google DeepMind); Anelia Angelova (Google DeepMind)
PDF Poster 

902
ITC-RWKV: Interactive Tissue–Cell Modeling with Recurrent Key-Value Aggregation for Histopathological Subtyping
Yating Huang (The University of Manchester); Qijun Yang (The University of Manchester); Lintao Xiang (The University of Manchester); Hujun Yin (The University of Manchester)
PDF Poster Video (Right click to download) 

903
Toward Robust Audio-Visual Synchronization Detection in Egocentric Video with Sparse Synchronization Events
Jordan Voas (The University of Texas at Austin); Wei-Cheng Tseng (The University of Texas at Austin); Benoit Vallade (Amazon Prime Video); Alex Mackin (Amazon Prime Video); David Higham (Amazon Prime Video); David Harwath (The University of Texas at Austin)
PDF Poster Video (Right click to download) 

914
Improving Human Motion Plausibility with Body Momentum
Ha Linh Nguyen (National University of Singapore); Tze Ho Elden Tse (National University of Singapore); Angela Yao (National University of Singapore)
PDF Poster Video (Right click to download) 

915
DualDistill: A Unified Cross-Modal Knowledge Distillation Framework for Camera-Based BEV Representation
Gaeun Kim (Seoul National University of Science and Technology); Daeil Han (Seoul National University of Science and Technology); Yeong Jun Koh (Chungnam National University); Hanul Kim (Seoul National University of Science and Technology)
PDF Poster Video (Right click to download) 

922
FSLC: Fast Scoring with Learnable Coreset for Zero-shot Industrial Anomaly Detection
Songtao Ni (Shanghai Jiao Tong University); Yuxin Li (Shanghai Jiao Tong University); Xu Zhao (Shanghai Jiao Tong University)
PDF Poster Video (Right click to download) 

931
Diffusion Transformer-to-Mamba Distillation for High-Resolution Image Generation
Yuan Yao (University of Rochester); Yicong Hong (Adobe Research); Difan Liu (Adobe Research); Long Mai (Adobe Research); Feng Liu (Adobe Research); Jiebo Luo (University of Rochester)
PDF Poster Video (Right click to download) 

938
Grad-CL: Source Free Domain Adaptation with Gradient Guided Feature Disalignment
Rini Smita Thakur (Indian Institute of Science Education and Research Bhopal); Rajeev Ranjan Dwivedi (Indian Institute of Science Education and Research Bhopal); Vinod K. Kurmi (Indian Institute of Science Education and Research Bhopal)
PDF Poster Video (Right click to download) 

940
Semi-MoE: Mixture-of-Experts meets Semi-Supervised Histopathology Segmentation
Nguyen Lan Vi Vu (Ho Chi Minh City University of Technology); Thanh-Huy Nguyen (Carnegie Mellon University); Thien Nguyen (Posts and Telecommunications Institute of Technology); Daisuke Kihara (Purdue University); Tianyang Wang (University of Alabama at Birmingham); Xingjian Li (Carnegie Mellon University); Min Xu (Carnegie Mellon University)
PDF Poster Video (Right click to download) 

943
HEAL: Learning-Free Source Free Unsupervised Domain Adaptation for Cross-Modality Medical Image Segmentation
Yulong Shi (Northeastern University); Jiapeng Li (Northeastern University); Lin Qi (Northeastern University)
PDF Poster 

948
Tracking Meets Large Multimodal Models for Driving Scenario Understanding
Ayesha Ishaq (Mohamed bin Zayed University of Artificial Intelligence); Jean Lahoud (Mohamed bin Zayed University of Artificial Intelligence); Fahad Shahbaz Khan (Mohamed bin Zayed University of Artificial Intelligence); Salman Khan (Mohamed bin Zayed University of Artificial Intelligence); Hisham Cholakkal (Mohamed bin Zayed University of Artificial Intelligence); Rao Muhammad Anwer (Mohamed bin Zayed University of Artificial Intelligence)
PDF Poster Video (Right click to download) 

949
Prompt Image to Watch and Hear: Multimodal Prompting for Parameter-Efficient Audio-Visual Learning
Kai Wang (University of Toronto); Shentong Mo (Carnegie Mellon University); Yapeng Tian (University of Texas at Dallas); Dimitrios Hatzinakos (University of Toronto)
PDF Poster Video (Right click to download) 

955
Clean Sample Selection and Noisy Sample Rematching for Text-Based Pedestrian Retrieval
Daiqiang Li (Sichuan University); Weicheng Zhang (Sichuan University); Yuanyuan Wu (Chengdu University of Technology); Honggang Chen (Sichuan University)
PDF Poster Video (Right click to download) 

960
A Hybrid Framework Bridging CNN and ViT based on Theory of Evidence for Diabetic Retinopathy Grading
Junlai Qiu (Hainan University); Yunzhu Chen (Guangxi Polytechnic of Construction); Hao Zheng (Tencent Jarvis Lab); Yawen Huang (Tencent Jarvis Lab); Yuexiang Li (Guangxi Medical University)
PDF Poster 

962
LaMamba-Diff: Linear-Time High-Fidelity Diffusion Models Based on Local Attention and Mamba
Yunxiang Fu (University of Hong Kong); Chaoqi Chen (Shenzhen University); Yizhou Yu (The University of Hong Kong)
PDF Poster Video (Right click to download) 

976
Advancing Utility Pole and Sign Detection Through Deep Learning
Carl Dickinson (University of Strathclyde); Gaetano Di Caterina (University of Strathclyde)
PDF Poster Video (Right click to download) 

981
Ev4DGS: Novel-view Rendering of Non-Rigid Objects from Monocular Event Streams
Takuya Nakabayashi (Keio University); Navami Kairanda (Max Planck Institute for Informatics); Hideo Saito (Keio University); Vladislav Golyanik (Max Planck Institute for Informatics)
PDF Poster Video (Right click to download) 

986
Visible Structure Retrieval for Lightweight Image-Based Relocalisation
Fereidoon Zangeneh (KTH Royal Institute of Technology); Leonard Bruns (KTH Royal Institute of Technology); Amit Dekel (Univrses AB); Alessandro Pieropan (Univrses AB); Patric Jensfelt (KTH Royal Institute of Technology)
PDF Poster Video (Right click to download) 

987
Spatiotemporal Event Spotting via 3D Heatmaps with Dynamically Shifted Gaussian Kernels
Ankhzaya Jamsrandorj (Korea Institute of Science and Technology); Vanyi Chao (University of Science and Technology); Hoang Quoc Nguyen (Korea Institute of Science and Technology); Yin May Oo (University of Science and Technology); Muhammad Amrulloh Robbani (University of Science and Technology); Yewon Hwang (Korea Institute of Science and Technology); Kyung-Ryoul Mun (Korea Institute of Science and Technology); Jinwook Kim (Korea Institute of Science and Technology)
PDF Poster Video (Right click to download) 

988
Ink Enhancement for Ancient Bamboo Manuscripts Using Iterative Restoration-Degradation Adversarial Learning
Chongsheng Zhang (Henan University, Ludwig-Maximilians-Universität München); Junchao Ma (Henan University); Wenhao Zhang (Henan University); Kamel Aouaidjia (Henan University); Qilong Li (Henan University); Gaojuan Fan (Henan University); Christian Heumann (Ludwig-Maximilians-Universität München)
PDF Poster Video (Right click to download) 

989
MedOpenSeg: Open-World Medical Segmentation with Memory-Augmented Transformers
Luisa Vargas (Eurecom); Eleonora Poeta (Politecnico di Torino); Tania Cerquitelli (Politecnico di Torino); Elena Baralis (Politecnico di Torino); Maria A Zuluaga (Eurecom)
PDF Poster Video (Right click to download) 

992
Dual Polarity Prompts with Stochastic Entropy Perturbation for Label Noise
Changhui Hu (Universitat de Barcelona); Bhalaji Nagarajan (Barcelona Supercomputing Center); Ricardo Marques (Universitat Pompeu Fabra); Petia Radeva Ivanova (Universitat de Barcelona)
PDF Poster Video (Right click to download) 

999
Lid-Lab-NeRF: Generating Temporally Consistent, Labelled LiDAR Point Clouds using Neural Radiance Fields
Shrestha Srivastava (Indian Institute of Science Education and Research Bhopal); Vaibhav Kumar (Indian Institute of Science Education and Research Bhopal)
PDF Poster Video (Right click to download) 

1004
RP-SAM2: Refining Point Prompts for Stable Surgical Instrument Segmentation
Nuren Zhaksylyk (Mohamed bin Zayed University of Artificial Intelligence); Ibrahim Almakky (Mohamed bin Zayed University of Artificial Intelligence); Jay Nitin Paranjape (Johns Hopkins University); S. Swaroop Vedula (Johns Hopkins University); Shameema Sikder (Johns Hopkins University); Vishal M. Patel (Johns Hopkins University); Mohammad Yaqub (Mohamed bin Zayed University of Artificial Intelligence)
PDF Poster Video (Right click to download) 

1006
Verifier Matters: Enhancing Inference-Time Scaling for Video Diffusion Models
Lorenzo Baraldi (University of Pisa); Davide Bucciarelli (University of Pisa); Zifan Zeng (Technische Universität München); Chongzhe Zhang (Technische Universität Berlin); Qunli Zhang (Huawei Technologies Ltd.); Marcella Cornia (University of Modena and Reggio Emilia); Lorenzo Baraldi (University of Modena and Reggio Emilia); Feng Liu (Huawei Technologies Ltd.); Zheng Hu (Huawei Technologies Ltd.); Rita Cucchiara (University of Modena and Reggio Emilia)
PDF Poster Video (Right click to download) 

1007
Is Structural Awareness the Key to Event Camera Data Cleansing for Enhancing Veracity?
Haiyu Li (The University of Sheffield); Charith Abhayaratne (The University of Sheffield)
PDF Poster Video (Right click to download) 

1013
SALT: Parameter-Efficient Fine-Tuning via Singular Value Adaptation with Low-Rank Transformation
Abdelrahman Elsayed (Mohamed bin Zayed University of Artificial Intelligence); Sarim Hashmi (Mohamed bin Zayed University of Artificial Intelligence); Mohammed Elseiagy (Mohamed bin Zayed University of Artificial Intelligence); Hu Wang (Mohamed bin Zayed University of Artificial Intelligence); Mohammad Yaqub (Mohamed bin Zayed University of Artificial Intelligence); Ibrahim Almakky (Mohamed bin Zayed University of Artificial Intelligence)
PDF Poster Video (Right click to download) 

1015
In-Model Merging for Enhancing the Robustness of Medical Imaging Classification Models
Hu Wang (Mohamed bin Zayed University of Artificial Intelligence); Ibrahim Almakky (Mohamed bin Zayed University of Artificial Intelligence); Congbo Ma (New York University); Numan Saeed (Mohamed bin Zayed University of Artificial Intelligence); Mohammad Yaqub (Mohamed bin Zayed University of Artificial Intelligence)
PDF Poster Video (Right click to download) 

1024
Quantifying Risk in Pedestrian Crowds Using Divergence Estimated from Flows of Head-Tracking Data
Haruto Nakayama (University of Tsukuba); Masaki Onishi (AIST)
PDF Poster Video (Right click to download) 

1029
PrIINeR: Towards Prior-Informed Implicit Neural Representations for Accelerated MRI
Ziad Al-Haj Hemidi (Universität zu Lübeck); Eytan Kats (Universität zu Lübeck); Mattias P. Heinrich (Universität zu Lübeck)
PDF Poster Video (Right click to download) 

1030
Language-Guided Decision Override for Adaptive and Retraining-Free Video Anomaly Detection
Ryo Moriyama (Aoyama Gakuin University); Shin Suzuki (Aoyama Gakuin University); Naoshi Kaneko (Tokyo Denki University); Kazuhiko Sumi (Aoyama Gakuin University)
PDF Poster Video (Right click to download) 

1035
Superpixel Anything: A general object-based framework for accurate yet regular superpixel segmentation
Julien Walther (University of Bordeaux); Rémi Giraud (University of Bordeaux); Michaël Clément (University of Bordeaux)
PDF Poster Video (Right click to download) 

1036
Benchmarking Vision Foundation Models for Input Monitoring in Autonomous Driving
Mert Keser (Continental AG, Technische Universität München); Halil Ibrahim Orhan (Technische Universität München); Niki Amini-Naieni (University of Oxford); Gesina Schwalbe (Universität zu Lübeck); Alois C. Knoll (Technical University Munich); Matthias Rottmann (Universität Osnabrück)
PDF Poster Video (Right click to download) 

1037
Generative Data Augmentation for Object Point Cloud Segmentation
Dekai Zhu (Technische Universität München); Stefan Gavranovic (Siemens AG); Flavien Boussuge (Siemens AG); Benjamin Busam (Technische Universität München); Slobodan Ilic (Technische Universität München)
PDF Poster Video (Right click to download) 

1051
Conditional Prototype Learning for Few-Shot Object Detection
Zhenwei He (Chongqing University of Technology); Xinye Liao (Chongqing University of Technology); Xin Feng (Chongqing University of Technology)
PDF Poster Video (Right click to download) 

1058
Atomizer: Generalizing to unseen modalities by breaking images down to a set of scalars
Hugo Riffaud de Turckheim (INRIA); Diego Marcos (INRIA); Roberto Interdonato (CIRAD); Sylvain Lobry (Université de Montpellier)
PDF Poster Video (Right click to download) 

1060
Learning Correlation-aware Aleatoric Uncertainty for 3D Hand Pose Estimation
Lee Chae-Yeon (Pohang University of Science and Technology); Nam Hyeon-Woo (Pohang University of Science and Technology); Tae-Hyun Oh (Korea Advanced Institute of Science and Technology)
PDF Poster Video (Right click to download) 

1061
IPGPhormer: Interpretable Pathology Graph-Transformer for Survival Analysis
Guo Tang (Harbin Institute of Technology Shenzhen); Songhan Jiang (Harbin Institute of Technology Shenzhen); Jinpeng Lu (University of Science and Technology of China); Linghan Cai (Harbin Institute of Technology Shenzhen); Yongbing Zhang (Harbin Institute of Technology Shenzhen)
PDF Poster Video (Right click to download) 

1062
Calibration-Aware Prompt Learning for Medical Vision-Language Models
Abhishek Basu (Mohamed bin Zayed University of Artificial Intelligence); Fahad Shamshad (Mohamed bin Zayed University of Artificial Intelligence); Ashshak Sharifdeen (Mohamed bin Zayed University of Artificial Intelligence); Karthik Nandakumar (Michigan State University); Muhammad Haris Khan (Mohamed bin Zayed University of Artificial Intelligence)
PDF Poster Video (Right click to download) 

1064
3D-WAG: Hierarchical Wavelet-Guided Autoregressive Generation for High-Fidelity 3D Shapes
Tejaswini Medi (Universität Mannheim); Arianna Rampini (Autodesk); Pradyumna Reddy (Autodesk); Pradeep Kumar Jayaraman (Autodesk); Margret Keuper (Universität Mannheim)
PDF Poster Video (Right click to download) 

1071
Stabilizing Open-Set Test-Time Adaptation via Primary-Auxiliary Filtering and Knowledge-Integrated Prediction
Byung-Joon Lee (Sungkyunkwan University); Jin-Seop Lee (Sungkyunkwan University); Jee-Hyong Lee (Sungkyunkwan University)
PDF Poster Video (Right click to download) 

1072
Controllable Garment Generation with Multi-Modal Diffusion Guidance
Sanhita Pathak (IIT Delhi); Vinay Kaushik (IIIT Sonepat); Brejesh Lall (IIT Delhi)
PDF Poster Video (Right click to download) 

1076
CLFSeg: A Fuzzy-Logic based Solution for Boundary Clarity and Uncertainty Reduction in Medical Image Segmentation
Anshul Kaushal (Panjab University); Kunal Jangid (Indian Institute of Science Education and Research Bhopal); Vinod K. Kurmi (Indian Institute of Science Education and Research Bhopal)
PDF Poster Video (Right click to download) 

1077
Towards Sharper Object Boundaries in Self-Supervised Depth Estimation
Aurélien Cecille (LIRIS); Stefan Duffner (LIRIS); Franck Davoine (LIRIS); Rémi Agier (Visual Behavior); Thibault Neveu (Visual Behavior)
PDF Poster Video (Right click to download) 

1081
ImProvShow: Multimodal Fusion for Image Provenance Summarization
Alexander Black (University of Surrey); Jing Shi (Adobe Research); Yifei Fan (Adobe Research); John Collomosse (Adobe Research)
PDF Poster Video (Right click to download) 

1091
CLIMB-3D: Class-Incremental Imbalanced 3D Instance Segmentation
Vishal Thengane (University of Surrey); Jean Lahoud (Mohamed bin Zayed University of Artificial Intelligence); Hisham Cholakkal (Mohamed bin Zayed University of Artificial Intelligence); Rao Muhammad Anwer (Mohamed bin Zayed University of Artificial Intelligence); Lu Yin (University of Surrey); Xiatian Zhu (University of Surrey); Salman Khan (Mohamed bin Zayed University of Artificial Intelligence)
PDF Poster Video (Right click to download) 

1095
Supervised Segmentation Model for Improved Detection of OSSN using Slit Lamp Images
Kajal Singh (Indian Institute of Technology Mandi); Shagun Bhatt (Indian Institute of Technology Mandi); Ramkailash Gujar (Dr Shroff’s Charity Eye Hospital); Arnav Bhavsar (Indian Institute of Technology Mandi); Dinesh Singh (Indian Institute of Technology Mandi)
PDF Poster Video (Right click to download) 

1096
LED: Light Enhanced Depth Estimation at Night
Simon de Moreau (Mines Paris - PSL University); Yasser Almehio (Valeo AI); Andrei Bursuc (Valeo AI); Hafid El Idrissi (Valeo AI); Bogdan Stanciulescu (Mines Paris - PSL University); Fabien Moutarde (Mines Paris - PSL University)
PDF Poster Video (Right click to download) 

1109
AKD-BNN: Adaptive Kernel Dynamic Bayesian Neural Networks for Enhanced Medical Image Segmentation with Uncertainty Estimation
Qianying He (City University of Macau); Xuan Liu (City University of Macau); Wenjian Liu (City University of Macau)
PDF Poster Video (Right click to download) 

1110
Robust Human Registration with Body Part Segmentation on Noisy Point Clouds
Kai Lascheit (ETH Zurich); Francis Engelmann (Stanford University); Daniel Barath (ETH Zurich); Marc Pollefeys (Microsoft); Leonidas Guibas (Stanford University)
PDF Poster Video (Right click to download) 

1118
Conformal Predictors for Efficient Video Text Spotting
Amor Ben Tanfous (Terminal Industries); Sankha Subhra Mukherjee (Terminal Industries); Neil M. Robertson (Terminal Industries)
PDF Poster 

1121
MIAS-SAM: Medical Image Anomaly Segmentation without thresholding
Marco Colussi (University of Milan); Dragan Ahmetovic (University of Milan); Sergio Mascetti (University of Milan)
PDF Poster Video (Right click to download) 

1123
Evaluating Self-Supervised Learning in Medical Imaging: A Systematic Investigation of Robustness, Generalizability, and Multi-Domain Impact
Valay Bundele (University of Tuebingen); Karahan Sarıtaş (University of Tuebingen); Bora Kargi (University of Tuebingen); Oğuz Ata Çal (University of Tuebingen); Kıvanç Tezören (University of Tuebingen); Zohreh Ghaderi (University of Tuebingen); Hendrik Lensch (University of Tuebingen)
PDF Poster Video (Right click to download) 

1127
ALFred: An Active Learning Framework for Real-world Semi-supervised Anomaly Detection with Adaptive Thresholds
Shanle Yao (University of North Carolina at Charlotte); Ghazal Alinezhad Noghre (University of North Carolina at Charlotte); Armin Danesh Pazho (University of North Carolina at Charlotte); Hamed Tabkhivayghan (University of North Carolina at Charlotte)
PDF Poster Video (Right click to download) 

1132
Isolated Channel Vision Transformers: From Single-Channel Pretraining to Multi-Channel Finetuning
Wenyi Lian (Uppsala University); Patrick Micke (Uppsala University); Joakim Lindblad (Uppsala University); Nataša Sladoje (Uppsala University)
PDF Poster Video (Right click to download) 

1134
TRUDI and TITUS: A Multi-Perspective Dataset and A Three-Stage Recognition System for Transportation Unit Identification
Emre Gülsoylu (Universität Hamburg); André Peter Kelm (Universität Hamburg); Lennart Bengtson (Universität Hamburg); Matthias Hirsch (Universität Hamburg); Christian Wilms (Universität Hamburg); Tim Rolff (Universität Hamburg); Janick Edinger (University of Hamburg); Simone Frintrop (Universität Hamburg)
PDF Poster Video (Right click to download) 

1135
The Trauma THOMPSON Dataset for Real-World Emergency AI
Yupeng Zhuo (Purdue University); Eddie Zhang (Purdue University); Xiangchen Yu (Purdue University); Aditya Pachpande (Purdue University); Andrew Wallace Kirkpatrick (University of Calgary); Kyle Couperus (The Geneva Foundation); Jessica Mckee (University of Calgary); Juan Wachs (Purdue University)
PDF Poster Video (Right click to download) 

1137
ETTA: Efficient Test-Time Adaptation for Vision-Language Models through Dynamic Embedding Updates
Hamidreza Dastmalchi (York University); Aijun An (York University); Ali Cheraghian (CSIRO)
PDF Poster Video (Right click to download) 

1156
Hierarchical Image-Guided 3D Point Cloud Segmentation in Industrial Scenes via Multi-View Bayesian Fusion
Yu Zhu (Tohoku University); Naoya Chiba (Osaka University); Koichi Hashimoto (Tohoku University)
PDF Poster Video (Right click to download) 

1161
Evaluating Perceptual Distance Models by Fitting Binomial Distributions to Two-Alternative Forced Choice Data
Alexander Hepburn (University of Bristol); Raúl Santos-Rodriguez (University of Bristol); Javier Portilla (Consejo Superior de Investigaciones Científicas (CSIC))
PDF Poster Video (Right click to download) 

1171
Conflict-Aware Adversarial Training
Zhiyu Xue (UC Santa Barbara); Haohan Wang (University of Illinois at Urbana-Champaign); Yao Qin (UC Santa Barbara); Ramtin Pedarsani (UC Santa Barbara)
PDF Poster Video (Right click to download) 

1177
Binarizing Severely Degraded Ancient Bamboo Slips: Dataset and Baseline
Chongsheng Zhang (Henan University, Ludwig-Maximilians-Universität München); Wanwan Fu (Henan University); Qilong Li (Henan University); Samra Zafar (Henan University); Zhanshuo Zhang (Henan University); Qiyan Li (Henan University); Gaojuan Fan (Henan University); Christian Heumann (Ludwig-Maximilians-Universität München)
PDF Poster Video (Right click to download) 

1180
MO-SHW: Hierarchy-Aware Multi-Objective Optimization for Open-World Segmentation
Erico M. Pereira (Universidade Federal de Minas Gerais); Frederico Gadelha Guimarães (Universidade Federal de Minas Gerais); Jefersson A Dos Santos (University of Sheffield)
PDF Poster Video (Right click to download) 

1183
Contrastive Point Feature Matching for Open-world Object Counting
Ngo Xuan Cuong (University of Arkansas)
PDF Poster Video (Right click to download) 

1187
Out-of-Distribution Detection from Small Training Sets using Bayesian Neural Network Classifiers
Kevin Raina (University of Ottawa); Tanya Schmah (University of Ottawa)
PDF Poster Video (Right click to download) 

1196
IoSR: End-to-End Intraoral Scans Repairing
Manel Farhat (Digital Research Center of Sfax Laboratory of Signals, Systems, Artificial Intelligence and Networks Technopark of Sfax); Achraf Ben-hamadou (Digital Research Center of Sfax Laboratory of Signals, Systems, Artificial Intelligence and Networks Technopark of Sfax); Ahmed Rekik (Digital Research Center of Sfax Laboratory of Signals, Systems, Artificial Intelligence and Networks Technopark of Sfax); Ons Abida (Digital Research Center of Sfax Laboratory of Signals, Systems, Artificial Intelligence and Networks Technopark of Sfax); Oussama Smaoui (Biotech Dental Group)
PDF Poster Video (Right click to download) 

1199
AULUNet: An Adaptive Ultra-Lightweight U-Net Framework for Efficient Skin Lesion Segmentation in Resource-Constrained Environments
Md Maklachur Rahman (Texas A&M University); Soon Ki Jung (Kyungpook National University); Tracy Hammond (Texas A&M University)
PDF Poster Video (Right click to download) 

1209
Language-Guided Reinforcement Learning for Hard Attention in Few-Shot Learning
Bahareh Nikpour (McGill University); Narges Armanfard (McGill University)
PDF Poster Video (Right click to download) 

1220
Hair Strand Reconstruction based on 3D Gaussian Splatting
Yimin Pan (Technical University of Munich); Matthias Nießner (Technical University of Munich); Tobias Kirschstein (Technical University of Munich)
PDF Poster Video (Right click to download) 

1224
RASALoRE: Region Aware Spatial Attention with Location-based Random Embeddings for Weakly Supervised Anomaly Detection in Brain MRI Scans
Bheeshm Sharma (IIT Bombay); Karthikeyan Jaganathan (IIT Bombay); Balamurugan Palaniappan (IIT Bombay)
PDF Poster Video (Right click to download) 

1257
End-to-End LiDAR optimization for point cloud registration
Siddhant Katyan (Université Laval); Marc-Andre Gardner (Bentley Systems); Jean-Francois Lalonde (Université Laval)
PDF Poster Video (Right click to download) 

If there are any mistakes on this page, please do not hesitate to contact bmvc@bmvc2025.org