| 12 |
Part Segmentation and Motion Estimation for Articulated Objects with Dynamic 3D Gaussians Jun-Jee Chao (University of Minnesota); Qingyuan Jiang (University of Minnesota); Volkan Isler (University of Texas at Austin) PDF Poster Video (Right click to download) |
| 18 |
Volumetric Temporal Texture for Smoke Stylization using Dynamic Radiance Fields Dongqing Wang (École Polytechnique Fédérale de Lausanne (EPFL)); Ehsan Pajouheshgar (École Polytechnique Fédérale de Lausanne (EPFL)); Yitao Xu (École Polytechnique Fédérale de Lausanne (EPFL)); Tong Zhang (École Polytechnique Fédérale de Lausanne (EPFL)); Sabine Süsstrunk (École Polytechnique Fédérale de Lausanne (EPFL)) PDF Poster Video (Right click to download) |
| 22 |
Learning from Synthetic Data for Visual Grounding Ruozhen He (Rice University); Ziyan Yang (Rice University); Paola Cascante-Bonilla (Stony Brook University); Alexander C. Berg (University of California, Irvine); Vicente Ordonez (Rice University) PDF Poster Video (Right click to download) |
| 24 |
MSMVD: Exploiting Multi-scale Image Features via Multi-scale BEV Features for Multi-view Pedestrian Detection Taiga Yamane (NTT Corporation); Satoshi Suzuki (NTT Corporation); Ryo Masumura (NTT Corporation); Shota Orihashi (NTT Corporation); Tomohiro Tanaka (NTT Corporation); Mana Ihori (NTT Corporation); Naoki Makishima (NTT Corporation); Naotaka Kawata (NTT Corporation) PDF Poster |
| 26 |
Guiding a diffusion model with itself using sliding windows Nikolas Adaloglou (Heinrich Heine University); Tim Kaiser (Heinrich Heine University); Damir Iagudin (Heinrich Heine University); Markus Kollmann (Heinrich Heine University) PDF Poster Video (Right click to download) |
| 28 |
Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition Chun-Hsiao Yeh (UC Berkeley); Ta-Ying Cheng (University of Oxford); He-Yen Hsieh (Harvard University); David Chuan-En Lin (Carnegie Mellon University); Yi Ma (University of Hong Kong); Andrew Markham (University of Oxford); Niki Trigoni (University of Oxford); H. T. Kung (Harvard University); Yubei Chen (UC Davis) PDF Poster Video (Right click to download) |
| 29 |
SPARTAN: Spatiotemporal Pose-Aware Retrieval for Text-guided Autonomous Navigation Xiangyu Bai (Northeastern University); Sai Anish Sreeramagiri (Northeastern University); Sai Siddhartha Vivek Dhir Rangoju (Northeastern University); Bishoy Galoaa (Northeastern University); Eric C Mortin (DEVCOM Analysis Center (DAC), US Army); Sarah Ostadabbas (Northeastern University) PDF Poster Video (Right click to download) |
| 37 |
Cross-Modal Scene Semantic Alignment for Image Complexity Assessment Yuqing Luo (Cardiff University); YIXIAO LI (Beihang University); Jiang Liu (Cardiff University); Jun Fu (Cardiff University); Hadi Amirpour (University of Klagenfurt); Guanghui Yue (Shenzhen University); Baoquan Zhao (SUN YAT-SEN UNIVERSITY); Padraig Corcoran (Cardiff University); Hantao Liu (Cardiff University); Wei Zhou (Cardiff University) PDF Poster Video (Right click to download) |
| 38 |
GC-Font: Few-Shot Font Generation via Global Contextual Feature Modelling Weiran Chen (Soochow University); Guiqian Zhu (Soochow University); Ying Li (Soochow University); Yi Ji (Soochow University); Chunping Liu (Soochow University) PDF Poster Video (Right click to download) |
| 39 |
DocAttentionRect: Attention-Guided Document Image Rectification Pooja Kumari (Indian Institute of Technology Madras); Sukhendu Das (Indian Institute of Technology Madras) PDF Poster Video (Right click to download) |
| 43 |
Split Matching for Inductive Zero-shot Semantic Segmentation Jialei Chen (Nagoya University); Xu Zheng (The Hong Kong University of Science and Technology (Guangzhou) (HKUST (GZ))); Dongyue Li (Nagoya University); Chong Yi (Nagoya University); Seigo Ito (Nagoya University); Danda Pani Paudel (Artificial Intelligence and Technology (INSAIT), Sofia University); Luc Van Gool (Artificial Intelligence and Technology (INSAIT), Sofia University); Hiroshi Murase (Nagoya University); Daisuke Deguchi (Nagoya University) PDF Poster Video (Right click to download) |
| 55 |
Piezoelectric Acoustic Sensing for Sitting Pose Classification Yuuki Shibuya (Tokyo University of Science); Go Irie (Tokyo University of Science) PDF Poster Video (Right click to download) |
| 56 |
Pointly-Supervised Weak-Shot Semantic Segmentation via Dual Mapping Transfer Wenhui Jiang (Jiangxi University of Finance and Economics); Ruikang Luo (Jiangxi University of Finance and Economics); Zeyu Luo (Jiangxi University of Finance and Economics); Xiaowei Zhao (Sany Heavy Industry Co.); Junjie Chen (Jiangxi University of Finance and Economics); Yuming Fang (Jiangxi University of Finance and Economics) PDF Poster Video (Right click to download) |
| 59 |
CMAMRNet: A Contextual Mask-Aware Network Enhancing Mural Restoration Through Comprehensive Mask Guidance Yingtie Lei (University of Macau); Fanghai Yi (Guangdong University of Technology); Yihang Dong (University of Chinese Academy of Sciences); Weihuang Liu (University of Macau); Xiaofeng Zhang (Shanghai Jiao Tong University); Zimeng Li (Shenzhen Polytechnic University); Chi-Man Pun (University of Macau); Xuhang Chen (University of Macau) PDF Poster Video (Right click to download) |
| 66 |
ST-GDance: Long-Term and Collision-Free Group Choreography from Music Jing Xu (Monash University); Weiqiang Wang (Monash University); Cunjian Chen (Monash University); Jun Liu (Lancaster University); Qiuhong Ke (Monash University) PDF Poster Video (Right click to download) |
| 69 |
TACTFL: Temporal Contrastive Training for Multi-modal Federated Learning with Similarity-guided Model Aggregation Guanxiong Sun (University of Bristol); Majid Mirmehdi (University of Bristol); Zahraa S. Abdallah (University of Bristol); Raúl Santos-Rodriguez (University of Bristol); Ian James Craddock (University of Bristol); Telmo M Silva Filho (University of Bristol) PDF Poster |
| 77 |
Training-Free Synthetic Data Generation with Dual IP-Adapter Guidance Luc Boudier (École Polytechnique); Loris Manganelli (École Polytechnique); Eleftherios Tsonis (École Polytechnique); Nicolas Dufour (École Polytechnique, École des Ponts); Vicky Kalogeiton (École Polytechnique) PDF Poster Video (Right click to download) |
| 81 |
SSNeRF: Sparse View Semi-supervised Neural Radiance Fields with Augmentation Xiao Cao (National University of Singapore); Beibei Lin (National University of Singapore); Bo Wang (University of Mississippi); Zhiyong Huang (National University of Singapore); Robby T. Tan (National University of Singapore) PDF Poster Video (Right click to download) |
| 83 |
RPD-Diff: Region-Adaptive Physics-Guided Diffusion Model for Visibility Enhancement under Dense and Non-Uniform Haze Ruicheng Zhang (SUN YAT-SEN UNIVERSITY); Puxin Yan (SUN YAT-SEN UNIVERSITY); Zeyu Zhang (The Australian National University); Yicheng Chang (SUN YAT-SEN UNIVERSITY); Hongyi Chen (SUN YAT-SEN UNIVERSITY); Zhi Jin (SUN YAT-SEN UNIVERSITY) PDF Poster Video (Right click to download) |
| 84 |
Continual Vision-and-Language Navigation SeongJun Jeong (Seoul National University); Gi-Cheon Kang (Seoul National University); Seongho Choi (Seoul National University); Joochan Kim (Korea Institute of Science and Technology); Byoung-Tak Zhang (Seoul National University) PDF Poster Video (Right click to download) |
| 88 |
CRCE: Coreference-Retention Concept Erasure in Text-to-Image Diffusion Models Yuyang Xue (University of Edinburgh); Edward Moroshko (University of Edinburgh); Feng Chen (University of Edinburgh); Jingyu Sun (University of Edinburgh); Steven G. McDonagh (University of Edinburgh); Sotos Tsaftaris (University of Edinburgh) PDF Poster Video (Right click to download) |
| 102 |
STAIN: Smooth Tile-Aware Instance Normalisation for Virtual Staining David Armstrong (The Queen's University Belfast); Iain B Styles (The Queen's University Belfast); Niall McLaughlin (The Queen's University Belfast) PDF Poster Video (Right click to download) |
| 113 |
Prompt-Based Exemplar Super-Compression and Regeneration for Class-Incremental Learning Ruxiao Duan (Yale University); Jieneng Chen (Johns Hopkins University); Adam Kortylewski (University of Freiburg); Alan Yuille (Johns Hopkins University); Yaoyao Liu (University of Illinois Urbana-Champaign) PDF Poster Video (Right click to download) |
| 117 |
RAPrivacy: a Readable Anonymizer for Privacy Preserving Action Recognition ZiZhen Wang (National Yang Ming Chiao Tung University); Yen-Lung Chu (National Yang Ming Chiao Tung University); Pei-Chun Tsai (National Yang Ming Chiao Tung University); Kuan-Wen Chen (National Yang Ming Chiao Tung University) PDF Poster Video (Right click to download) |
| 121 |
BOTM: Echocardiography Segmentation via Bi-directional Optimal Token Matching Zhihua Liu (University of Leicester); Lei Tong (AstraZeneca); Xilin He (The Chinese University of Hong Kong); Che Liu (Imperial College London); Rossella Arcucci (Imperial College London); Chen Jin (AstraZeneca); Huiyu Zhou (University of Leicester) PDF Poster Video (Right click to download) |
| 122 |
Distribution-guided Generative Replay with Semantic Prompts for Class-Incremental Chest X-ray Diagnosis Jayant Mahawar (Indian Institute of Technology Jodhpur); Devi Prasad Maharathy (Indian Institute of Technology Jodhpur); Angshuman Paul (Indian Institute of Technology Jodhpur) PDF Poster Video (Right click to download) |
| 124 |
Mask2Act: Predictive Multi-Object Tracking as Video Pre-Training for Robot Manipulation Junbo Zhang (Tsinghua University); Kaisheng Ma (Tsinghua University) PDF Poster Video (Right click to download) |
| 126 |
CLAIR: CLIP-Aided Weakly Supervised Zero-Shot Cross-Domain Image Retrieval Chor Boon Tan (National University of Singapore); Conghui Hu (National University of Singapore); Gim Hee Lee (National University of Singapore) PDF Poster Video (Right click to download) |
| 128 |
Uncertainty Diffusion: Parameter-Efficient Depth Refinement via Uncertainty-Guided Diffusion Models Jeng-Huo Tzeng (National Yang Ming Chiao Tung University); Chuan-Yuan Huang (National Yang Ming Chiao Tung University); Kuan-Wen Chen (National Yang Ming Chiao Tung University) PDF Poster Video (Right click to download) |
| 131 |
Dual-Stream Attention with Multi-Modal Queries for Object Detection in Transportation Applications Noreen Anwar (LITIV, Polytechnique Montréal); Guillaume-Alexandre Bilodeau (LITIV, Polytechnique Montréal); Wassim Bouachir (Université du Québec (TELUQ)) PDF Poster Video (Right click to download) |
| 133 |
M$^2$StyleGS: Multi-Modality 3D Style Transfer with Gaussian Splatting Xingyu Miao (Durham University); Xueqi Qiu (Durham University); Haoran Duan (Tsinghua University); Yawen Huang (Tencent Jarvis Lab); Xian Wu (Tencent Jarvis Lab); Jingjing Deng (University of Bristol); Yang Long (Durham University) PDF Poster Video (Right click to download) |
| 136 |
Lang4D: Weakly Supervised Learning of 4D Language Splatting Mana Masuda (CyberAgent, Keio University); Taiki Sekii (CyberAgent) PDF Poster Video (Right click to download) |
| 138 |
SemanticControl: A Training-Free Approach for Handling Loosely Aligned Visual Conditions in ControlNet Woosung Joung (Korea University); Daewon Chae (University of Michigan - Ann Arbor); Jinkyu Kim (Korea University) PDF Poster Video (Right click to download) |
| 147 |
Spatial-Frequency Domain Aggregation for Visual Place Recognition Chaoqun Wang (South China Normal University); Shaobo Min (Tencent) PDF Poster Video (Right click to download) |
| 150 |
WTNet: A Weather Transfer Network for Domain-Adaptive All-In-One Adverse Weather Image Restoration Si-Yu Huang (National Yang Ming Chiao Tung University); Fu-Jen Tsai (National Tsing Hua University); Chia-Wen Lin (National Tsing Hua University); Yen-Yu Lin (National Yang Ming Chiao Tung University) PDF Poster Video (Right click to download) |
| 154 |
Seed-to-Seed: Unpaired Image Translation in Diffusion Seed Space Or Greenberg (General Motors R&D, The Hebrew University of Jerusalem); Eran Kishon (General Motors R&D); Dani Lischinski (The Hebrew University of Jerusalem) PDF Poster |
| 159 |
Identity-Motion Trade-offs in Text-to-Video Generation Yuval Atzmon (NVIDIA Research); Rinon Gal (NVIDIA Research); Yoad Tewel (NVIDIA Research); Yoni Kasten (NVIDIA Research); Gal Chechik (NVIDIA Research) PDF Poster |
| 165 |
Four eyes see more than two: Dataset Distillation with Mixture-of-Experts Jia-Jiun Yao (National Yang Ming Chiao Tung University); Sheng-Feng Yu (National Yang Ming Chiao Tung University, Macronix International Co., Ltd.); Wei-Chen Chiu (National Yang Ming Chiao Tung University) PDF Poster Video (Right click to download) |
| 168 |
PSScreen: Partially Supervised Multiple Retinal Disease Screening Boyi Zheng (University of Oulu); Qing Liu (University of Oulu) PDF Poster Video (Right click to download) |
| 173 |
CellMamba: Adaptive Mamba for Accurate and Efficient Cell Detection Ruochen Liu (University of Liverpool); Yi Tian (National University of Singapore); Jiahao Wang (Xi'an Jiaotong-Liverpool University); Hongbin Liu (Xi'an Jiaotong-Liverpool University); Xianxu Hou (Xi'an Jiaotong-Liverpool University); Jingxin Liu (Xi'an Jiaotong-Liverpool University) PDF Poster Video (Right click to download) |
| 181 |
LuKAN: A Kolmogorov-Arnold Network Framework for 3D Human Motion Prediction Md Zahidul Hasan (Concordia University); Abdessamad Ben Hamza (Concordia University); Nizar Bouguila (Concordia University) PDF Poster Video (Right click to download) |
| 182 |
eXtended Multimodal Composite Association Score (xMCAS): A Gender Inclusive Approach to Measurement of Bias in Text-To-Image Diffusion Models Abhishek Mandal (Dublin City University); Susan Leavy (University College Dublin); Suzanne Little (Dublin City University) PDF Poster Video (Right click to download) |
| 184 |
Graph Similarity Learning of Floor Plans Casper van Engelenburg (Delft University of Technology); Jan van Gemert (Delft University of Technology); Seyran Khademi (Delft University of Technology) PDF Poster |
| 195 |
Proto-FG3D: Prototype-based Interpretable Fine-Grained 3D Shape Classification Shuxian Ma (University of Jinan); Zihao Dong (University of Jinan); Runmin Cong (Shandong University); Sam Tak-Wu Kwong (Lingnan University); Xiuli Shao (Nankai University) PDF Poster Video (Right click to download) |
| 211 |
Efficient Image Restoration via Latent Consistency Flow Matching Elad Cohen (Sony Semiconductor Israel); Idan Achituve (Sony Semiconductor Israel); Idit Diamant (Sony Semiconductor Israel); Arnon Netzer (Sony Semiconductor Israel); Hai Victor Habi (Sony Semiconductor Israel) PDF Poster Video (Right click to download) |
| 218 |
Bridging Visual-Textual Modalities: Weakly Supervised Histopathology Segmentation Sicong Gao (University of New South Wales); Matthew AB Baker (University of New South Wales); Maurice Pagnucco (University of New South Wales); Zhiwei Gao (Hebei Key Laboratory of Electromagnetic Environmental Effects and Information Processing); Yang Song (University of New South Wales) PDF Poster Video (Right click to download) |
| 220 |
LiDAR MOT-DETR: A LiDAR-based Two-Stage Transformer for 3D Multiple Object Tracking Martha Teiko Teye (University of Wuppertal); Ori Maoz (Aptiv); Matthias Rottmann (Universität Osnabrück) PDF Poster Video (Right click to download) |
| 228 |
Dual-Expert Collaborative Network for Fake News Detection with External Knowledge Integration Wenxin Luo (Beijing Jiaotong University); Zhu Teng (Beijing Jiaotong University); Wei Zhang (Beijing Jiaotong University); Baopeng Zhang (Beijing Jiaotong University) PDF Poster Video (Right click to download) |
| 232 |
Unsupervised Video Continual Learning via Non-Parametric Deep Embedded Clustering Nattapong Kurpukdee (University of York); Adrian G. Bors (University of York) PDF Poster Video (Right click to download) |
| 238 |
Capture and Reconstruct 3D Clothed Human from Images Onat Vuran (ETH Zürich); Hsuan-I Ho (ETH Zürich) PDF Poster Video (Right click to download) |
| 239 |
REACT: Real-time Efficiency and Accuracy Compromise for Tradeoffs in Scene Graph Generation Maëlic Neau (Umeå University); Paulo Eduardo Santos (PrioriAnalytica); Anne-Gwenn Bosser (Bretagne INP); Akihiro Sugimoto (National Institute of Informatics); Cedric Buche (CNRS IRL 2010 CROSSING, IMT Atlantique) PDF Poster Video (Right click to download) |
| 257 |
S$^2$V2V: Training-Free Video-to-Video with Sparse Points and Motion Guidance Xinyu Zhang (University of Auckland); Zicheng Duan (The University of Adelaide); Dong Gong (University of New South Wales); Lingqiao Liu (University of Adelaide) PDF Poster Video (Right click to download) |
| 259 |
Revisiting Entropy Minimization for Long-Sequence Continual Test-Time Adaptation WeiQin Chuah (Royal Melbourne Institute of Technology); Ruwan Tennakoon (Royal Melbourne Institute of Technology); Alireza Bab-Hadiashar (Royal Melbourne Institute of Technology) PDF Poster Video (Right click to download) |
| 261 |
ADIR: Adaptive Diffusion for Image Reconstruction Shady Abu-Hussein (Tel Aviv University); Tom Tirer (Tel Aviv University); Raja Giryes (Bar Ilan University) PDF Poster Video (Right click to download) |
| 265 |
Enhancing Visual Tracking by Leveraging High-frequency Information within Event Signals Yuheng Jiang (University of Science and Technology of China); Hebei Li (University of Science and Technology of China); Dachun Kai (University of Science and Technology of China); Yansong Peng (University of Science and Technology of China); Jiahui Yuan (University of Science and Technology of China); Peilin Xiao (University of Science and Technology of China); Yueyi Zhang (MiroMind); Xiaoyan Sun (University of Science and Technology of China) PDF Poster Video (Right click to download) |
| 270 |
FaceGCD: Generalized Face Discovery via Dynamic Prefix Generation Yunseok Oh (Inha University, AutoLabs, Inc.); Dong-Wan Choi (Inha University) PDF Poster Video (Right click to download) |
| 272 |
B-RIGHT: Benchmark Re-evaluation for Integrity in Generalized Human-Object Interaction Testing Yoojin Jang (Ulsan National Institute of Science and Technology); Junsu Kim (Ulsan National Institute of Science and Technology); Ha Yeon Kim (Ulsan National Institute of Science and Technology); Eun-Ki Lee (Hanyang University); Eun-Sol Kim (Hanyang University); Seungryul Baek (Ulsan National Institute of Science and Technology); Jaejun Yoo (Ulsan National Institute of Science and Technology) PDF Poster Video (Right click to download) |
| 276 |
Learning Event-guided Exposure-agnostic Video Frame Interpolation via Adaptive Feature Blending Junsik Jung (KAIST); Yoonki Cho (KAIST); Woo Jae Kim (KAIST); Lin Wang (Nanyang Technological University); Sung-Eui Yoon (KAIST) PDF Poster Video (Right click to download) |
| 277 |
CFFlow: An Optical Flow Estimation Hinging on Cross-Frequency Attention Mengfei Wang (Shanghai Institute of Microsystem and Information Technology); Dongchen Zhu (Shanghai Institute of Microsystem and Information Technology); Lei Wang (Shanghai Institute of Microsystem and Information Technology); Jiamao Li (Shanghai Institute of Microsystem and Information Technology) PDF Poster Video (Right click to download) |
| 281 |
CLIP meets DINO for Tuning Zero-Shot Classifier using Unlabeled Image Collections Mohamed Fazli Mohamed Imam (Mohamed bin Zayed University of Artificial Intelligence); Rufael Fekadu Marew (Mohamed bin Zayed University of Artificial Intelligence); Jameel Hassan Abdul Samadh (Johns Hopkins University); Mustansar Fiaz (IBM Research); Alham Fikri Aji (Mohamed bin Zayed University of Artificial Intelligence); Hisham Cholakkal (Mohamed bin Zayed University of Artificial Intelligence) PDF Poster Video (Right click to download) |
| 283 |
AegisRF: Adversarial Perturbations Guided with Sensitivity for Protecting Intellectual Property of Neural Radiance Fields Woo Jae Kim (Korea Advanced Institute of Science and Technology (KAIST)); Kyu Beom Han (Korea Advanced Institute of Science and Technology (KAIST)); Yoonki Cho (Korea Advanced Institute of Science and Technology (KAIST)); Youngju Na (Korea Advanced Institute of Science and Technology (KAIST)); Junsik Jung (Korea Advanced Institute of Science and Technology (KAIST)); Sooel Son (Korea Advanced Institute of Science and Technology (KAIST)); Sung-Eui Yoon (Korea Advanced Institute of Science and Technology (KAIST)) PDF Poster Video (Right click to download) |
| 284 |
Multimodal Feature Collaboration and Fusion for Fine-Grained Action Recognition Xinyu Bian (Beijing University of Posts and Telecommunications); Dongliang Chang (Beijing University of Posts and Telecommunications); Yuqi Yang (Beijing University of Posts and Telecommunications); Lei Chen (Tsinghua University); Zhanyu Ma (Beijing University of Posts and Telecommunications) PDF Poster Video (Right click to download) |
| 288 |
Benchmarking Microsaccade Recognition with Event Cameras: A Novel Dataset and Evaluation Waseem Shariff (University of Galway); Timothy Hanley (University of Galway); Maciej Stec (University of Galway); Hossein Javidnia (University of Dublin); Peter Corcoran (University of Galway) PDF Poster Video (Right click to download) |
| 289 |
CHIP: A multi-sensor dataset for 6D pose estimation of chairs in industrial settings Mattia Nardon (Bruno Kessler Foundation); Mikel Mujika Agirre (Ikerlan); Ander González Tomé (Ikerlan); Daniel Sedano Algarabel (Ikerlan); Josep Rueda Collell (Ikerlan); Ana Paola Caro (Andreu World); Andrea Caraffa (Bruno Kessler Foundation); Fabio Poiesi (Bruno Kessler Foundation); Paul Ian Chippendale (Bruno Kessler Foundation); Davide Boscaini (Bruno Kessler Foundation) PDF Poster Video (Right click to download) |
| 291 |
Detection Transformers Under the Knife: A Neuroscience-Inspired Approach to Ablations Nils Hütten (Bergische Universität Wuppertal); Florian Hölken (Bergische Universität Wuppertal); Hasan Tercan (Bergische Universität Wuppertal); Tobias Meisen (Bergische Universität Wuppertal) PDF Poster |
| 292 |
SVAC: Scaling Is All You Need For Referring Video Object Segmentation Li Zhang (Columbia University); Haoxiang Gao (Carnegie Mellon University); Zhihao Zhang (Columbia University); Luoxiao Huang (New York University); Tao Zhang (Wuhan University) PDF Poster Video (Right click to download) |
| 294 |
Task Progressive Curriculum Learning for Robust Visual Question Answering Ahmed Akl (Griffith University); Abdelwahed Khamis (CSIRO); Zhe Wang (Griffith University); Ali Cheraghian (CSIRO); Sara Khalifa (Queensland University of Technology); Kewen Wang (Griffith University) PDF Poster Video (Right click to download) |
| 310 |
PME3D: An Adaptive and Efficient Multi-modal Feature Extraction Plug-in for 3D Object Detection Tianyi Yu (University of Glasgow); Lei Yu (Beihang University) PDF Poster Video (Right click to download) |
| 313 |
OctreeNCA: Single-Pass 184 MP Segmentation on Consumer Hardware Nick Lemke (Technische Universität Darmstadt); John Kalkhof (Technical University of Darmstadt); Niklas Babendererde (Technische Universität Darmstadt); Anirban Mukhopadhyay (Technical University of Darmstadt) PDF Poster Video (Right click to download) |
| 320 |
Extreme Model Compression with Structured Sparsity at Low Precision Dan Liu (McGill University); Nikita Dvornik (Palona AI); Xue Liu (Mohamed bin Zayed University of Artificial Intelligence) PDF Poster Video (Right click to download) |
| 328 |
DepthHMR: Leveraging Depth Around Humans for Multi-Human Mesh Generation Nikhil Sharma (University of Illinois Chicago); Jiachen Tao (University of Illinois Chicago); Junyi Wu (University of Illinois Chicago); Yan Yan (University of Illinois Chicago) PDF Poster |
| 330 |
FaceCrafter: Identity-Conditional Diffusion with Disentangled Control over Facial Pose, Expression, and Emotion Kazuaki Mishima (Institute of Science Tokyo); Antoni Bigata Casademunt (Imperial College London); Stavros Petridis (Imperial College London); Maja Pantic (Imperial College London); Kenji Suzuki (Institute of Science Tokyo) PDF Poster Video (Right click to download) |
| 333 |
Fast Self-Supervised depth and mask aware Association for Multi-Object Tracking Milad Khanchi (Concordia University); Maria Amer (Concordia University); Charalambos Poullis (Concordia University) PDF Poster Video (Right click to download) |
| 334 |
DAOVI: Distortion-Aware Omnidirectional Video Inpainting Ryosuke Seshimo (Keio University); Mariko Isogawa (Keio University) PDF Poster Video (Right click to download) |
| 338 |
GLip: A Global-Local Integrated Progressive Framework for Robust Visual Speech Recognition Tianyue Wang (University of Chinese Academy of Sciences); Shuang Yang (University of Chinese Academy of Sciences); Shiguang Shan (University of Chinese Academy of Sciences); Xilin CHEN (University of Chinese Academy of Sciences) PDF Poster Video (Right click to download) |
| 339 |
Events Meet Dynamic Mode Decomposition: Capturing the Spatiotemporal Dynamics of Moving Objects Zhouning Du (AISIN CORPORATION); Israr Ulhaq (AISIN CORPORATION); Thanh Thi Huyen Phan (AISIN CORPORATION); Yuichiro Yoshimura (AISIN CORPORATION); Jigyasa Chand (AISIN CORPORATION); Truong Vinh Truong Duy (AISIN CORPORATION) PDF Poster Video (Right click to download) |
| 342 |
Flatness-aware Curriculum Learning via Adversarial Difficulty Hiroaki Aizawa (Hiroshima University); Yoshikazu Hayashi (Gifu University) PDF Poster |
| 346 |
Coarse Attribute Prediction with Task Agnostic Distillation for Real World Clothes Changing ReID Priyank Pathak (University of Central Florida); Yogesh S Rawat (University of Central Florida) PDF Poster Video (Right click to download) |
| 348 |
PosBridge: Multi-View Positional Embedding Transplant for Identity-Aware Image Editing PEILIN XIONG (The University of Electro-Communications); Junwen Chen (The University of Electro-Communications); HONGHUI YUAN (The University of Electro-Communications); Keiji Yanai (The University of Electro-Communications) PDF Poster Video (Right click to download) |
| 354 |
Dual-Branch Network via Multiple Illumination-Aware Representation Learning for Steel Surface Defect Classification Yong Seok Oh (Dongguk University); Min Geol Kim (Sogang University); Bogyeong Kim (Dongguk University); Jun Young Kim (Dongguk University); Hyeongseob Jo (Dongguk University); Jae Hyeon Park (Dongguk University); Gyoomin Lee (Dongguk University); Sung In Cho (Sogang University) PDF Poster Video (Right click to download) |
| 357 |
What Can We Learn from Harry Potter? An Exploratory Study of Visual Representation Learning from Atypical Videos Qiyue Sun (University of Birmingham); Qiming Huang (University of Birmingham); Yang Yang (Shandong University); Hongjun Wang (Shandong University); Jianbo Jiao (University of Birmingham) PDF Poster |
| 358 |
Pose-Robust Calibration Strategy for Point-of-Gaze Estimation on Mobile Phones Yujie Zhao (Institute of Computing Technology, Chinese Academy of Sciences); Jiabei Zeng (Institute of Computing Technology, Chinese Academy of Sciences); Shiguang Shan (Institute of Computing Technology, Chinese Academy of Sciences) PDF Poster Video (Right click to download) |
| 359 |
SteerPose: Simultaneous Extrinsic Camera Calibration and Matching from Articulation Sang-Eun Lee (Kyoto University); Ko Nishino (Kyoto University); Shohei Nobuhara (Kyoto Institute of Technology) PDF Poster |
| 364 |
Learning from Silence and Noise for Visual Sound Source Localization Xavier Juanola (Universitat Pompeu Fabra); Giovana Morais (New York University); Magdalena Fuentes (New York University); Gloria Haro (Universitat Pompeu Fabra) PDF Poster Video (Right click to download) |
| 366 |
HuGeDiff: 3D Human Generation via Diffusion with Gaussian Splatting Maksym Ivashechkin (University of Surrey); Oscar Mendez (University of Surrey); Richard Bowden (University of Surrey) PDF Poster Video (Right click to download) |
| 367 |
RUSplatting: Robust 3D Gaussian Splatting for Sparse-View Underwater Scene Reconstruction Zhuodong Jiang (University of Bristol); Haoran Wang (University of Bristol); Guoxi Huang (University of Bristol); Brett Seymour (National Park Service); Nantheera Anantrasirichai (University of Bristol) PDF Poster |
| 370 |
RGB-Event Fusion for Robust Lane Detection Jingtao Dong (Beijing Institute of Technology); Hao Zhuang (Beijing Institute of Technology); Hao Yang (Beijing Institute of Technology); Liyuan Pan (Beijing Institute of Technology) PDF Poster Video (Right click to download) |
| 373 |
Back To The Drawing Board: Rethinking Scene-Level Sketch-Based Image Retrieval Emil Demić (University of Ljubljana); Luka Čehovin Zajc (University of Ljubljana) PDF Poster Video (Right click to download) |
| 376 |
Multi-Rationale Explainable Object Recognition via Contrastive Conditional Inference Ali Rasekh (L3S Research Center); Sepehr Kazemi Ranjbar (Independent Researcher); Simon Gottschalk (L3S Research Center) PDF Poster Video (Right click to download) |
| 381 |
Permutation-Invariant Polar Harmonic Pooling for Point-based Neural Networks Jaspreet Singh Maan (University of Lincoln); Grzegorz Cielniak (University of Lincoln) PDF Poster Video (Right click to download) |
| 391 |
Depth Inconsistency-based spatial-channel attention gate for Mirror Segmentation Ritsuki Kurohiji (Wakayama University); Hirotaka Hachiya (Wakayama University) PDF Poster Video (Right click to download) |
| 392 |
UMM: A Unified Multi-Modal Model for Low-Level Vision Tasks with Dual-Driven Prompting Ziqi Luo (South University of Science and Technology); Jinxiang Lai (The Hong Kong University of Science and Technology); Ruitao Chen (Southern University of Science and Technology); Jinyu Yang (tapall.ai); Bin-Bin Gao (Tencent); Qiang Nie (The Hong Kong University of Science and Technology); Jun Liu (Tencent YouTu Lab); Jinfan Wang (Southern University of Science and Technology); Feng Zheng (Southern University of Science and Technology) PDF Poster Video (Right click to download) |
| 398 |
Knowledge Distillation via Cross Supervising with Attention for Remote Sensing Object Detection Kefan Zhan (Xiangtan University); An Luo (Xiangtan University); Yunpeng Zeng (Xiangtan University); Jiaxin Li (Xiangtan University); Yuan Zhang (Xiangtan University); Kai Hu (Xiangtan University) PDF Poster Video (Right click to download) |
| 399 |
3D Shape Reconstruction from Autonomous Driving Radars Samah Hussein (École Polytechnique Fédérale de Lausanne (EPFL)); Junfeng Guan (École Polytechnique Fédérale de Lausanne (EPFL), Bosch Research); Swathi Shree Narashiman (École Polytechnique Fédérale de Lausanne (EPFL)); Saurabh Gupta (University of Illinois Urbana-Champaign); Haitham Al Hassanieh (École Polytechnique Fédérale de Lausanne (EPFL)) PDF Poster Video (Right click to download) |
| 402 |
Log NeRF: Comparing Spaces for Learning Radiance Fields Sihe Chen (Northeastern University); Luv Verma (Northeastern University); Bruce A. Maxwell (Northeastern University) PDF Poster Video (Right click to download) |
| 404 |
Exploring Histogram-based Color Constancy David R. Treadwell IV (Northeastern University); Yunxuan Rao (Northeastern University); Daniel Y. Bi (Northeastern University); Bruce A. Maxwell (Northeastern University) PDF Poster Video (Right click to download) |
| 405 |
HVLO-YOLO: An Ultra-Lightweight Detection Model for High-voltage Line Obstacles Weichao Pan (Shandong Jianzhu University); Xu Wang (Shandong Jianzhu University); Chengze Lv (Shandong Jianzhu University); Zicheng Lin (Shandong Jianzhu University); Gongrui Wang (Shandong Jianzhu University); Xuening Zhang (Harbin Institute of Technology); Yi Sun (University of Ulster); Xingbo Liu (Shandong Jianzhu University) PDF Poster Video (Right click to download) |
| 408 |
Multimodal Hate Detection Using Dual-Stream Graph Neural Networks Jiangbei Yue (University of Exeter); Shuonan Yang (University of Exeter); Tailin Chen (University of Exeter); Jianbo Jiao (University of Birmingham); Zeyu Fu (University of Exeter) PDF Poster Video (Right click to download) |
| 416 |
Beyond Softmax: Dual-Branch Sigmoid Architecture for Accurate Class Activation Maps Yoojin Oh (Ewha Womans University); Junhyug Noh (Ewha Womans University) PDF Poster Video (Right click to download) |
| 423 |
JDATT: A Joint Distillation Framework for Atmospheric Turbulence Mitigation and Target Detection Zhiming Liu (University of Bristol); Paul Hill (University of Bristol); Nantheera Anantrasirichai (University of Bristol) PDF Poster |
| 427 |
Interactive Occlusion Boundary Estimation through Exploitation of Synthetic Data Lintao XU (Université Gustave Eiffel); Chaohui Wang (Université Gustave Eiffel) PDF Poster Video (Right click to download) |
| 444 |
Answering from Sure to Uncertain: Uncertainty-Aware Curriculum Learning for Video Question Answering Haopeng Li (The University of Melbourne); Mohammed Bennamoun (University of Western Australia); Jun Liu (Lancaster University); Hossein Rahmani (Lancaster University); Qiuhong Ke (Monash University) PDF Poster Video (Right click to download) |
| 445 |
Frequency-Temporal Feature Integration for Compressed Video Action Recognition Zhou Jiang wan (Beijing University of Posts and Telecommunications); Yue Ming (Beijing University of Posts and Telecommunications) PDF Poster Video (Right click to download) |
| 453 |
HalfMix Augmentation and Regularized Dual-Path Learning for Cross-Domain Gaze Estimation Jiuk Hong (Kyungpook National University); Heechul Jung (Kyungpook National University) PDF Poster Video (Right click to download) |
| 457 |
A Novel Local Focusing Mechanism for Deepfake Detection Generalization Mingliang Li (Jiangxi Normal University); Hanxi Li (Jiangxi Normal University); Lin Yuanbo Wu (The University of Warwick); Changhong Liu (Jiangxi Normal University) PDF Poster Video (Right click to download) |
| 458 |
Intra-Modal Divergence-Weighted Distillation for Vision-Language Models Addad Youva (Université de Caen Basse Normandie); Alexis Lechervy (Université de Caen Basse Normandie); Frédéric Jurie (Université de Caen Normandie) PDF Poster Video (Right click to download) |
| 461 |
FSF3A: Federated Spatial Feature Alignment and Adaptive Aggregation for Heterogeneous Brain Tumor Segmentation Peketi Divya (Indian Institute of Technology Hyderabad); C Krishna Mohan (Indian Institute of Technology Hyderabad); Sobhan Babu (Indian Institute of Technology Hyderabad); Sumanth Yenduri (Sam Houston State University) PDF Poster Video (Right click to download) |
| 478 |
Bézier Curve-Based Stroke Extraction for Handwritten Characters Yuxuan TENG (Tokyo University of Science); Takuya Matsuzaki (Tokyo University of Science) PDF Poster Video (Right click to download) |
| 479 |
CleverDistiller: Simple and Spatially Consistent Cross-modal Distillation Hariprasath Govindarajan (Linköping University, Qualcomm Inc); Maciej Wozniak (KTH Royal Institute of Technology); Marvin Klingner (Qualcomm Inc); Camille Maurice (Qualcomm Inc); B Ravi Kiran (Qualcomm Inc); Senthil Yogamani (Qualcomm Inc) PDF Poster Video (Right click to download) |
| 486 |
FreqSelect: Frequency-Aware fMRI-to-Image Reconstruction Junliang Ye (Australian National University); Lei Wang (Griffith University); Md Zakir Hossain (Curtin University) PDF Poster |
| 493 |
Audio-Visual Separation with Hierarchical Fusion and Representation Alignment Han Hu (University of Birmingham); Dongheng Lin (University of Birmingham); Qiming Huang (University of Birmingham); Yuqi Hou (University of Birmingham); Hyung Jin Chang (University of Birmingham); Jianbo Jiao (University of Birmingham) PDF Poster |
| 495 |
An Explorative Study on Abstract Images and Visual Representations Learned from Them Haotian LI (University of Birmingham); Jianbo Jiao (University of Birmingham) PDF Poster Video (Right click to download) |
| 499 |
One target to align them all: LiDAR, RGB and event cameras extrinsic calibration for Autonomous Driving Andrea Bertogalli (Politecnico di Milano); Giacomo Boracchi (Politecnico di Milano); Luca Magri (Politecnico di Milano) PDF Poster Video (Right click to download) |
| 500 |
UFD-KD: Unified Frequency Decoupled Knowledge Distillation Lu Sihan (Beijing Institute of Technology); Yang Zheng (Institute of automation, Chinese Academy of Sciences); Jie Liu (Beijing University of Posts and Telecommunications); Zhenghao Xi (Shanghai University of Engineering Science) PDF Poster Video (Right click to download) |
| 502 |
DefectGPT: Towards Multi-Class Defect Detection with Limited Electrical Samples Zhuoyi Lin (Institute for Infocomm Research, Agency for Science, Technology and Research (A*STAR)); Kaixin Xu (Institute for Infocomm Research, Agency for Science, Technology and Research (A*STAR)); Aye Phyu Phyu Aung (Institute for Infocomm Research, Agency for Science, Technology and Research (A*STAR)); Wen Qiu (Advanced Micro Devices (Singapore) Pte Ltd); Bernice Zee (Advanced Micro Devices (Singapore) Pte Ltd); Jiann Min Chin (Advanced Micro Devices (Singapore) Pte Ltd); Senthilnath Jayavelu (Institute for Infocomm Research, Agency for Science, Technology and Research (A*STAR)) PDF Poster Video (Right click to download) |
| 507 |
Robust and Label-efficient Deep Waste Detection Hassan Abid (Mohamed bin Zayed University of Artificial Intelligence); Khan Muhammad (Sung Kyun Kwan University); Muhammad Haris Khan (Mohamed Bin Zayed University of Artificial Intelligence) PDF Poster Video (Right click to download) |
| 509 |
Dynamic Convolution and Graph-Coupled Attention for Cross-Subject EEG-Vision Decoding Tianyu Zhang (Durham University); FAN WAN (Tongfang Knowledge Network Digital Technology Co., Ltd. China National Nuclear Corporation); Kaili Sun (Durham University); Xingyu Miao (Durham University); Yueming Sun (University of Durham); Minye Shao (Durham University); Yang Long (Durham University) PDF Poster Video (Right click to download) |
| 511 |
LieMorph: Transformer-based Image Registration Using Flows on Lie Groups Johannes Bostelmann (Universität zu Lübeck); Jan Lellmann (Universität zu Lübeck) PDF Poster Video (Right click to download) |
| 516 |
Making Rotation Averaging Fast and Robust with Anisotropic Coordinate Descent Yaroslava Lochman (Chalmers University of Technology); Carl Olsson (Lund University); Christopher Zach (Chalmers University of Technology) PDF Poster Video (Right click to download) |
| 517 |
Catching the Unknown with Limited Data: Bi-Directional Prompt Tuning in CLIP for Few-Shot Open-Set Adaptation Moloud Abdar (The University of Queensland); Md Mehedi Hasan (Deakin University); Biplab Banerjee (Indian Institute of Technology Bombay); Abbas Khosravi (Deakin University); Pietro Lio (University of Cambridge) PDF Poster Video (Right click to download) |
| 528 |
OmniSegNet: Towards Scalable, Efficient & Universal Medical Image Segmentation Soma Dasgupta (Tata Consultancy Services); Swarnava Dey (Tata Consultancy Services); Arijit Mukherjee (Tata Consultancy Services); Arpan Pal (Tata Consultancy Services) PDF Poster Video (Right click to download) |
| 540 |
OptSplat: Recurrent Optimization for Generalizable Reconstruction and Novel View Renderings Vemburaj Chockalingam Yadav (German Research Center for Artificial Intelligence); Alain Pagani (German Research Center for Artificial Intelligence); Didier Stricker (German Research Center for Artificial Intelligence, RPTU) PDF Poster Video (Right click to download) |
| 543 |
Temporally Compressed 3D Gaussian Splatting for Dynamic Scenes Saqib Javed (CVLab, EPFL); Ahmad Jarrar Khan (CVLab, EPFL); Corentin Dumery (CVLab, EPFL); Chen Zhao (CVLab, EPFL); Mathieu Salzmann (CVLab, EPFL, Swiss Data Science Center) PDF Poster Video (Right click to download) |
| 546 |
Multi-Method Ensemble for Out-of-Distribution Detection Lucas Rakotoarivony (Thales, cortAIx Labs) PDF Poster Video (Right click to download) |
| 548 |
Pandora: Articulated 3D Scene Graphs from Egocentric Vision Alan Yu (Massachusetts Institute of Technology); Yun Chang (Massachusetts Institute of Technology); Christopher Xie (Meta Reality Labs); Luca Carlone (Massachusetts Institute of Technology) PDF Poster Video (Right click to download) |
| 558 |
EZIGen: Enhancing zero-shot personalized image generation with precise subject encoding and decoupled guidance Zicheng Duan (The University of Adelaide); Yuxuan Ding (Qualcomm AI Research); Chenhui Gou (Monash University); Ziqin Zhou (The University of Adelaide); Ethan Smith (Leonardo.AI); Lingqiao Liu (The University of Adelaide) PDF Poster |
| 566 |
OpenHuman4D: Open-Vocabulary 4D Human Parsing Keito Suzuki (University of California, San Diego); Bang Du (University of California, San Diego); Runfa Li (University of California, San Diego); Kunyao Chen (Qualcomm Inc); Lei Wang (Qualcomm Inc); Peng Liu (Qualcomm Inc); Ning Bi (Qualcomm Inc); Truong Nguyen (University of California, San Diego) PDF Poster Video (Right click to download) |
| 567 |
Zero-Shot Anomaly Detection with Dual-Branch Prompt Selection Zihan Wang (McGill University); Samira Ebrahimi Kahou (University of Calgary); Narges Armanfard (McGill University) PDF Poster Video (Right click to download) |
| 577 |
Zero-Shot CFC: Fast Real-World Image Denoising based on Cross-Frequency Consistency Yanlin Jiang (Beijing University of Technology); Yuchen Liu (Beijing University of Technology); Mingren Liu (Alibaba Cloud) PDF Poster Video (Right click to download) |
| 588 |
CoT-SD: Chain-of-Thought Semantic Denoising Yanlin Jiang (Beijing University of Technology); Yuchen Liu (Beijing University of Technology); Mingren Liu (Alibaba Cloud) PDF Poster Video (Right click to download) |
| 593 |
PanoHair: Detailed Hair Strand Synthesis on Volumetric Heads Shashikant Verma (Indian Institute of Technology Gandhinagar); Shanmuganathan Raman (Indian Institute of Technology Gandhinagar) PDF Poster Video (Right click to download) |
| 594 |
Video Dataset Condensation with Diffusion Models Zhe Li (FAU Erlangen-Nürnberg); Hadrien Reynaud (Imperial College London); Mischa Dombrowski (FAU Erlangen-Nürnberg); Sarah Cechnicka (Imperial College London); Franciskus Xaverius Erick (FAU Erlangen-Nürnberg); Bernhard Kainz (FAU Erlangen-Nürnberg, Imperial College London) PDF Poster Video (Right click to download) |
| 596 |
TopoMortar: A Dataset to Evaluate Topology Accuracy in Image Segmentation Juan Miguel Valverde (Technical University of Denmark); Motoya Koga (Sojo University); Nijihiko Otsuka (Sojo University); Anders Dahl (Technical University of Denmark) PDF Poster Video (Right click to download) |
| 602 |
Dynamic Try-On: Taming Video Virtual Try-on with Dynamic Attention Mechanism Jun Zheng (SUN YAT-SEN UNIVERSITY); Jing Wang (SUN YAT-SEN UNIVERSITY); Fuwei Zhao (ByteDance Inc.); Xujie Zhang (SUN YAT-SEN UNIVERSITY); Xiaodan Liang (SUN YAT-SEN UNIVERSITY) PDF Poster Video (Right click to download) |
| 603 |
Dual-Stream Adapters for Open-Set Segmentation in Driving Scenes Shyam Nandan Rai (Politecnico di Torino); Massimiliano Mancini (University of Trento); Barbara Caputo (Politecnico di Torino); Carlo Masone (Politecnico di Torino) PDF Poster Video (Right click to download) |
| 610 |
HERO-VQL: Hierarchical, Egocentric and Robust Visual Query Localization Joohyun Chang (Kyung Hee University); Soyeon Hong (Kyung Hee University); Hyogun Lee (Kyung Hee University); Seong Jong Ha (AI R&D Division, CJ Group); Dongho Lee (AI R&D Division, CJ Group); Seong Tae Kim (Kyung Hee University); Jinwoo Choi (Kyung Hee University) PDF Poster Video (Right click to download) |
| 614 |
PerSense: Training-Free Personalized Instance Segmentation in Dense Images Muhammad Ibraheem Siddiqui (Mohamed bin Zayed University of Artificial Intelligence); Muhammad Umer Sheikh (Mohamed bin Zayed University of Artificial Intelligence); Hassan Abid (Mohamed bin Zayed University of Artificial Intelligence); Muhammad Haris Khan (Mohamed Bin Zayed University of Artificial Intelligence) PDF Poster Video (Right click to download) |
| 626 |
Beyond Gloss: A Hand-Centric Framework for Gloss-Free Sign Language Translation Sobhan Asasi (University of Surrey); Mohamed Ilyes Lakhal (University of Surrey); Ozge Mercanoglu Sincan (University of Surrey); Richard Bowden (University of Surrey) PDF Poster |
| 627 |
TopoDiT-3D: Topology-Aware Diffusion Transformer with Bottleneck Structure for 3D Point Cloud Generation Zechao Guan (Southeast University); Feng Yan (Meituan Company); Shuai Du (Southeast University); Lin Ma (Meituan Company); Qingshan Liu (Southeast University) PDF Poster Video (Right click to download) |
| 628 |
MuDG: Taming Multi-modal Diffusion with Gaussian Splatting for Urban Scene Reconstruction Yingshuang Zou (Tsinghua University); Yikang Ding (Megvii Technology); Chuanrui Zhang (Tsinghua University); Jiazhe Guo (Tsinghua University); Bohan Li (Shanghai Jiaotong University); Xiaoyang Lyu (University of Hong Kong); Feiyang Tan (Mach Drive); Xiaojuan Qi (University of Hong Kong); Haoqian Wang (Tsinghua University) PDF Poster Video (Right click to download) |
| 630 |
Incremental Multi-Scene Modeling via Continual Neural Graphics Primitives Prajwal Singh (IIT Gandhinagar); Ashish Tiwari (IIT Gandhinagar); Gautam Vashishtha (IIT Gandhinagar); Shanmuganathan Raman (IIT Gandhinagar) PDF Poster Video (Right click to download) |
| 632 |
Segmentation Assisted Incremental Test Time Adaptation in an Open World Manogna Sreenivas (Indian Institute of Science); Soma Biswas (Indian Institute of Science) PDF Poster Video (Right click to download) |
| 644 |
Solving Zero-Shot 3D Visual Grounding as Constraint Satisfaction Problems Qihao Yuan (University of Groningen); Kailai Li (University of Groningen); Jiaming Zhang (Karlsruhe Institute of Technology) PDF Poster Video (Right click to download) |
| 646 |
Towards Data-Efficient Medical Imaging: A Generative and Semi-Supervised Framework Mosong Ma (Imperial College London); Tania Stathaki (Imperial College London); Michalis Lazarou (University of Surrey) PDF Poster Video (Right click to download) |
| 649 |
Towards Open-Vocabulary Multimodal 3D Object Detection with Attributes Xinhao Xiang (University of California, Davis); Kuan-Chuan Peng (Mitsubishi Electric Research Laboratories (MERL)); Suhas Lohit (Mitsubishi Electric Research Laboratories (MERL)); Michael J. Jones (Mitsubishi Electric Research Laboratories (MERL)); Jiawei Zhang (University of California, Davis) PDF Poster Video (Right click to download) |
| 650 |
Prompt-Informed Reinforcement Learning for Visual Coverage Path Planning Venkat Margapuri (Villanova University) PDF Poster Video (Right click to download) |
| 656 |
3D Curvix: From Multiview 2D Edges to 3D Curve Segments Qiwu Zhang (Brown University); Chiang-Heng Chien (Brown University); Ricardo Fabbri (Universidade do Estado do Rio de Janeiro); Benjamin Kimia (Brown University) PDF Poster Video (Right click to download) |
| 661 |
Leveraging Sparsity for Efficient Inference of High-Resolution Vision Foundation Models Xin Xu (University of Illinois Urbana-Champaign); Jason Kuen (Adobe); Brian L Price (Adobe); Kangning Liu (Adobe); Zijun Wei (Adobe); Yu-Xiong Wang (University of Illinois Urbana-Champaign) PDF Poster Video (Right click to download) |
| 664 |
Canonical Makeup Transfer Xinyu Lin (CUHK-Shenzhen); Kun Zhou (Shenzhen University); Xiaoguang Han (CUHK-Shenzhen); Jiangbo Lu (SmartMore Corporation) PDF Poster Video (Right click to download) |
| 666 |
Mitigating Hallucinations in Multimodal LLMs via Object-aware Preference Optimization Alberto Compagnoni (University of Modena and Reggio Emilia); Davide Caffagni (University of Modena and Reggio Emilia); Nicholas Moratelli (University of Modena and Reggio Emilia); Lorenzo Baraldi (University of Modena and Reggio Emilia); Marcella Cornia (University of Modena and Reggio Emilia); Rita Cucchiara (University of Modena and Reggio Emilia) PDF Poster Video (Right click to download) |
| 668 |
MLoRQ: Bridging Low-Rank and Quantization for Transformer Compression Ofir Gordon (Sony Semiconductor Israel); Ariel Lapid (Sony Semiconductor Israel); Elad Cohen (Sony Semiconductor Israel); Yarden Yagil (Sony Semiconductor Israel); Arnon Netzer (Sony Semiconductor Israel); Hai Victor Habi (Sony Semiconductor Israel) PDF Poster Video (Right click to download) |
| 675 |
Jack of many Faces: A Step Towards Facial Expression and Physiological State Analysis with a Single Network Abdullah Tariq (Edith Cowan University); Martin Masek (Edith Cowan University); R Muhammad Atif Azad (Birmingham City University); Syed Zulqarnain Gilani (Edith Cowan University) PDF Poster Video (Right click to download) |
| 680 |
On the Role of Individual Differences in Current Approaches to Computational Image Aesthetics Li-Wei Chen (KU Leuven); Ombretta Strafforello (KU Leuven); Anne-Sofie Maerten (KU Leuven); Tinne Tuytelaars (KU Leuven); Johan Wagemans (KU Leuven) PDF Poster |
| 690 |
MonoGSDF: Exploring Monocular Geometric Cues for Gaussian Splatting-Guided Implicit Surface Reconstruction Kunyi Li (Technical University of Munich); Michael Niemeyer (Google); Zeyu Chen (Tsinghua University); Nassir Navab (Technical University of Munich); Federico Tombari (Google) PDF Poster Video (Right click to download) |
| 694 |
Occam’s LGS: An Efficient Approach for Language Gaussian Splatting Jiahuan Cheng (Johns Hopkins University); Jan-Nico Zaech (INSAIT, Sofia University); Luc Van Gool (INSAIT, Sofia University); Danda Pani Paudel (INSAIT, Sofia University) PDF Poster |
| 698 |
SAMWave: Adapting Segment Anything Model to difficult tasks Saurabh Yadav (IIIT Delhi); Avi Gupta (IIIT Delhi); Koteswar Rao Jerripothula (IIT Kanpur) PDF Poster Video (Right click to download) |
| 700 |
Q-Align: Alleviating Attention Leakage in Zero-Shot Appearance Transfer via Query-Query Alignment Namu Kim (KT Corporation); Wonbin Kweon (University of Illinois Urbana-Champaign); Minsoo Kim (Pohang University of Science and Technology); Hwanjo Yu (Pohang University of Science and Technology) PDF Poster Video (Right click to download) |
| 701 |
Category-level Text-to-Image Retrieval Improved: Bridging the Domain Gap with Diffusion Models and Vision Encoders Faizan Farooq Khan (King Abdullah University of Science and Technology); Vladan Stojnić (Czech Technical University in Prague); Zakaria Laskar (Czech Technical University in Prague); Mohamed Elhoseiny (King Abdullah University of Science and Technology); Giorgos Tolias (Czech Technical University in Prague) PDF Poster Video (Right click to download) |
| 704 |
JOG3R: Towards 3D-Consistent Video Generators Chun-Hao Paul Huang (Adobe Systems); Niloy J. Mitra (Adobe Systems); Hyeonho Jeong (Adobe Systems); Jae Shin Yoon (Adobe Systems); Duygu Ceylan (Adobe Systems) PDF Poster Video (Right click to download) |
| 705 |
ALSA: Anchors in Logit Space for Out-of-Distribution Accuracy Estimation Chenzhi Liu (The University of Queensland); Mahsa Baktashmotlagh (The University of Queensland); Yanran Tang (The University of Queensland); Zi Huang (University of Queensland); Ruihong Qiu (The University of Queensland) PDF Poster Video (Right click to download) |
| 707 |
Catch Your Concepts: A Flexible Concept Locator for Interpretable Visual Recognition Qiyang Wan (Institute of Computing Technology, Chinese Academy of Sciences (CAS)); Ruiping Wang (Institute of Computing Technology, Chinese Academy of Sciences (CAS)); Chengzhi Gao (Institute of Computing Technology, Chinese Academy of Sciences (CAS)); Xilin CHEN (Institute of Computing Technology, Chinese Academy of Sciences (CAS)) PDF Poster Video (Right click to download) |
| 709 |
TAPM-Net: Trajectory-Aware Perturbation Modeling for Infrared Small Target Detection Hongyang Xie (The University of Warwick); Hongyang He (The University of Warwick); Victor Sanchez (The University of Warwick) PDF Poster |
| 716 |
Size-aware Contrastive Imitation Learning for Language-conditioned Multi-task Robotic Manipulation Jiakai Huang (South China Normal University); Weiping Zheng (South China Normal University) PDF Poster Video (Right click to download) |
| 717 |
From Open Vocabulary to Open World: Teaching Vision Language Models to Detect Novel Objects Zizhao Li (University of Melbourne); Zhengkang Xiang (University of Melbourne); Joseph West (University of Melbourne); Kourosh Khoshelham (University of Melbourne) PDF Poster Video (Right click to download) |
| 718 |
PADS: Plug-and-Play 3D Human Pose Analysis via Diffusion Generative Modeling Haorui Ji (Australian National University); Hongdong Li (Australian National University) PDF Poster |
| 721 |
Gromov Wasserstein Optimal Transport for Semantic Correspondences Francis Snelgar (Australian National University); Stephen Gould (Australian National University); Ming Xu (Australian National University); Liang Zheng (Australian National University); Akshay Asthana (Seeing Machines) PDF Poster Video (Right click to download) |
| 722 |
Unsupervised Multimodal Deepfake Detection Through Explicit Intra-Modal and Cross-Modal Inconsistency Discovery Mulin Tian (University of Southern California); Mahyar Khayatkhoei (University of Southern California); Joe Mathai (University of Southern California); Wael AbdAlmageed (Clemson University) PDF Poster Video (Right click to download) |
| 732 |
TryOffDiff: Virtual-Try-Off via High-Fidelity Garment Reconstruction using Diffusion Models Riza Velioglu (Universität Bielefeld); Petra Bevandić (Universität Bielefeld); Robin Chan (Technische Universität Berlin); Barbara Hammer (Universität Bielefeld) PDF Poster Video (Right click to download) |
| 736 |
EWMBench: Evaluating Scene, Motion, and Semantic Quality in Embodied World Models Yue Hu (AgiBot, Harbin Institute of Technology); Siyuan Huang (Shanghai Jiao Tong University); Yue Liao (National University of Singapore); Shengcong Chen (AgiBot); Pengfei Zhou (AgiBot); Liliang Chen (AgiBot); Guanghui Ren (AgiBot); Maoqing Yao (AgiBot) PDF Poster Video (Right click to download) |
| 737 |
Self-Intersection-Aware 3D Human Motion Generation Using an Efficient Human Sphere Proxy Pascal Herrmann (Bosch Research); Maarten Bieshaar (Bosch Research); Dennis Mack (Bosch Research); Paul Robert Herzog (Bosch Research); Juergen Gall (University of Bonn) PDF Poster Video (Right click to download) |
| 745 |
Mapping like a Skeptic: Probabilistic BEV Projection for Online HD Mapping Fatih Erdoğan (Koç University); Merve Rabia Barin (Koç University); Fatma Güney (Koç University) PDF Poster Video (Right click to download) |
| 751 |
TPA: Temporal Prompt Alignment for Fetal Congenital Heart Defect Classification Darya Taratynova (Mohamed bin Zayed University of Artificial Intelligence); Alya Almsouti (Mohamed bin Zayed University of Artificial Intelligence); Beknur Kalmakhanbet (Mohamed bin Zayed University of Artificial Intelligence); Numan Saeed (Mohamed bin Zayed University of Artificial Intelligence); Mohammad Yaqub (Mohamed bin Zayed University of Artificial Intelligence) PDF Poster Video (Right click to download) |
| 753 |
Boosting Camera Motion Control for Video Diffusion Transformers Soon Yau Cheong (University of Surrey); Duygu Ceylan (Adobe Research); Armin Mustafa (University of Surrey); Andrew Gilbert (University of Surrey); Chun-Hao Paul Huang (Adobe Research) PDF Poster Video (Right click to download) |
| 755 |
Audio-Guided Visual Editing with Complex Multi-Modal Prompts Hyeonyu Kim (MAUM AI Inc.); Seokhoon Jeong (Ulsan National Institute of Science and Technology); Seonghee Han (Ulsan National Institute of Science and Technology); Chanhyuk Choi (Ulsan National Institute of Science and Technology); Taehwan Kim (Ulsan National Institute of Science and Technology) PDF Poster Video (Right click to download) |
| 762 |
Learning a Neural Association Network for Self-supervised Multi-Object Tracking Shuai Li (University of Bonn); Michael Burke (Monash University); Subramanian Ramamoorthy (University of Edinburgh); Juergen Gall (University of Bonn) PDF Poster Video (Right click to download) |
| 766 |
Specify and Edit: Overcoming Ambiguity in Text-Based Image Editing Ekaterina Iakovleva (Télécom Paris); Fabio Pizzati (MBZUAI); Philip Torr (University of Oxford); Stéphane Lathuilière (INRIA) PDF Poster Video (Right click to download) |
| 768 |
Interpretable Text-Guided Image Clustering via Iterative Search Bingchen Zhao (University of Edinburgh); Oisin Mac Aodha (University of Edinburgh) PDF Poster |
| 770 |
DTFSal: Audio-Visual Dynamic Token Fusion for Video Saliency Prediction Kiana Hooshanfar (University of Tehran); Alireza Hosseini (University of Tehran); Mona Ahmadian (University of Surrey); Ahmad Kalhor (University of Tehran); Babak N Araabi (University of Tehran) PDF Poster Video (Right click to download) |
| 782 |
Is Safety Checker Still Safe? A Study on the Covert NSFW Text Xin Li (Shanghai Fudan Microelectronics Group CO.,LTD.); Kai Chen (Shanghai Fudan Microelectronics Group CO.,LTD.); XUE YANG (Shanghai Fudan Microelectronics Group CO.,LTD.); Weijun Shan (Shanghai Fudan Microelectronics Group CO.,LTD.); Jun Yu (Shanghai Fudan Microelectronics Group CO.,LTD.); Qing Li (Shanghai Fudan Microelectronics Group CO.,LTD.) PDF Poster |
| 784 |
UDT: Unsupervised Discovery of Transformations between Fine-Grained Classes in Diffusion Models Youngjae Choi (Soongsil University); Hyunsuh Koh (Soongsil University); Hojae Jeong (Soongsil University); ByungKwan Chae (Soongsil University); Sungyong Park (Soongsil University); Heewon Kim (Soongsil University) PDF Poster Video (Right click to download) |
| 786 |
TAG: A Simple Yet Effective Temporal-Aware Approach for Zero-Shot Video Temporal Grounding Jin-Seop Lee (Sungkyunkwan University); SungJoon Lee (Sungkyunkwan University); Jaehan Ahn (Sungkyunkwan University); YunSeok Choi (Sungkyunkwan University); Jee-Hyong Lee (Sungkyunkwan University) PDF Poster Video (Right click to download) |
| 787 |
${C}^{3}$-GS: Learning Context-aware, Cross-dimension, Cross-scale Feature for Generalizable Gaussian Splatting Yuxi Hu (Graz University of Technology); Jun Zhang (Graz University of Technology); Kuangyi Chen (Graz University of Technology); Zhe Zhang (Peking University); Friedrich Fraundorfer (Graz University of Technology) PDF Poster Video (Right click to download) |
| 788 |
Vision Backbone Efficient Selection for Image Classification in Low-Data Regimes Joris Guerin (IRD); Shray Bansal (Georgia Institute of Technology); Amirreza Shaban (Field AI); Paulo Mann (Federal University of Rio de Janeiro); Harshvardhan Gazula (Massachusetts Institute of Technology) PDF Poster Video (Right click to download) |
| 811 |
DEAD: Data-Efficient Audiovisual Dubbing using Neural Rendering Priors Jack Saunders (University of Bath); Vinay Namboodiri (University of Bath) PDF Poster Video (Right click to download) |
| 824 |
A Unified Framework for High-Frame-Rate HDR Video Synthesis Hue Nguyen (York University); Trevor Dalton Canham (York University); Michael S. Brown (York University) PDF Poster Video (Right click to download) |
| 825 |
Modular Embedding Recomposition for Incremental Learning Aniello Panariello (University of Modena and Reggio Emilia); Emanuele Frascaroli (University of Modena and Reggio Emilia); Pietro Buzzega (University of Modena and Reggio Emilia); Lorenzo Bonicelli (University of Modena and Reggio Emilia); Angelo Porrello (University of Modena and Reggio Emilia); Simone Calderara (University of Modena and Reggio Emilia) PDF Poster Video (Right click to download) |
| 827 |
MonoTracker: Monocular RGB-Only 6D Tracking of Unknown Object Zilong Deng (ETH Zürich); Shaochang Tan (Universität Zürich); Zuria Bauer (ETH Zürich); Daniel Barath (ETH Zürich); Marc Pollefeys (Microsoft) PDF Poster Video (Right click to download) |
| 828 |
Leveraging Modality Tags for Enhanced Cross-Modal Video Retrieval Adriano Fragomeni (University of Bristol); Dima Damen (University of Bristol); Michael Wray (University of Bristol) PDF Poster Video (Right click to download) |
| 831 |
LOGen: Toward LiDAR Object Generation by Point Diffusion Ellington Kirby (Valeo.ai); Mickael Chen (Valeo.ai); Renaud Marlet (Valeo.ai); Nermin Samet (Valeo.ai) PDF Poster Video (Right click to download) |
| 832 |
SIMULDITEX: Single Image Multiscale & Lightweight Diffusion for Texture Modelling Pierrick Chatillon (Université de Caen Normandie); Julien Rabin (Université de Caen Normandie); David Tschumperlé (Université de Caen Normandie) PDF Poster Video (Right click to download) |
| 835 |
Cloud-Stereo: A Dataset and Benchmark for Reconstructing Atmospheric Clouds from Stereo Images Jacob Lin (University of Oxford); Edward Gryspeerdt (Imperial College London); Ronald Clark (University of Oxford) PDF Poster Video (Right click to download) |
| 850 |
C-SWAP: Explainability-Aware Structured Pruning for Efficient Neural Networks Compression Baptiste Bauvin (Thales); Loïc Baret (Thales); Ola Ahmad (Thales) PDF Poster Video (Right click to download) |
| 851 |
Asymmetric Event-Image Stereo with Temporal Feature Gating and Iterative Structure-Detail Refinement Hao Zhuang (Beijing Institute of Technology); Yan Yang (Australian National University); Liyuan Pan (Beijing Institute of Technology) PDF Poster Video (Right click to download) |
| 852 |
LoFT: LoRA-fused Training Dataset Generation with Few-shot Guidance Jae Myung Kim (University of Tuebingen); Stephan Alaniz (Télécom Paris); Cordelia Schmid (Inria, Ecole normale supérieure); Zeynep Akata (Technische Universität München) PDF Poster Video (Right click to download) |
| 857 |
Lost in Time: A New Temporal Benchmark for VideoLLMs Daniel Cores (University of Santiago de Compostela); Michael Dorkenwald (University of Amsterdam); Manuel Mucientes (Universidad de Santiago de Compostela); Cees G. M. Snoek (University of Amsterdam); Yuki M Asano (University of Technology Nuremberg) PDF Poster Video (Right click to download) |
| 859 |
Improving Multimodal Distillation for 3D Semantic Segmentation under Domain Shift Björn Michele (Université de Bretagne Sud); Alexandre Boulch (valeo.ai); Gilles Puy (valeo.ai); Tuan-Hung VU (valeo.ai); Renaud Marlet (valeo.ai); Nicolas Courty (IRISA) PDF Poster Video (Right click to download) |
| 864 |
Geometry-Aware Diffusion Models for Multiview Scene Inpainting Ahmad Salimi (York University); Tristan Ty Aumentado-Armstrong (York University); Marcus A Brubaker (Google DeepMind); Konstantinos G. Derpanis (York University) PDF Poster Video (Right click to download) |
| 865 |
FootFormer: Estimating Stability from Visual Input Keaton Yukio Kraiger (Pennsylvania State University); Jingjing Li (Pennsylvania State University); Skanda Bharadwaj (Pennsylvania State University); Jesse Scott (Scientific Applications & Research Associates (SARA), Inc.); Robert T. Collins (Pennsylvania State University); Yanxi Liu (Pennsylvania State University) PDF Poster |
| 866 |
Image Recognition with Vision and Language Embeddings of VLMs Illia Volkov (Czech Technical University in Prague); Nikita Kisel (Czech Technical University in Prague); Klara Janouskova (Czech Technical University in Prague); Jiri Matas (Czech Technical University in Prague) PDF Poster |
| 873 |
Exploring Image Representation with Decoupled Classical Visual Descriptors Chenyuan Qu (University of Birmingham); Hao Chen (University of Cambridge); Jianbo Jiao (University of Birmingham) PDF Poster Video (Right click to download) |
| 875 |
Lost in Translation? Vocabulary Alignment for Source-Free Adaptation in Open-Vocabulary Semantic Segmentation Silvio Mazzucco (The Good AI Lab); Carl Persson (The Good AI Lab); Mattia Segu (The Good AI Lab); Pier Luigi Dovesi (The Good AI Lab); Federico Tombari (Google); Luc Van Gool (INSAIT, Sofia University); Matteo Poggi (University of Bologna) PDF Poster Video (Right click to download) |
| 880 |
Distortion-Aware Multi-Object Tracking via Virtual Plane Projection in Overhead Fisheye Cameras Panithi Vanasirikul (OxygenAI); Piyanon Charoenpoonpanich (OxygenAI); Ekapol Chuangsuwanich (Chulalongkorn University) PDF Poster Video (Right click to download) |
| 887 |
QWD-GAN: Quality-aware Wavelet-driven GAN for Unsupervised Medical Microscopy Images Denoising Qijun Yang (The University of Manchester); Yating Huang (The University of Manchester); Lintao Xiang (The University of Manchester); Hujun Yin (The University of Manchester) PDF Poster Video (Right click to download) |
| 896 |
Llama Learns to Direct: DirectorLLM for Human-Centric Video Generation Kunpeng Song (The State University of New Jersey); Tingbo Hou (GenAI at Meta); Zecheng He (GenAI at Meta); Haoyu Ma (GenAI at Meta); Jialiang Wang (GenAI at Meta); Animesh Sinha (GenAI at Meta); Sam Tsai (GenAI at Meta); Yaqiao Luo (GenAI at Meta); Xiaoliang Dai (GenAI at Meta); Li Chen (GenAI at Meta); Xide Xia (GenAI at Meta); Peizhao Zhang (GenAI at Meta); Peter Vajda (GenAI at Meta); Ahmed M. Elgammal (The State University of New Jersey); Felix Juefei-Xu (GenAI at Meta) PDF Poster Video (Right click to download) |
| 898 |
FaceCPT: Toward Cross-Modal Facial Representation Learning with Face-Caption Pre-Training Md Mahedi Hasan (West Virginia University); Shoaib Meraj Sami (West Virginia University); Nasser Nasrabadi (West Virginia University); Jeremy M. Dawson (West Virginia University) PDF Poster Video (Right click to download) |
| 900 |
Time-Scaling State-Space Models for Dense Video Captioning AJ Piergiovanni (Google DeepMind); Ganesh Satish Mallya (Google DeepMind); Dahun Kim (Google DeepMind); Anelia Angelova (Google DeepMind) PDF Poster |
| 902 |
ITC-RWKV: Interactive Tissue–Cell Modeling with Recurrent Key-Value Aggregation for Histopathological Subtyping Yating Huang (The University of Manchester); Qijun Yang (The University of Manchester); Lintao Xiang (The University of Manchester); Hujun Yin (The University of Manchester) PDF Poster Video (Right click to download) |
| 903 |
Toward Robust Audio-Visual Synchronization Detection in Egocentric Video with Sparse Synchronization Events Jordan Voas (The University of Texas at Austin); Wei-Cheng Tseng (The University of Texas at Austin); Benoit Vallade (Amazon Prime Video); Alex Mackin (Amazon Prime Video); David Higham (Amazon Prime Video); David Harwath (The University of Texas at Austin) PDF Poster Video (Right click to download) |
| 914 |
Improving Human Motion Plausibility with Body Momentum Ha Linh Nguyen (National University of Singapore); Tze Ho Elden Tse (National University of Singapore); Angela Yao (National University of Singapore) PDF Poster Video (Right click to download) |
| 915 |
DualDistill: A Unified Cross-Modal Knowledge Distillation Framework for Camera-Based BEV Representation Gaeun Kim (Seoul National University of Science and Technology); Daeil Han (Seoul National University of Science and Technology); Yeong Jun Koh (Chungnam National University); Hanul Kim (Seoul National University of Science and Technology) PDF Poster Video (Right click to download) |
| 922 |
FSLC: Fast Scoring with Learnable Coreset for Zero-shot Industrial Anomaly Detection Songtao Ni (Shanghai Jiao Tong University); Yuxin Li (Shanghai Jiao Tong University); Xu Zhao (Shanghai Jiao Tong University) PDF Poster Video (Right click to download) |
| 931 |
Diffusion Transformer-to-Mamba Distillation for High-Resolution Image Generation Yuan Yao (University of Rochester); Yicong Hong (Adobe Research); Difan Liu (Adobe Research); Long Mai (Adobe Research); Feng Liu (Adobe Research); Jiebo Luo (University of Rochester) PDF Poster Video (Right click to download) |
| 938 |
Grad-CL: Source Free Domain Adaptation with Gradient Guided Feature Disalignment Rini Smita Thakur (Indian Institute of Science Education and Research Bhopal); Rajeev Ranjan Dwivedi (Indian Institute of Science Education and Research Bhopal); Vinod K. Kurmi (Indian Institute of Science Education and Research Bhopal) PDF Poster Video (Right click to download) |
| 940 |
Semi-MoE: Mixture-of-Experts meets Semi-Supervised Histopathology Segmentation Nguyen Lan Vi Vu (Ho Chi Minh City University of Technology); Thanh-Huy Nguyen (Carnegie Mellon University); Thien Nguyen (Posts and Telecommunications Institute of Technology); Daisuke Kihara (Purdue University); Tianyang Wang (University of Alabama at Birmingham); Xingjian Li (Carnegie Mellon University); Min Xu (Carnegie Mellon University) PDF Poster Video (Right click to download) |
| 943 |
HEAL: Learning-Free Source Free Unsupervised Domain Adaptation for Cross-Modality Medical Image Segmentation Yulong Shi (Northeastern University); Jiapeng Li (Northeastern University); Lin Qi (Northeastern University) PDF Poster |
| 948 |
Tracking Meets Large Multimodal Models for Driving Scenario Understanding Ayesha Ishaq (Mohamed bin Zayed University of Artificial Intelligence); Jean Lahoud (Mohamed bin Zayed University of Artificial Intelligence); Fahad Shahbaz Khan (Mohamed bin Zayed University of Artificial Intelligence); Salman Khan (Mohamed bin Zayed University of Artificial Intelligence); Hisham Cholakkal (Mohamed bin Zayed University of Artificial Intelligence); Rao Muhammad Anwer (Mohamed bin Zayed University of Artificial Intelligence) PDF Poster Video (Right click to download) |
| 949 |
Prompt Image to Watch and Hear: Multimodal Prompting for Parameter-Efficient Audio-Visual Learning Kai Wang (University of Toronto); Shentong Mo (Carnegie Mellon University); Yapeng Tian (University of Texas at Dallas); Dimitrios Hatzinakos (University of Toronto) PDF Poster Video (Right click to download) |
| 955 |
Clean Sample Selection and Noisy Sample Rematching for Text-Based Pedestrian Retrieval Daiqiang Li (Sichuan University); Weicheng Zhang (Sichuan University); Yuanyuan Wu (Chengdu University of Technology); Honggang Chen (Sichuan University) PDF Poster Video (Right click to download) |
| 960 |
A Hybrid Framework Bridging CNN and ViT based on Theory of Evidence for Diabetic Retinopathy Grading Junlai Qiu (Hainan University); Yunzhu Chen (Guangxi Polytechnic of Construction); Hao Zheng (Tencent Jarvis Lab); Yawen Huang (Tencent Jarvis Lab); Yuexiang Li (Guangxi Medical University) PDF Poster |
| 962 |
LaMamba-Diff: Linear-Time High-Fidelity Diffusion Models Based on Local Attention and Mamba Yunxiang Fu (University of Hong Kong); Chaoqi Chen (Shenzhen University); Yizhou Yu (The University of Hong Kong) PDF Poster Video (Right click to download) |
| 976 |
Advancing Utility Pole and Sign Detection Through Deep Learning Carl Dickinson (University of Strathclyde); Gaetano Di Caterina (University of Strathclyde) PDF Poster Video (Right click to download) |
| 981 |
Ev4DGS: Novel-view Rendering of Non-Rigid Objects from Monocular Event Streams Takuya Nakabayashi (Keio University); Navami Kairanda (Max Planck Institute for Informatics); Hideo Saito (Keio University); Vladislav Golyanik (Max Planck Institute for Informatics) PDF Poster Video (Right click to download) |
| 986 |
Visible Structure Retrieval for Lightweight Image-Based Relocalisation Fereidoon Zangeneh (KTH Royal Institute of Technology); Leonard Bruns (KTH Royal Institute of Technology); Amit Dekel (Univrses AB); Alessandro Pieropan (Univrses AB); Patric Jensfelt (KTH Royal Institute of Technology) PDF Poster Video (Right click to download) |
| 987 |
Spatiotemporal Event Spotting via 3D Heatmaps with Dynamically Shifted Gaussian Kernels Ankhzaya Jamsrandorj (Korea Institute of Science and Technology); Vanyi Chao (University of Science and Technology); Hoang Quoc Nguyen (Korea Institute of Science and Technology); Yin May Oo (University of Science and Technology); Muhammad Amrulloh Robbani (University of Science and Technology); Yewon Hwang (Korea Institute of Science and Technology); Kyung-Ryoul Mun (Korea Institute of Science and Technology); Jinwook Kim (Korea Institute of Science and Technology) PDF Poster Video (Right click to download) |
| 988 |
Ink Enhancement for Ancient Bamboo Manuscripts Using Iterative Restoration-Degradation Adversarial Learning Chongsheng Zhang (Henan University, Ludwig-Maximilians-Universität München); Junchao Ma (Henan University); Wenhao Zhang (Henan University); Kamel Aouaidjia (Henan University); Qilong Li (Henan University); Gaojuan Fan (Henan University); Christian Heumann (Ludwig-Maximilians-Universität München) PDF Poster Video (Right click to download) |
| 989 |
MedOpenSeg: Open-World Medical Segmentation with Memory-Augmented Transformers Luisa Vargas (Eurecom); Eleonora Poeta (Politecnico di Torino); Tania Cerquitelli (Politecnico di Torino); Elena Baralis (Politecnico di Torino); Maria A Zuluaga (Eurecom) PDF Poster Video (Right click to download) |
| 992 |
Dual Polarity Prompts with Stochastic Entropy Perturbation for Label Noise Changhui Hu (Universitat de Barcelona); Bhalaji Nagarajan (Barcelona Supercomputing Center); Ricardo Marques (Universitat Pompeu Fabra); Petia Radeva Ivanova (Universitat de Barcelona) PDF Poster Video (Right click to download) |
| 999 |
Lid-Lab-NeRF: Generating Temporally Consistent, Labelled LiDAR Point Clouds using Neural Radiance Fields Shrestha Srivastava (Indian Institute of Science Education and Research Bhopal); Vaibhav Kumar (Indian Institute of Science Education and Research Bhopal) PDF Poster Video (Right click to download) |
| 1004 |
RP-SAM2: Refining Point Prompts for Stable Surgical Instrument Segmentation Nuren Zhaksylyk (Mohamed bin Zayed University of Artificial Intelligence); Ibrahim Almakky (Mohamed bin Zayed University of Artificial Intelligence); Jay Nitin Paranjape (Johns Hopkins University); S. Swaroop Vedula (Johns Hopkins University); Shameema Sikder (Johns Hopkins University); Vishal M. Patel (Johns Hopkins University); Mohammad Yaqub (Mohamed bin Zayed University of Artificial Intelligence) PDF Poster Video (Right click to download) |
| 1006 |
Verifier Matters: Enhancing Inference-Time Scaling for Video Diffusion Models Lorenzo Baraldi (University of Pisa); Davide Bucciarelli (University of Pisa); Zifan Zeng (Technische Universität München); Chongzhe Zhang (Technische Universität Berlin); Qunli Zhang (Huawei Technologies Ltd.); Marcella Cornia (University of Modena and Reggio Emilia); Lorenzo Baraldi (University of Modena and Reggio Emilia); Feng Liu (Huawei Technologies Ltd.); Zheng Hu (Huawei Technologies Ltd.); Rita Cucchiara (University of Modena and Reggio Emilia) PDF Poster Video (Right click to download) |
| 1007 |
Is Structural Awareness the Key to Event Camera Data Cleansing for Enhancing Veracity? Haiyu Li (The University of Sheffield); Charith Abhayaratne (The University of Sheffield) PDF Poster Video (Right click to download) |
| 1013 |
SALT: Parameter-Efficient Fine-Tuning via Singular Value Adaptation with Low-Rank Transformation Abdelrahman Elsayed (Mohamed bin Zayed University of Artificial Intelligence); Sarim Hashmi (Mohamed bin Zayed University of Artificial Intelligence); Mohammed Elseiagy (Mohamed bin Zayed University of Artificial Intelligence); Hu Wang (Mohamed bin Zayed University of Artificial Intelligence); Mohammad Yaqub (Mohamed bin Zayed University of Artificial Intelligence); Ibrahim Almakky (Mohamed bin Zayed University of Artificial Intelligence) PDF Poster Video (Right click to download) |
| 1015 |
In-Model Merging for Enhancing the Robustness of Medical Imaging Classification Models Hu Wang (Mohamed bin Zayed University of Artificial Intelligence); Ibrahim Almakky (Mohamed bin Zayed University of Artificial Intelligence); Congbo Ma (New York University); Numan Saeed (Mohamed bin Zayed University of Artificial Intelligence); Mohammad Yaqub (Mohamed bin Zayed University of Artificial Intelligence) PDF Poster Video (Right click to download) |
| 1024 |
Quantifying Risk in Pedestrian Crowds Using Divergence Estimated from Flows of Head-Tracking Data Haruto Nakayama (University of Tsukuba); Masaki Onishi (AIST) PDF Poster Video (Right click to download) |
| 1029 |
PrIINeR: Towards Prior-Informed Implicit Neural Representations for Accelerated MRI Ziad Al-Haj Hemidi (Universität zu Lübeck); Eytan Kats (Universität zu Lübeck); Mattias P. Heinrich (Universität zu Lübeck) PDF Poster Video (Right click to download) |
| 1030 |
Language-Guided Decision Override for Adaptive and Retraining-Free Video Anomaly Detection Ryo Moriyama (Aoyama Gakuin University); Shin Suzuki (Aoyama Gakuin University); Naoshi Kaneko (Tokyo Denki University); Kazuhiko Sumi (Aoyama Gakuin University) PDF Poster Video (Right click to download) |
| 1035 |
Superpixel Anything: A general object-based framework for accurate yet regular superpixel segmentation Julien Walther (University of Bordeaux); Rémi Giraud (University of Bordeaux); Michaël Clément (University of Bordeaux) PDF Poster Video (Right click to download) |
| 1036 |
Benchmarking Vision Foundation Models for Input Monitoring in Autonomous Driving Mert Keser (Continental AG, Technische Universität München); Halil Ibrahim Orhan (Technische Universität München); Niki Amini-Naieni (University of Oxford); Gesina Schwalbe (Universität zu Lübeck); Alois C. Knoll (Technical University Munich); Matthias Rottmann (Universität Osnabrück) PDF Poster Video (Right click to download) |
| 1037 |
Generative Data Augmentation for Object Point Cloud Segmentation Dekai Zhu (Technische Universität München); Stefan Gavranovic (Siemens AG); Flavien Boussuge (Siemens AG); Benjamin Busam (Technische Universität München); Slobodan Ilic (Technische Universität München) PDF Poster Video (Right click to download) |
| 1051 |
Conditional Prototype Learning for Few-Shot Object Detection Zhenwei He (Chongqing University of Technology); Xinye Liao (Chongqing University of Technology); Xin Feng (Chongqing University of Technology) PDF Poster Video (Right click to download) |
| 1058 |
Atomizer: Generalizing to unseen modalities by breaking images down to a set of scalars Hugo Riffaud de Turckheim (INRIA); Diego Marcos (INRIA); Roberto Interdonato (CIRAD); Sylvain Lobry (Université de Montpellier) PDF Poster Video (Right click to download) |
| 1060 |
Learning Correlation-aware Aleatoric Uncertainty for 3D Hand Pose Estimation Lee Chae-Yeon (Pohang University of Science and Technology); Nam Hyeon-Woo (Pohang University of Science and Technology); Tae-Hyun Oh (Korea Advanced Institute of Science and Technology) PDF Poster Video (Right click to download) |
| 1061 |
IPGPhormer: Interpretable Pathology Graph-Transformer for Survival Analysis Guo Tang (Harbin Institute of Technology Shenzhen); Songhan Jiang (Harbin Institute of Technology Shenzhen); Jinpeng Lu (University of Science and Technology of China); Linghan Cai (Harbin Institute of Technology Shenzhen); Yongbing Zhang (Harbin Institute of Technology Shenzhen) PDF Poster Video (Right click to download) |
| 1062 |
Calibration-Aware Prompt Learning for Medical Vision-Language Models Abhishek Basu (Mohamed bin Zayed University of Artificial Intelligence); Fahad Shamshad (Mohamed bin Zayed University of Artificial Intelligence); Ashshak Sharifdeen (Mohamed bin Zayed University of Artificial Intelligence); Karthik Nandakumar (Michigan State University); Muhammad Haris Khan (Mohamed bin Zayed University of Artificial Intelligence) PDF Poster Video (Right click to download) |
| 1064 |
3D-WAG: Hierarchical Wavelet-Guided Autoregressive Generation for High-Fidelity 3D Shapes Tejaswini Medi (Universität Mannheim); Arianna Rampini (Autodesk); Pradyumna Reddy (Autodesk); Pradeep Kumar Jayaraman (Autodesk); Margret Keuper (Universität Mannheim) PDF Poster Video (Right click to download) |
| 1071 |
Stabilizing Open-Set Test-Time Adaptation via Primary-Auxiliary Filtering and Knowledge-Integrated Prediction Byung-Joon Lee (Sungkyunkwan University); Jin-Seop Lee (Sungkyunkwan University); Jee-Hyong Lee (Sungkyunkwan University) PDF Poster Video (Right click to download) |
| 1072 |
Controllable Garment Generation with Multi-Modal Diffusion Guidance Sanhita Pathak (IIT Delhi); Vinay Kaushik (IIIT Sonepat); Brejesh Lall (IIT Delhi) PDF Poster Video (Right click to download) |
| 1076 |
CLFSeg: A Fuzzy-Logic based Solution for Boundary Clarity and Uncertainty Reduction in Medical Image Segmentation Anshul Kaushal (Panjab University); Kunal Jangid (Indian Institute of Science Education and Research Bhopal); Vinod K. Kurmi (Indian Institute of Science Education and Research Bhopal) PDF Poster Video (Right click to download) |
| 1077 |
Towards Sharper Object Boundaries in Self-Supervised Depth Estimation Aurélien Cecille (LIRIS); Stefan Duffner (LIRIS); Franck Davoine (LIRIS); Rémi Agier (Visual Behavior); Thibault Neveu (Visual Behavior) PDF Poster Video (Right click to download) |
| 1081 |
ImProvShow: Multimodal Fusion for Image Provenance Summarization Alexander Black (University of Surrey); Jing Shi (Adobe Research); Yifei Fan (Adobe Research); John Collomosse (Adobe Research) PDF Poster Video (Right click to download) |
| 1091 |
CLIMB-3D: Class-Incremental Imbalanced 3D Instance Segmentation Vishal Thengane (University of Surrey); Jean Lahoud (Mohamed bin Zayed University of Artificial Intelligence); Hisham Cholakkal (Mohamed bin Zayed University of Artificial Intelligence); Rao Muhammad Anwer (Mohamed bin Zayed University of Artificial Intelligence); Lu Yin (University of Surrey); Xiatian Zhu (University of Surrey); Salman Khan (Mohamed bin Zayed University of Artificial Intelligence) PDF Poster Video (Right click to download) |
| 1095 |
Supervised Segmentation Model for Improved Detection of OSSN using Slit Lamp Images Kajal Singh (Indian Institute of Technology Mandi); Shagun Bhatt (Indian Institute of Technology Mandi); Ramkailash Gujar (Dr Shroff’s Charity Eye Hospital); Arnav Bhavsar (Indian Institute of Technology Mandi); Dinesh Singh (Indian Institute of Technology Mandi) PDF Poster Video (Right click to download) |
| 1096 |
LED: Light Enhanced Depth Estimation at Night Simon de Moreau (Mines Paris - PSL University); Yasser Almehio (Valeo AI); Andrei Bursuc (Valeo AI); Hafid El Idrissi (Valeo AI); Bogdan Stanciulescu (Mines Paris - PSL University); Fabien Moutarde (Mines Paris - PSL University) PDF Poster Video (Right click to download) |
| 1109 |
AKD-BNN: Adaptive Kernel Dynamic Bayesian Neural Networks for Enhanced Medical Image Segmentation with Uncertainty Estimation Qianying He (City University of Macau); Xuan Liu (City University of Macau); Wenjian Liu (City University of Macau) PDF Poster Video (Right click to download) |
| 1110 |
Robust Human Registration with Body Part Segmentation on Noisy Point Clouds Kai Lascheit (ETH Zurich); Francis Engelmann (Stanford University); Daniel Barath (ETH Zurich); Marc Pollefeys (Microsoft); Leonidas Guibas (Stanford University) PDF Poster Video (Right click to download) |
| 1118 |
Conformal Predictors for Efficient Video Text Spotting Amor Ben Tanfous (Terminal Industries); Sankha Subhra Mukherjee (Terminal Industries); Neil M. Robertson (Terminal Industries) PDF Poster |
| 1121 |
MIAS-SAM: Medical Image Anomaly Segmentation without thresholding Marco Colussi (University of Milan); Dragan Ahmetovic (University of Milan); Sergio Mascetti (University of Milan) PDF Poster Video (Right click to download) |
| 1123 |
Evaluating Self-Supervised Learning in Medical Imaging: A Systematic Investigation of Robustness, Generalizability, and Multi-Domain Impact Valay Bundele (University of Tuebingen); Karahan Sarıtaş (University of Tuebingen); Bora Kargi (University of Tuebingen); Oğuz Ata Çal (University of Tuebingen); Kıvanç Tezören (University of Tuebingen); Zohreh Ghaderi (University of Tuebingen); Hendrik Lensch (University of Tuebingen) PDF Poster Video (Right click to download) |
| 1127 |
ALFred: An Active Learning Framework for Real-world Semi-supervised Anomaly Detection with Adaptive Thresholds Shanle Yao (University of North Carolina at Charlotte); Ghazal Alinezhad Noghre (University of North Carolina at Charlotte); Armin Danesh Pazho (University of North Carolina at Charlotte); Hamed Tabkhivayghan (University of North Carolina at Charlotte) PDF Poster Video (Right click to download) |
| 1132 |
Isolated Channel Vision Transformers: From Single-Channel Pretraining to Multi-Channel Finetuning Wenyi Lian (Uppsala University); Patrick Micke (Uppsala University); Joakim Lindblad (Uppsala University); Nataša Sladoje (Uppsala University) PDF Poster Video (Right click to download) |
| 1134 |
TRUDI and TITUS: A Multi-Perspective Dataset and A Three-Stage Recognition System for Transportation Unit Identification Emre Gülsoylu (Universität Hamburg); André Peter Kelm (Universität Hamburg); Lennart Bengtson (Universität Hamburg); Matthias Hirsch (Universität Hamburg); Christian Wilms (Universität Hamburg); Tim Rolff (Universität Hamburg); Janick Edinger (Universität Hamburg); Simone Frintrop (Universität Hamburg) PDF Poster Video (Right click to download) |
| 1135 |
The Trauma THOMPSON Dataset for Real-World Emergency AI Yupeng Zhuo (Purdue University); Eddie Zhang (Purdue University); Xiangchen Yu (Purdue University); Aditya Pachpande (Purdue University); Andrew Wallace Kirkpatrick (University of Calgary); Kyle Couperus (The Geneva Foundation); Jessica Mckee (University of Calgary); Juan Wachs (Purdue University) PDF Poster Video (Right click to download) |
| 1137 |
ETTA: Efficient Test-Time Adaptation for Vision-Language Models through Dynamic Embedding Updates Hamidreza Dastmalchi (York University); Aijun An (York University); Ali Cheraghian (CSIRO) PDF Poster Video (Right click to download) |
| 1156 |
Hierarchical Image-Guided 3D Point Cloud Segmentation in Industrial Scenes via Multi-View Bayesian Fusion Yu Zhu (Tohoku University); Naoya Chiba (Osaka University); Koichi Hashimoto (Tohoku University) PDF Poster Video (Right click to download) |
| 1161 |
Evaluating Perceptual Distance Models by Fitting Binomial Distributions to Two-Alternative Forced Choice Data Alexander Hepburn (University of Bristol); Raúl Santos-Rodriguez (University of Bristol); Javier Portilla (Consejo Superior de Investigaciones Científicas (CSIC)) PDF Poster Video (Right click to download) |
| 1171 |
Conflict-Aware Adversarial Training Zhiyu Xue (UC Santa Barbara); Haohan Wang (University of Illinois at Urbana-Champaign); Yao Qin (UC Santa Barbara); Ramtin Pedarsani (UC Santa Barbara) PDF Poster Video (Right click to download) |
| 1177 |
Binarizing Severely Degraded Ancient Bamboo Slips: Dataset and Baseline Chongsheng Zhang (Henan University, Ludwig-Maximilians-Universität München); Wanwan Fu (Henan University); Qilong Li (Henan University); Samra Zafar (Henan University); Zhanshuo Zhang (Henan University); Qiyan Li (Henan University); Gaojuan Fan (Henan University); Christian Heumann (Ludwig-Maximilians-Universität München) PDF Poster Video (Right click to download) |
| 1180 |
MO-SHW: Hierarchy-Aware Multi-Objective Optimization for Open-World Segmentation Erico M. Pereira (Universidade Federal de Minas Gerais); Frederico Gadelha Guimarães (Universidade Federal de Minas Gerais); Jefersson A Dos Santos (University of Sheffield) PDF Poster Video (Right click to download) |
| 1183 |
Contrastive Point Feature Matching for Open-world Object Counting Ngo Xuan Cuong (University of Arkansas) PDF Poster Video (Right click to download) |
| 1187 |
Out-of-Distribution Detection from Small Training Sets using Bayesian Neural Network Classifiers Kevin Raina (University of Ottawa); Tanya Schmah (University of Ottawa) PDF Poster Video (Right click to download) |
| 1196 |
IoSR: End-to-End Intraoral Scans Repairing Manel Farhat (Digital Research Center of Sfax Laboratory of Signals, Systems, Artificial Intelligence and Networks Technopark of Sfax); Achraf Ben-hamadou (Digital Research Center of Sfax Laboratory of Signals, Systems, Artificial Intelligence and Networks Technopark of Sfax); Ahmed Rekik (Digital Research Center of Sfax Laboratory of Signals, Systems, Artificial Intelligence and Networks Technopark of Sfax); Ons Abida (Digital Research Center of Sfax Laboratory of Signals, Systems, Artificial Intelligence and Networks Technopark of Sfax); Oussama Smaoui (Biotech Dental Group) PDF Poster Video (Right click to download) |
| 1199 |
AULUNet: An Adaptive Ultra-Lightweight U-Net Framework for Efficient Skin Lesion Segmentation in Resource-Constrained Environments Md Maklachur Rahman (Texas A&M University); Soon Ki Jung (Kyungpook National University); Tracy Hammond (Texas A&M University) PDF Poster Video (Right click to download) |
| 1209 |
Language-Guided Reinforcement Learning for Hard Attention in Few-Shot Learning Bahareh Nikpour (McGill University); Narges Armanfard (McGill University) PDF Poster Video (Right click to download) |
| 1220 |
Hair Strand Reconstruction based on 3D Gaussian Splatting Yimin Pan (Technical University of Munich); Matthias Nießner (Technical University of Munich); Tobias Kirschstein (Technical University of Munich) PDF Poster Video (Right click to download) |
| 1224 |
RASALoRE: Region Aware Spatial Attention with Location-based Random Embeddings for Weakly Supervised Anomaly Detection in Brain MRI Scans Bheeshm Sharma (IIT Bombay); Karthikeyan Jaganathan (IIT Bombay); Balamurugan Palaniappan (IIT Bombay) PDF Poster Video (Right click to download) |
| 1257 |
End-to-End LiDAR optimization for point cloud registration Siddhant Katyan (Université Laval); Marc-Andre Gardner (Bentley Systems); Jean-Francois Lalonde (Université Laval) PDF Poster Video (Right click to download) |
If there are any mistakes on this page, please do not hesitate to contact bmvc@bmvc2025.org