BMVC conference papers, supplementary material and video presentations can be found at: BMVC Papers
BMVC workshop papers can be found at: BMVC Workshop Papers
All the rooms listed are in Cutlers' Hall.
Keynote 3 - Angela Dai
09:00 - 10:00
| 09:00 - 10:00 | Title: Can Transformers Speak Geometry?
Abstract: What if generating a 3D mesh were as natural as predicting the next word in a sentence? Autoregressive modeling has rapidly become a unifying learning paradigm across data modalities, across language to images, and now offers a compelling approach for 3D geometry. This talk explores how transformer-based autoregressive models enable mesh generation by representing meshes as sequences. Framing mesh generation as a next-token prediction problem enables new ways to handle the compact, irregular structure of human-designed 3D assets, directly compatible with downstream graphics and vision applications. We explore sequence formulation and data representation, and address practical challenges in scaling to high-resolution meshes and interactive synthesis. This will enable more accessible and democratized 3D content creation, paving the way for interactive design, rapid prototyping, and simulation-ready assets, and unlocking new possibilities for both creative and computational exploration of 3D geometry.
Location: Main Hall
|
|---|
Doctoral Consortium
10:00 - 13:00
| Chairs: Dr Cass Zhixue Zhao and Dr Yang Long | Welcome & Overview (10:00–10:05, 5 mins) |
|---|---|
| Oral Presentations – Session 1 (10:05–10:45, 40 mins) 3 candidate presentations (10 mins each + 3 mins Q&A) |
|
|
1. Uncertainty Propagation and Robustness in Rotation Averaging Yaroslava Lochman (Chalmers University of Technology) 2. Complex-valued Neural Networks in Computer Vision Saurabh Yadav (IIIT Delhi) 3. Robust and Generalizable 3D Perception for Autonomous Systems Maciej Wozniak (KTH Royal Institute of Technology) |
|
| Tea & Coffee Break (10:45–11:15, 30 mins) | |
| Oral Presentations – Session 2 (11:20–12:20, 50 mins) 4 candidate presentations (10 mins each + 3 mins Q&A) |
|
|
1. Constructing and Maintaining Geometric Digital Twins of Road Conditions Percy Pui Hei Lam (University of Cambridge) 2. Visual Perception and Reconstruction under Challenging Lighting Conditions Ziteng Cui (The University of Tokyo) 3. Generalizable Multimodal Brain Decoding Weihao Xia (University College London) 4. Towards Visually-Plausible and Controllable 3D Representations Dongqing Wang (EPFL) |
|
| Mentor Discussion & Wrap-Up (12:20–12:55, 35 mins) | |
| Closing Remarks (12:55–13:00, 5 mins) | |
Location: Goodwin Room |
Poster Session 5 - Advanced Architectures, Robustness and Efficiency
10:00 - 11:45
| 10:00 - 11:45 |
Drawing Room Posters
Location: Drawing Room
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 10:00 - 11:45 |
Hadfield Hall Posters
Location: Hadfield Hall
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Oral Session 6 - Advanced Architectures, Robustness and Efficiency
11:45 - 13:00
| Chair: Márjory Da Costa-Abreu (Sheffield Hallam University) | 11:45 | 1132 |
Isolated Channel Vision Transformers: From Single-Channel Pretraining to Multi-Channel Finetuning
Wenyi Lian, Patrick Micke, Joakim Lindblad, Nataša Sladoje
|
|---|---|---|---|
| 12:00 | 511 |
LieMorph: Transformer-based Image Registration Using Flows on Lie Groups
Johannes Bostelmann, Jan Lellmann
|
|
| 12:15 | 239 |
REACT: Real-time Efficiency and Accuracy Compromise for Tradeoffs in Scene Graph Generation
Maëlic Neau, Paulo Eduardo Santos, Anne-Gwenn Bosser, Akihiro Sugimoto, Cedric Buche
|
|
| 12:30 | 567 |
Zero-Shot Anomaly Detection with Dual-Branch Prompt Selection
Zihan Wang, Samira Ebrahimi Kahou, Narges Armanfard
|
|
| 12:45 | 721 |
Gromov Wasserstein Optimal Transport for Semantic Correspondences
Francis Snelgar, Stephen Gould, Ming Xu, Liang Zheng, Akshay Asthana
|
|
|
Location: Main Hall
|
|||
Poster Session 6 - Human & Motion Analysis (+ Doctoral Consortium Posters)
14:00 - 15:45
| 14:00 - 15:45 |
Drawing Room Posters
Location: Drawing Room
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 14:00 - 15:45 |
Hadfield Hall Posters
Location: Hadfield Hall
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||
Oral Session 7 - Human & Motion Analysis
15:45 - 17:00
| Chair: Jianbo Jiao (University of Birmingham) | 15:45 | 66 |
ST-GDance: Long-Term and Collision-Free Group Choreography from Music
Jing Xu, Weiqiang Wang, Cunjian Chen, Jun Liu, Qiuhong Ke
|
|---|---|---|---|
| 16:00 | 566 |
OpenHuman4D: Open-Vocabulary 4D Human Parsing
Keito Suzuki, Bang Du, Runfa Li, Kunyao Chen, Lei Wang, Peng Liu, Ning Bi, Truong Nguyen
|
|
| 16:15 | 338 |
GLip: A Global-Local Integrated Progressive Framework for Robust Visual Speech Recognition
Tianyue Wang, Shuang Yang, Shiguang Shan, Xilin Chen
|
|
| 16:30 | 675 |
Jack of many Faces: A Step Towards Facial Expression and Physiological State Analysis with a Single Network
Abdullah Tariq, Martin Masek, R Muhammad Atif Azad, Syed Zulqarnain Gilani
|
|
| 16:45 | 610 |
HERO-VQL: Hierarchical, Egocentric and Robust Visual Query Localization
Hyogun Lee, Joohyun Chang, Soyeon Hong, Seong Jong Ha, Dongho Lee, Seong Tae Kim, Jinwoo Choi
|
|
|
Location: Main Hall
|
|||