BMVC conference papers, supplementary material and video presentations can be found at: BMVC Papers
BMVC workshop papers can be found at: BMVC Workshop Papers
All the rooms listed are in Cutlers' Hall.
Keynote 3 - Angela Dai
09:00 - 10:00
| 09:00 - 10:00 | Title: Can Transformers Speak Geometry?
Abstract: What if generating a 3D mesh were as natural as predicting the next word in a sentence? Autoregressive modeling has rapidly become a unifying learning paradigm across data modalities, across language to images, and now offers a compelling approach for 3D geometry. This talk explores how transformer-based autoregressive models enable mesh generation by representing meshes as sequences. Framing mesh generation as a next-token prediction problem enables new ways to handle the compact, irregular structure of human-designed 3D assets, directly compatible with downstream graphics and vision applications. We explore sequence formulation and data representation, and address practical challenges in scaling to high-resolution meshes and interactive synthesis. This will enable more accessible and democratized 3D content creation, paving the way for interactive design, rapid prototyping, and simulation-ready assets, and unlocking new possibilities for both creative and computational exploration of 3D geometry.
Location: Main Hall
|
|---|
Doctoral Consortium
10:00 - 13:00
| Chairs: Dr Cass Zhixue Zhao and Dr Yang Long |
Uncertainty Propagation and Robustness in Rotation Averaging
Yaroslava Lochman (Chalmers University of Technology)
|
|---|---|
|
Complex-valued Neural Networks in Computer Vision
Saurabh Yadav (IIIT Delhi)
|
|
|
Robust and Generalizable 3D Perception for Autonomous Systems
Maciej Wozniak (KTH Royal Institute of Technology)
|
|
|
Constructing and Maintaining Geometric Digital Twins of Road Conditions
Percy Pui Hei Lam (University of Cambridge)
|
|
|
Visual Perception and Reconstruction under Challenging Lighting Conditions
Ziteng Cui (The University of Tokyo)
|
|
|
Generalizable Multimodal Brain Decoding
Weihao Xia (University College London)
|
|
|
Towards Visually-Plausible and Controllable 3D Representations
Dongqing Wang (EPFL)
|
|
|
Location: Goodwin Room
|
|
Poster Session 5 - Advanced Architectures, Robustness and Efficiency
10:00 - 11:45
| 10:00 - 11:45 |
Drawing Room Posters
Location: Drawing Room
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 10:00 - 11:45 |
Hadfield Hall Posters
Location: Hadfield Hall
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Oral Session 6 - Advanced Architectures, Robustness and Efficiency
11:45 - 13:00
| Chair: Márjory Da Costa-Abreu (Sheffield Hallam University) | 11:45 | 1132 |
Isolated Channel Vision Transformers: From Single-Channel Pretraining to Multi-Channel Finetuning
Wenyi Lian, Patrick Micke, Joakim Lindblad, Nataša Sladoje
|
|---|---|---|---|
| 12:00 | 511 |
LieMorph: Transformer-based Image Registration Using Flows on Lie Groups
Johannes Bostelmann, Jan Lellmann
|
|
| 12:15 | 239 |
REACT: Real-time Efficiency and Accuracy Compromise for Tradeoffs in Scene Graph Generation
Maëlic Neau, Paulo Eduardo Santos, Anne-Gwenn Bosser, Akihiro Sugimoto, Cedric Buche
|
|
| 12:30 | 567 |
Zero-Shot Anomaly Detection with Dual-Branch Prompt Selection
Zihan Wang, Samira Ebrahimi Kahou, Narges Armanfard
|
|
| 12:45 | 721 |
Gromov Wasserstein Optimal Transport for Semantic Correspondences
Francis Snelgar, Stephen Gould, Ming Xu, Liang Zheng, Akshay Asthana
|
|
|
Location: Main Hall
|
|||
Poster Session 6 - Human & Motion Analysis (+ Doctoral Consortium Posters)
14:00 - 15:45
| 14:00 - 15:45 |
Drawing Room Posters
Location: Drawing Room
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 14:00 - 15:45 |
Hadfield Hall Posters
Location: Hadfield Hall
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||
Oral Session 7 - Human & Motion Analysis
15:45 - 17:00
| Chair: Jianbo Jiao (University of Birmingham) | 15:45 | 66 |
ST-GDance: Long-Term and Collision-Free Group Choreography from Music
Jing Xu, Weiqiang Wang, Cunjian Chen, Jun Liu, Qiuhong Ke
|
|---|---|---|---|
| 16:00 | 566 |
OpenHuman4D: Open-Vocabulary 4D Human Parsing
Keito Suzuki, Bang Du, Runfa Li, Kunyao Chen, Lei Wang, Peng Liu, Ning Bi, Truong Nguyen
|
|
| 16:15 | 338 |
GLip: A Global-Local Integrated Progressive Framework for Robust Visual Speech Recognition
Tianyue Wang, Shuang Yang, Shiguang Shan, Xilin Chen
|
|
| 16:30 | 675 |
Jack of many Faces: A Step Towards Facial Expression and Physiological State Analysis with a Single Network
Abdullah Tariq, Martin Masek, R Muhammad Atif Azad, Syed Zulqarnain Gilani
|
|
| 16:45 | 610 |
HERO-VQL: Hierarchical, Egocentric and Robust Visual Query Localization
Hyogun Lee, Joohyun Chang, Soyeon Hong, Seong Jong Ha, Dongho Lee, Seong Tae Kim, Jinwoo Choi
|
|
|
Location: Main Hall
|
|||