Skip to main content
Thesis defences

PhD Oral Exam - Jiaxun Guo, Information and Systems Engineering

Structured Probabilistic Latent Modeling for Representation Learning and 3D Perception


Date & time
Thursday, April 2, 2026
10 a.m. – 1 p.m.
Cost

This event is free

Organization

School of Graduate Studies

Contact

Dolly Grewal

Where

Engineering, Computer Science and Visual Arts Integrated Complex
1515 Ste-Catherine St. W.
Room 3.309

Accessible location

Yes - See details

When studying for a doctoral degree (PhD), candidates submit a thesis that provides a critical review of the current state of knowledge of the thesis subject as well as the student’s own contributions to the subject. The distinguishing criterion of doctoral graduate research is a significant and original contribution to knowledge.

Once accepted, the candidate presents the thesis orally. This oral exam is open to the public.

Abstract

Traditional representation learning and 3D perception methods commonly rely on Gaussian priors in latent spaces, which are inadequate for modeling complex structural asymmetries, directional uncertainty, and rotational symmetries. This thesis introduces structured probabilistic latent models with non-Gaussian distributions to address these challenges, enabling uncertainty-aware representation learning and robust 3D geometric perception. The research first investigates foundational probabilistic modeling for latent representations. We propose GamMM-VAE, a deep generative clustering framework that employs an asymmetric Gamma mixture prior together with a novel reparameterization to learn high-quality latent embeddings for non-negative vectors. To further model directional and axially symmetric data, we develop VIBinMM, a scalable variational inference framework for Bingham mixture models. By exploiting the mathematical relationship between Bingham and Gaussian distributions, the proposed approach bypasses intractable normalization constants and enables efficient inference in high-dimensional directional spaces. Building upon these foundations, the thesis extends structured latent priors to rotation-invariant 3D perception. We introduce the Shadow-informed Pose Feature (SiPF) and the Ga4DPF framework, which leverage Bingham distributions over unit quaternions to dynamically model global pose references. These mechanisms preserve strict rotation invariance while resolving local geometric ambiguities and mitigating feature collapse in symmetric structures. In addition, we present BRVSNet, a self-supervised framework for probabilistic rotation estimation and consistent multi-view generation. Extensive experiments on multiple benchmarks demonstrate that incorporating structured, non-Gaussian priors, particularly Gamma and Bingham distributions, leads to more expressive latent representations and robust, uncertainty-aware perception. The proposed models consistently outperform state-of-the-art methods across tasks ranging from unsupervised learning to fine-grained 3D spatial discrimination.

Back to top

© Concordia University