Skip to main content
Oral defences & examinations, Thesis defences

Masters Thesis Defense: Justin Whatley


Date & time
Friday, August 6, 2021
10 a.m. – 12 p.m.
Cost

This event is free

Where

Online

Candidate:

Justin Whatley

   
             

Thesis Title:

Fault Analysis Using Learning Models with Model Interpretation

             

Date & Time: 

August 6th, 2021 @ 10:00 AM

   
             

Location:

Zoom

   
             

Examining Committee:

         
             
 

Dr. Yann-Gael Gueheneuc

(Chair)

   
             
 

Dr. Thomas Fevens

(Supervisor)

   
             
 

Dr. Tristan Glatard

(Examiner)

 
             
 

Dr. Yann-Gael Gueheneuc

(Examiner)

 
             

Abstract

As machine learning moves from theoretical applications in academia to promising solutions to problems across industry and healthcare, effective interpretability strategies are critically important to adoption. However, model interpretability strategies can be extended to offer more than validation for the predictions a model is making. Learned models offer a proxy for the data by capturing relationships between feature inputs and target outcomes, offering a representation that can be analysed. To that end, this work describes a fault analysis system that leverages learned models to characterize faults by using SHapley Additive exPlanations (SHAP).

 

In particular, this fault analysis system was designed for large structured datasets such as those available in telecommunications networks. The strategy works by forming a learned representation with tree-based models using gradient-boosting. Once a problematic sample is selected for analysis, the computationally efficient implementation

of the SHAP algorithm specialized for tree-based models is employed to gauge feature contributions to the performance degradation observed in the sample. Thus, this fault analysis strategy effectively provides an explanation for the degradation in a problematic sample informed through a model-based representation of the relevance of input characteristics across contexts. An evaluation of the strategy is performed, demonstrating its reliability for structured communications data using a 4G LTE dataset.

Back to top

© Concordia University