PhD Defense Anaïs Boumendil

The defense will take place on july 6th, 2026 (2pm) at the Claude Chappe amphitheater (Lamarr Building).

Title
Towards energy-efficient machine learning models for resource-constrained platforms

Abstract
Neural networks have demonstrated high-level performance across various fields and have largely driven recent progress in artificial intelligence. However the substantial energy consumption of these models constitutes a major obstacle to deploying and, particularly, training neural networks on resource-constrained devices while also raising concerns about their environmental impact. Enhancing the energy efficiency of neural networks is therefore an urgent need regardless of the availability of hardware resources.

This thesis investigates the energy efficiency of neural networks training. To this end, we first propose a taxonomy of existing approaches. We then focus on the growing technique that extends the architecture size during training and on data selection which trains the model on a subset of examples rather than the entire dataset. We first analyze the trade-off that the two methods offer between energy consumption and prediction performance by comparing them to pruning, a standard approach for optimizing computational costs. Overall, results demonstrate that data selection offers proportional energy savings to the number of discarded examples while training on smaller subsets incurs larger performance drops. Growing preserves accuracy while energy consumption is influenced by when the expansion occurs. This timing of growing indeed determines how long each architecture size is used during training, which directly impacts the costs.

Following the comparison results, we further investigate the impact of the growing timing by comparing the one-shot schedule which extends the neural network to a target size in a single step to the iterative schedule which grows the model in multiple steps. We find that one-shot growing offers a competitive accuracy, despite being rarely considered in prior works. It can also enhance energy gains, compared to the iterative schedule, particularly when costly initialization strategies are employed for grown neurons. We then explore when one-shot growing should occur during training to improve the accuracy-efficiency trade-off and find that this choice is governed by a combination of factors including dataset complexity as well as the target performance.

After investigating the growing timing, we focus on adaptive data selection that allows updating the subset throughout training. We consider loss-based methods, a common selection criteria. Such methods require a forward phase on all examples to compute selection losses which can result in a significant overhead and hinder the benefits of training with less data. We explore using the early exit mechanism to build a smaller sub-model to compute selection losses more efficiently and improve the overall computational gains of loss-based data selection. Results show that such a strategy is more beneficial on larger models. We then explore a combination of loss-based and random selection criteria while decreasing the number of selected training examples as training progresses. We employ loss-based selection during first epochs to focus on challenging examples before switching to the efficient random criterion to maximize energy gains. This strategy combines the strengths of the two approaches and improves the accuracy with a careful choice of the switching epoch. 

Jury
– Anne-Laure Ligozat (Full professor, ENSIIE): Reviewer
– Denis Trystram (Full professor, Grenoble INP): Reviewer
– Virginie Fresse (Associate professor HDR, Université Jean Monnet Saint Etienne): Examiner
– Anne-Cécile Orgerie (Senior researcher, CNRS): Examiner
– Jalil Boukhobza (Full professor, ENSTA-Bretagne): Examiner
– Walid Bechkit (Full professor, Université Lumière Lyon 2) : Supervisor
– Frédéric Le Mouël (Full professor, INSA Lyon) : Co-supervisor
– Malcolm Egan (Researcher, Inria):  Co-supervisor

 


PHD defense Alex Huang Feng

The defense will take place on March 02, 2026

Title
Service Provider Network Anomaly Detection: Standards, Architecture, and Algorithms for actual use by Network Operators

Abstract
IP networks are critical infrastructure. Disruptions can impact essential services and cause serious economic and societal harm. Rapid detection is therefore essential for service providers.
This thesis investigates an architecture and approach for detecting service anomalies in real-world service provider networks. While statistical outlier detection has been widely studied for this purpose, such methods often misalign with how engineers monitor and diagnose networks, limiting their actual adoption in production.
This thesis proposes a network-centric anomaly detection approach based on deterministic, rule-based inspections. Instead of outlier detection, expert knowledge is encoded into rules that reflect how engineers interpret network telemetry data. The system processes telemetry data, maps it to customer identifiers, and applies rules to detect deviations from expected behaviors. Because these rules mirror operational data inspections from network engineers, alerts are directly actionable: they identify affected customers and specify the underlying symptoms. We apply this method to the two most revenue-generating connectivity services: BGP/MPLS Layer 3 VPNs and Internet access.

Our work addresses three questions:
(i) what operational requirements must an anomaly detection system meet in service provider networks,
(ii) which telemetry protocols best align with network operations, and
(iii) which detection strategies best reflect engineers’ mental models.

This is the first systematic study of anomaly detection in BGP/MPLS Layer 3 VPNs. For Internet access, we focus on issues within the service provider network, including those invisible to end users but economically relevant. Our system is deployed in Swisscom’s production environment, monitoring over 13,000 VPN customers. It processes 760,000 telemetry messages per second and has detected incidents such as software defects, misconfigured DNS blacklists, and mistakenly decommissioned fiber links. Since its deployment, it has identified over 20 disruptions, enabling operators to quickly acknowledge incidents and postpone maintenance windows. This thesis presents production cases showing how the system reduced detection time and supported operational workflows.
Our findings show that effective anomaly detection in service provider environments must go beyond data-driven approaches. Systems need to reflect how network engineers actually work.Rule-based data inspections, grounded in established operational practices, offer a practical, actionable alternative to statistical models.

Jury
– CONTRERAS MURILLO Luis Miguel, Chercheur, Telefonica, Rapporteur
– PELSSER Cristel,  Professeur des Universités, UCLouvain, Rapporteuse
– IANNONE Luigi,  Chercheur, Huawei, Examinateur
– ROOSE Philippe, Professeur des Universités, IUT de Bayonne, Examinateur
– FRANCOIS Pierre, Professeur, INSA de Lyon, Directeur de thèse
– FRENOT Stéphane, Professeur des Universités, INSA de Lyon, Co-directeur de thèse