CITI seminar – Jérôme Nika (Ircam) – 4/7 at 10:00

Title: Technologies génératives pour la création musicale : composer à l’échelle du comportement ou de la narration.

Date and Place: 10h Monday 4/7/2022 in TD-C

Speaker: Jérôme Nika (Ircam)

Hosts: Romain Michon and Tanguy Risset (Emeraude project-team)

 

Abstract: 

Une machine sera-t-elle bientôt capable de remplacer l’humain dans la création musicale ? Pour toute une partie des artisans de l’intelligence artificielle appliquée à la musique, artistes comme scientifiques, il est difficile de répondre à cette question récurrente car ce n’est pas celle qui se pose. En effet, si on “apprend” la musique à des ordinateurs dotés d’une “mémoire” musicale inspirée de la cognition humaine, l’enjeu réside précisément dans le fait de partir de ces modèles pour explorer la production d’une musique nouvelle plutôt que la reproduction d’une musique crédible.

Les recherches associées au sein de l’équipe Représentations Musicales de l’Ircam s’articulent autour de la notion de « mémoire musicale » : son apprentissage, sa modélisation, et sa mobilisation dans un contexte créatif. Elles ont donné naissance à DYCI2lib, une librairie d’agents génératifs pour la performance et la composition musicale combinant les approches libres, planifiées et spécifiées, et réactives de la génération à partir d’un corpus. Ces travaux ont été mis en oeuvres dans le cadre de nombreuses collaborations artistiques et productions musicales, notamment dans le jazz et les musiques improvisées (Steve Lehman, Bernard Lubat, Benoît Delbecq, Rémi Fox, Orchestre National de Jazz), la musique contemporaine (Pascal Dusapin, Ensemble Modern, Alexandros Markeas, Marta Gentilucci), et l’art contemporain (Le Fresnoy – Studio National des Arts Contemporains, Vir Andres Hera) La présentation des pratiques musicales permises par ces instruments génératifs au service de la créativité humaine sera illustrée par des extraits de ces productions récentes.

 

Bio:

Jérôme Nika est chercheur en technologies génératives pour la création musicale, réalisateur en informatique musicale, et musicien. Diplômé des écoles ENSTA ParisTech et Télécom ParisTech ainsi que du master ATIAM (Acoustique, Traitement du signal, Informatique, Appliqués à la Musique – Sorbonne Université / Ircam / Télécom ParisTech), il a également étudié la composition musicale. Il s’est spécialisé dans l’application de l’informatique et du traitement du signal à la création numérique et à la musique à travers un doctorat (« Prix Jeune Chercheur Science/Musique 2015 », « Prix Jeune Chercheur 2016 », Association Française d’Informatique Musicale) puis en tant que chercheur à l’Ircam (Institut de Recherche et Coordination Acoustique/Musique).

En 2019, il entre au Fresnoy – Studio National des Arts Contemporain en tant que chercheur invité. Cette même année, il travaille sur 3 projets : le projet évolutif Lullaby Experience, du compositeur Pascal Dusapin et deux projets de musique improvisée : Silver Lake Studies en duo avec Steve Lehman et C’est pour ça en duo avec Rémi Fox (projet lauréat de l’aide DICRéAM du CNC pour 2020) .

En 2020, il devient chercheur permanent dans l’équipe Représentations Musicales de l’Ircam où il développe des instruments logiciels en interaction avec des musiciens experts. Plus de 60 concerts et performances artistiques ont mis ces outils en jeu depuis 2016 (Onassis Center, Athènes, Grèce; Ars Electronica Festival, Linz, Autriche; Frankfurter Positionen festival, Frankfurt; Annenberg Center, Philadelphia, USA; Centre Pompidou, Collège de France, LeCentquatre, Paris, France; Montreux Jazz festival, etc.). La dernière production en date, pour laquelle il crée l’électronique générative est le concert “Ex Machina”, une collaboration entre Steve Lehman, Frédéric Maurin, et l’Orchestre National de Jazz, créé le 11 février 2022 à la Maison de la Radio et diffusé sur France Musique.

 


PhD Defence: “User Association in Flexible and Agile Mobile Networks”, Romain PUJOL, amphi, Chappe Building, 6th of July 2022 at 2 PM

 

The defense will take place on Wenesday, July 6 at 2 pm, in the amphitheatre of the Telecommunications Department (Claude Chappe building), INSA Lyon, Villeurbanne.

The presentation will be available online: https://insa-lyon-fr.zoom.us/j/91599742042

 

Title

User Association in Flexible and Agile Mobile Networks

 

 

Abstract

The user association process in cellular networks consists in choosing the base station with which the user equipment will negotiate radio resources. The current association is based on measuring the signal strength received by the user equipment from each of the base stations. The association must now deal with the diversification of user application needs and the growing heterogeneity of cellular networks.

 

In this thesis, we show experimentally that the current association process has reached its limits, that it is agnostic of the configurations of the base stations and that it does not allow to control the throughput that the user equipment will obtain following the association. We set up for the realization of our measurements an experimental platform based on the software suite of cellular networks srsRAN and software radios USRP NI-2091.

 

We propose in this thesis a new metric to be used during the association process. This metric, broadcasted by the base station, is a load information which in our case will be represented by the number of user equipment connected to the base station. We also discuss other metrics that can be used as load information. We show, once again experimentally, by modifying the source code of the srsRAN software suite ourselves, that if the user equipment takes this load information into account in the association process, the association decision is improved in 22% of cases.

 

Jury

  • Tara ALI YAHIA, Associate Professor HDR, Université Paris Saclay (Reviewer)
  • Vania CONAN, HDR, Thales (Reviewer)
  • Véronique VEQUE, Professor, Université Paris Saclay (Examiner)
  • Frédéric LAUNAY, Associate Professor, IUT de Poitiers (Examiner)
  • Stéphane FRENOT, Professor, INSA-Lyon (Examiner)
  • André-Luc BEYLOT, Professor, ENSEEIHT (Examiner)
  • Fabrice VALOIS, Professor, INSA-Lyon (Thesis supervisor)
  • Razvan STANICA, Associate Professor HDR, INSA-Lyon (Thesis co-supervisor)

PhD Defence: “Activity Models and Bayesian Estimation Algorithms for Wireless Grant-Free Random Access”, Lélio CHETOT, amphi, Chappe Building, 7th of July 2022 at 2 PM

 

The defense will take place on Thursday, July 7 at 2 pm, in the amphitheatre of the Telecommunications Department (Claude Chappe building), INSA Lyon, Villeurbanne.

The presentation will be available online: https://youtu.be/hX3t9pKPcoc

 

Title

Activity Models and Bayesian Estimation Algorithms for Wireless Grant-Free Random Access

 

 

Abstract

The new 5G’s wireless networks have recently started to be deployed all around the world. With them, a large spectrum of services are about to emerge, resulting in new stringent requirements so that 5G targets performance exceed that of 4G by a factor of 10. The services are centered around the use cases of enhanced mobile broadband (eMBB), ultra reliable and low-latency communication (uRLLC) and massive machine-type communication (mMTC) where each of which has required the ongoing development of key new technologies. Many of these technologies will also play an important role in the emergence of 6G.

In this thesis, the focus is on grant-free RA (GFRA) as an enabler of uRLLC and mMTC. GFRA is a new protocol introduced in 5G new radio (5G-NR) for reducing the data overhead of the random access (RA) procedure. This results in a significant reduction in the latencies of the user equipments (UEs) access to a connected medium via an access point (AP). Achieving efficient GFRA is of key importance for many 5G applications, e.g. for large scale internet of things (IoT) wireless networks. The study of new non-orthogonal multiple access (NOMA) signal processing techniques is then considered. Using tools from the theory of compressed sensing (CS), and particularly from Bayesian CS, new algorithms within the family of approximate message passing (AMP) are developed to address the joint active user detection and channel estimation (AUDaCE) problem. The active user detection is crucial to properly identify transmitting UEs within the context of large-scale dense network; the channel estimation is equally important so that an AP can reliably transmit back data to the detected UEs.

In this thesis, in contrast to existing work on this topic, the AUDaCE is studied for wireless networks where the activity of the UEs is assumed to be correlated, as is typical for many large-scale dense networks. To this end, two new activity models are introduced. The first one assumes that the activity of the UEs in the network can be modeled via group-homogeneous activity (GHomA) where devices in the same group have common pairwise correlations and marginal activity probabilities. The second model accounts for more general dependence structure via group-heterogeneous activity (GHetA). Novel approximate message passing algorithms within the hybrid GAMP (HGAMP) framework are developed for each of the models. With the aid of latent variables associated to each group for modeling the activity probabilities of the UEs, the GHomA-HGAMP algorithm can perform AUDaCE for GFRA leveraging such a group homogeneity. When the activity is heterogenous, i.e. each UE is associated with a latent variable modeling its activity probability correlated with the other variables, it is possible to develop GHetA-HGAMP using the copula theory.

Extensive numerical studies are performed, which highlight significant performance improvements of GHomA-HGAMP and GHetA-HGAMP over existing algorithms (modified generalized AMP (GAMP) and group-sparse HGAMP (GS-HGAMP)), which do not properly account for correlation in activity. In particular, the channel estimation and active user detection capability are enhanced in many scenarios with up to a 4dB improvement with twice less user errors.

As a whole, this thesis provides a systematic approach to AUDaCE for wireless networks with correlated activities using tools from Bayesian CS. We then conclude by showing how it could be used for multi-carrier orthogonal frequency division multiplexing (OFDM) scenarios with possible extensions for grant-free (GF) data transmission leveraging joint data recovery, active user detection and channel estimation (DrAUDaCE).

 

Jury

  • Catherine DOUILLARD, Professor @ IMT Atlantique, Reviewer
  • Dejan VUKOBRATOVIC, Professor @ Univ. Novi Sad, Reviewer
  • Aline ROUMI, Senior Research Scientist @ INRIA Rennes, Examiner
  • Philippe CIBLAT, Professor @ Telecom Paris, Examiner
  • Cedomir STEFANOVIC, Professor @ Univ. Aalborg, Examiner
  • Jean-Marie GORCE, Professor @ INSA Lyon, Director
  • Malcolm EGAN, Research Scientist @ INRIA Lyon, Supervisore

PhD Defence: “Large-scale Automatic Learning of Autonomous Agent Behavior with Structured Deep Reinforcement Learning”, Edward Beeching, amphi, Chappe Building, 3rd of May 2022 at 10:00 AM

 

The defense willtake place on Tuesday, May 3 at 10:00 am, in the amphitheatre of the Telecommunications Department (Claude Chappe building), INSA Lyon, Villeurbanne.

The presentation will be available on Youtube at the following link: https://youtu.be/k7dh-thqbSk

 

Title

Large-scale Automatic Learning of Autonomous Agent Behavior with Structured Deep Reinforcement Learning

 

 

Abstract

Autonomous robotic agents have begun to impact many aspects of our society,with application in automated logistics, autonomous hospital porters, manufacturing and household assistants. The objective of this thesis is to explore Deep Reinforcement Learning approaches to planning and navigation in large and unknown 3D environments. In particular, we focus on tasks that require exploration and memory in simulated environments. An additional requirement is that learned policies should generalize to unseen map instances. Our long-term objective is the transfer of a learned policy to a real-world robotic system. Reinforcement learning algorithms learn by interaction. By acting with the objective of accumulating a task-based reward, an Embodied AI agent must learn to discover relevant semantic cues such as object recognition and obstacle avoidance, if these skills are pertinent to the task at hand. This thesis introduces the field of Structured Deep Reinforcement Learning and then describes 5 contributions that were published during the PhD.

We start by creating a set of challenging memory-based tasks whose performance is benchmarked with an unstructured memory-based agent. We then demonstrate how the incorporation of structure in the form of a learned metric map, differentiable inverse projective geometry and self-attention mechanisms; augments the unstructured agent, improving its performance and allowing us to interpret the agent’s reasoning process.

We then move from complex tasks in visually simple environments, to more challenging environments with photo-realistic observations, extracted from scans of real-world buildings. In this work we demonstrate that augmenting such an agent with a topological map can improve its navigation performance. We achieve this by learning a neural approximation of a classical path planning algorithm, which can be utilized on graphs with uncertain connectivity.

From work undertaken over the course of a 4-month internship at the research and development department of Ubisoft, we demonstrate that structured methods can also be used for navigation and planning in challenging video game environments. Where we couple a lower level neural policy with a classical planning algorithm to improve long-distance planning and navigation performance in vast environments of 1km×1km. We release an open-source version of the environment as a benchmark for navigation in large-scale environments.

Finally, we develop an open-source Deep Reinforcement Learning interface for the Godot Game Engine. Allowing for the construction of complex virtual worlds and the learning of agent behaviors with a suite of state-of-the-art algorithms. We release the tool with a permissive open-source (MIT) license, to aid researchers in their pursuit of complex embodied AI agents.

 

 

Jury

    • Mme. Elisa Fromont – Université de Rennes 1 – Rapporteur
    • M. David Filliat – ENSTA Paris – Rapporteur
    • M. Cédric Démonceaux – Université de Bourgogne – Examinateur
    • M. Karteek Alahari – INRIA Grenoble – Examinateur
    • Mme. Christine Solnon – INSA-Lyon – Examinateur
    • M. Olivier Simonin – INSA-Lyon – Directeur de thèse
    • M. Jilles Dibangoye – INSA-Lyon – Co-encadrant de thèse
    • M. Christian Wolf – Naver Labs Europe – Co-directeur de thèse

PhD Defence: “Applications of Deep Learning to the Design of Enhanced Wireless Communication Systems”, Mathieu Goutay, Showcase room, Chappe Building, 28th of January 2022 at 2:00 PM

This thesis has been done within Nokia Bell Labs France and the Maracas team of the CITI laboratory.

The defense willtake place on Friday, January 28 at 2:00 pm, in the showcase room of the Telecommunications Department (first floor), INSA Lyon, Villeurbanne.

The presentation will be available on Zoom at the following link: https://insa-lyon-fr.zoom.us/j/96199554997

 

Title

Applications of Deep Learning to the Design of Enhanced Wireless Communication Systems

 

 

Abstract

Innovation in the physical layer of communication systems has traditionally been achieved

by breaking down the transceivers into sets of processing blocks, each optimized independently based on mathematical models. This approach is now challenged by the ever-growing demand for wireless connectivity and the increasingly diverse set of devices and use-cases. Conversely, deep learning (DL)-based systems are able to handle increasingly complex tasks for which no tractable models are available. By learning from the data, these systems could be trained to embrace the undesired effects of practical hardware and channels instead of trying to cancel them. This thesis aims at comparing different approaches to unlock the full potential of DL in the physical layer.

First, we describe a neural network (NN)-based block strategy, where an NN is optimized to replace one or multiple block(s) in a communication system. We apply this strategy to introduce a multi-user multiple-input multiple-output (MU-MIMO) detector that builds on top of an existing DL-based architecture. The key motivation is to replace the need for retraining on each new channel realization by a hypernetwork that generates optimized sets of parameters for the underlying DL detector. Second, we detail an end-to-end strategy, in which the transmitter and receiver are modeled as NNs that are jointly trained to maximize an achievable information rate. This approach allows for deeper optimizations, as illustrated with the design of waveforms that achieve high throughputs while satisfying peak-to-average power ratio (PAPR) and adjacent channel leakage ratio (ACLR) constraints. Lastly, we propose a hybrid strategy, where multiple DL components are inserted into a traditional architecture but trained to optimize the end-to-end performance. To demonstrate its benefits, we propose a DL-enhanced MU-MIMO receiver that both enable lower bit error rates (BERs) compared to a conventional receiver and remains scalable to any number of users.

Each approach has its own strengths and shortcomings. While the first one is the easiest to implement, its individual block optimization does not ensure the overall system optimality. On the other hand, systems designed with the second approach are computationally complex and do not comply with current standards, but allow the emergence of new opportunities such as high-dimensional constellations and pilotless transmissions. Finally, even if the block-based architecture of the third approach prevents deeper optimizations, the combined flexibility and end-to-end performance gains motivate its use for short-term practical implementations.

 

 

Jury

    • Reviewer: Didier LE RUYET, Professor, CNAM, Paris, France
    • Reviewer: Charlotte LANGLAIS, Permanent Research Staff, IMT Atlantique, Brest, France
    • Examiner: Inbar FIJALKOW, Professor, ENSEA, Cergy, France
    • Examiner: Stephan TEN BRINK, Professor, University of Stuttgart, Stuttgart, Allemagne
    • Thesis supervisor: Jean-Marie GORCE, Professor, INSA Lyon, Villeurbanne, France
    • Thesis co-supervisor: Jakob HOYDIS, Principal Research Scientist, Nvidia, France*
    • Thesis co-supervisor Fayçal AIT AOUDIA, Senior Research Scientist, Nvidia, France*

* Previously at Nokia Bell Labs France


CITI seminar – Frédéric Prost (UGA) – 27/01 at 14:00

Title: Projet Samildanach – Communication Scientifique dans une Culture Digitale

Date and Place: January 27th 14h, Amphi Chappe

Lien visio : https://insa-lyon-fr.zoom.us/j/97446195543

Speaker: Dr. Frédéric Prost (UGA)

 

Abstract: 

L’objectif de ce séminaire est de présenter un projet relatif à la communication scientifique. L’idée est d’utiliser plusieurs technologies récentes comme la blockchain, les tables de hachage décentralisées ainsi que les calculs de réputation issus des réseaux sociaux pour les appliquer au domaine de la publication scientifique. La blockchain est utilisée pour certifier,présenter une résistance à la censure, assurer la non répudiation et une structure d’incitations pour le  développement du réseau (rémunération des acteurs qui aident le réseau).   Ce projet est intrinsèquement multi-disciplinaire et au croisement de nombreuses technologies et domaines de l’informatique.

Biographie:

Frédéric Prost est MdC à l’université Grenoble Alpes et au laboratoire LIG. Il a  principalement travaillé dans la théorie des langages de programmation (réécriture de graphes, sémantique des langages d’interrogation des BD graphes) et les problématiques de confidentialité (analyse de non interférence, anonymisation de bases de données graphe).

 

 

 


PhD Defence: “On the Performance of Spatial Modulation and Full Duplex Radio Architectures”, Yanni Zhou, Amphitheater, CITI, 10th of December 2021 at 10:00 PM

The defense will be held in the Amphitheater, Claude Chappe, and will be streamed live here.

 

Title

On the Performance of Spatial Modulation and Full Duplex Radio Architectures

 

 

Abstract

Index modulation techniques have exhibited great potential in the scenarios foreseen in next-generation wireless networks. Applying in the spatial domain, spatial modulation (SM) as a single radio-frequency (RF) multiple-input–multiple-output (MIMO) solution has attracted wide attention. The SM system has only one transmitting antenna activated for each time slot which results in low system complexity and cost. It exploits the index of the transmitting antennas to convey additional information bits.

To analyze the SM performance, a simulated framework over the time-varying Rician fading channel is built with ADS and Matlab software and a channel state information (CSI) detector is highlighted. The simulation results are verified by the experimental implementation based on the National Instruments (NI) PXI chassis hardware and LabVIEW programming environment. In the practical analysis, two models of the propagation environments are considered, where a channel sounding method is employed in order to extract the channel coefficients.

Despite issues on system complexity and cost, a shortage of spectrum resources can also restrict the development of mobile communications technology. Full duplex (FD) communications have been developed to double the radio link data rate and spectral efficiency through simultaneous and bidirectional communication. The main challenge of FD systems is self-interference (SI), which is caused by the coupling of the transmitting antenna with the receiving one. The combination of FD and SM will not only maintain spectral efficiency but also decrease the complexity of the self-interference cancellation (SIC) because of the single RF chain.

Based on these, a full duplex spatial modulation (FDSM) system is proposed as well as the SIC method. Moreover, the impact of SIC accuracy on the system performance is studied. We focus on the FDSM system imperfections including IQ imbalance, phase noise, power amplifier (PA) nonlinearities and RF switch nonidealities. The bit error rate (BER) performance under different scenarios with these imperfections is analyzed, along with the estimation and cancellation method.

 

 

Jury

  • Marco Di Renzo, Research Director at CNRS, Reviewer
  • Matthieu Crussière, Professor at INSA-Rennes, Reviewer
  • Christelle Aupetit-Berthelemot, Professor at Université de Limoges, Examiner
  • Taneli Riihonen, Associate Professor at Tampere University, Examiner
  • Jean-Marie Gorce, Professor at INSA-Lyon, Examiner
  • Dinh-Thuy Phan-Huy, Engineer at Orange Lab, Examiner
  • Guillaume Villemaud, Associate Professeur at INSA-Lyon, Thesis director
  • Florin-Doru Hutu, Associate Professor at INSA-Lyon, Co-director

CITI seminar – Miriam Kolar (Stanford University) – 16/11 at 14:00

Title: Human Centered Archaeoacoustics

Date and Place: November 16th 14h00-16h00, Amphi Chappe (CITI Lab, INSA-Lyon, Batiment Claude Chappe), 6 avenue des arts, 69621 Villeurbanne

Speaker: Dr. Miriam Kolar (Adjunct professor at Stanford University)

 

Abstract: 

Despite over 125 years of modern room acoustics, spatial acoustics has only recently been applied in archaeological research. Likewise, musical instrument acoustics remains a growing but infrequent archaeometric approach. Auditory science is even less frequently incorporated. Since 2008, Miriam Kolar has led archaeoacoustics fieldwork and instrument performance studies at the 3,000-year-old UNESCO World Heritage Centre archaeological complex Chavín de Huántar, Perú, with a second Andean project about sound as an Inca administrative tool. In this presentation, Dr. Kolar will share case-study examples from her work in developing methodologies for “human-centered” archaeometric research, relating acoustics to human experience and social behavior in ancient contexts. Acoustical and psychoacoustical experiments in archaeological settings and with artifact sound-producing instruments enable data-driven reconstructions of heritage sites and instruments in use. Physics-based evaluations of human sensory perspectives support the ecological validity of heritage acoustics, opening a new technological frontier for cultural heritage research, preservation, and knowle.

 

Biography:

Miriam A. Kolar, M.F.A., Ph.D., is an Adjunct Professor at the Center for Computer Research in Music and Acoustics (CCRMA) at Stanford University and a Visiting Professor at Amherst College (USA). She studies human-sonic interrelationships across time and geography, applying acoustical and auditory perceptual science methodologies within an anthropological framework. Principal investigator of the integrative archaeoacoustics project at the 3,000-year-old Andean ceremonial site and UNESCO World Heritage Centre Chavín de Huántar, Peru, Dr. Kolar collaborates on novel applications of digital technologies for cultural heritage research and engagement. Her cultural acoustics research (http://culturalacoustics.org) leverages cross-disciplinary theories and tools to understand sonic experiential aspects of past and present life. In current work, and as co-organizer of the NEH-supported Digital Aural Heritage project (http://auralheritage.org), she explores the potential of auralizations for scholarship and public interfacing. Topics of interest include contextual knowledge representation, information ethics, and ecological validity.

This talk is organized in the context of the creation of the Emeraude team which is a collaboration between the Grame institute and Citi-lab at Insa-Lyon (https://team.inria.fr/emeraude/).

 


Health and Privacy-Preserving Machine Learning Workshop, 28th of October 2021 at 2pm, CITI, Campus de la Doua, Lyon

Health and Privacy-Preserving Machine Learning Workshop 2021

La nature sensible des informations de santé est un aspect critique pour le déploiement de l’Intelligence Artificielle (IA) en médecine. L’apprentissage automatique préservant la confidentialité et l’apprentissage fédéré deviennent des paradigmes d’apprentissage centraux pour garantir la confidentialité, la propriété et la sécurité des données médicales sensibles dans les applications d’IA. Malgré les récentes avancées techniques et méthodologiques, l’adoption efficace de paradigmes d’apprentissage sécurisés dans des applications de soins de santé présente encore des défis non résolus et pose des questions juridiques et éthiques. Ce workshop traitera de ces problématiques à travers plusieurs présentations d’expert du domaine.

Inscription sur le site web du workshop.

Retransmission du workshop ici : https://bbb.insa-lyon.fr/b/bou-dzs-2hg-otl