About

Background

This project focuses on the statistical analysis of metagenomic data derived from 16S rRNA sequencing, conducted as part of a Master’s 1 thesis in Bioinformatics (MISO track) at the University of Lille.

The analysis is performed using reproducible workflows based on R, phyloseq, and Quarto, which allow for the structuring of the steps involved in processing, visualizing, and interpreting microbiome data.

Goal

The goal of this blog is to:

  • Provide a comprehensive analysis of 16S metagenomic data
  • Explore the taxonomic composition of microbial communities
  • Examine alpha and beta diversity across sample groups
  • Apply statistical methods suited to count data and their complex structure
  • Produce clear and interpretable visualizations (PCoA, taxonomic bar plots, etc.)
  • Establish a reproducible approach to adapt the pipeline to different datasets

Contributors

This project was carried out collaboratively, drawing on skills in biological data analysis, statistics, and bioinformatics.

The contributors are Ines Bakli, Paul Lemonnier, and Alden Sneath. We would also like to thank our referent, Clément Poupelin, for his guidance throughout this project.