Utilize este identificador para referenciar este registo: https://hdl.handle.net/1822/79258

TítuloImproving analysis of meta-omics data with the MOSCA framework
Autor(es)Sequeira, João Carlos Sequeira
Rocha, Miguel
Alves, M. M.
Salvador, Andreia Filipa Ferreira
Palavras-chavemetagenomics
metatranscriptomics
metaproteomics
functional annotation
Data9-Mai-2022
CitaçãoSequeira, J. C.; Rocha, Miguel; Alves, M. Madalena; Salvador, Andreia F., Improving analysis of meta-omics data with the MOSCA framework. ICBM 2022 - 4th International Conference on Biogas Microbiology. No. OC-OM-06, Braga, Portugal, May 9-11, 44, 2022.
Resumo(s)Introduction: Meta-omics is an emergent field of research with many resources available in the form of databases and software. The information stored in databases is not always easily accessible, and software tools for meta-omics are often difficult to utilize. In this work, we present Meta-Omics Software for Community Analysis (MOSCA), a software framework that implements pipelines for the integrated analysis of metagenomics (MG), metatranscriptomics (MT) and metaproteomics (MP) data. This framework integrates tools allowing access to databases, handling of data and a complete workflow for meta-omics data analysis. Methodology and results: MOSCA was developed in Python 3, takes as input raw files obtained from Next-generation sequencing (in FastQ format), and from mass spectrometry (mass spectra in vendor or peak-picked formats), and integrates several tools for MG, MT and MP analysis. These tools are connected through their inputs/outputs by snakemake, in a fully automated workflow. MG analysis starts with preprocessing of sequencing reads, which automatically configures Trimmomatic to remove adapters and low-quality reads based on FastQC quality reports, and SortMeRNA for rRNA reads removal. Assembly can be performed with MetaSPAdes or Megahit and is followed by binning with MaxBin2 and CheckM for quality check. Genes are identified with FragGeneScan and are annotated with both UPIMAPI (homology-based annotation) and reCOGnizer (domain-based annotation), with reference to UniProt KB and eight databases included in the Conserved Domains Database, respectively. Bowtie2 is used to align reads to metagenomes. Protein identification and quantification can be performed with either SearchCLI coupled to PeptideShaker (performing peptide-to-spectrum matching and spectra count) or using MaxQuant (with quantification at the MS1 level). Differential gene expression analysis is performed with DESeq2, and heatmaps, volcano plots and PCA plots are generated. The expressed enzymes are plotted into hundreds of KEGG metabolic maps with the tool KEGGCharter, showing the metabolic functions that are differentially expressed and the taxonomic assignment. Tables, heatmaps and other representations obtained with MOSCA provide an interactive, accessible and comprehensive representation of the information obtained from MG, MT and MP analyses. Conclusions: MOSCA performs automatic analyses of MG, MT and MP datasets, integrating over 20 tools to obtain a comprehensive and easy to understand representation of microbial activity in different processes and conditions.
TipoResumo em ata de conferência
URIhttps://hdl.handle.net/1822/79258
Versão da editorahttps://www.ceb.uminho.pt/Events/Details/4296
Arbitragem científicayes
AcessoAcesso aberto
Aparece nas coleções:CEB - Resumos em Livros de Atas / Abstracts in Proceedings

Ficheiros deste registo:
Ficheiro Descrição TamanhoFormato 
document_55670_1.pdf520,33 kBAdobe PDFVer/Abrir

Partilhe no FacebookPartilhe no TwitterPartilhe no DeliciousPartilhe no LinkedInPartilhe no DiggAdicionar ao Google BookmarksPartilhe no MySpacePartilhe no Orkut
Exporte no formato BibTex mendeley Exporte no formato Endnote Adicione ao seu ORCID