Thalassomics Report

Toolbox

MultiQC Toolbox

Highlight Samples

Regex mode off

Rename Samples

Click here for bulk input.

Paste two columns of a tab-delimited table here (eg. from Excel).

First column should be the old name, second column the new name.

Regex mode off

Show / Hide Samples

Regex mode off

Explain with AI

Configure AI settings to get explanations of plots and data in this report.

Anonymize samples off

Aspect ratio

Plot scaling

Download the raw data used to create the plots in this report below:

Format:

Note that additional data was saved in multiqc_data when this report was generated.

Choose Plots

If you use plots from MultiQC in a publication or presentation, please cite:

MultiQC: Summarize analysis results for multiple tools and samples in a single report
Philip Ewels, Måns Magnusson, Sverker Lundin and Max Käller
Bioinformatics (2016)
doi: 10.1093/bioinformatics/btw354
PMID: 27312411

Settings are automatically saved. You can also save named configurations below.

Save Settings

You can save the toolbox settings for this report to the browser or as a file.

Load Settings

Choose a saved report profile from the browser or load from a file:

Tool Citations

Please remember to cite the tools that you use in your analysis.

To help with this, you can download publication details of the tools mentioned in this report:

About MultiQC

This report was generated using MultiQC, version 1.29

You can see a YouTube video describing how to use MultiQC reports here: https://youtu.be/qPbIlO_KWN0

For more information about MultiQC, including other videos and extensive documentation, please visit http://multiqc.info

You can report bugs, suggest improvements and find the source code for MultiQC on GitHub: https://github.com/MultiQC/MultiQC

MultiQC is published in Bioinformatics:

MultiQC: Summarize analysis results for multiple tools and samples in a single report
Philip Ewels, Måns Magnusson, Sverker Lundin and Max Käller
Bioinformatics (2016)
doi: 10.1093/bioinformatics/btw354
PMID: 27312411

MultiQC is developed by:

REPORTE DE MUESTRA

Los datos para este muestra provienen del artículo A comprehensive comparison of RNA-Seq-based transcriptome analysis from reads to differential gene expression and cross-comparison with microarrays: a case study in Saccharomyces cerevisiae de Nookaew et al., en el cual se estudia la cepa de S. cerevisiae CEN.PK 113-7D (levadura) bajo dos condiciones metabólicas distintas: exceso de glucosa (batch) y limitación de glucosa (chemostat).

Nombre: Thalassomics
Contacto: info@thalassomics.com
Sitio Web: https://thalassomics.com
Tipo de proyecto: RNA-seq
Platafora: HiSeq 2500 High Output V4
Setup: 2x125

JavaScript Disabled

MultiQC reports use JavaScript for plots and toolbox functions. It looks like you have JavaScript disabled in your web browser. Please note that many of the report functions will not work as intended.

Resumen general

Showing ¹²/₁₂ rows and ⁶/₁₆ columns.

Sample Name	% Duplication	% > Q30	Mb Q30 bases	Reads After Filtering	GC content	% PF	% Adapter	Error rate	Non-primary	Reads mapped	% Mapped	% Proper pairs	% MapQ 0 reads	Total seqs	Mean insert	% Aligned
batch1								0.16%	0.0M	0.1M	99.9%	99.2%	0.0%	0.1M	167.6bp	99.9%
batch1_raw	0.0%	92.8%	7.6Mb	0.1M	43.4%	99.4%	1.5%
batch2								0.15%	0.0M	0.1M	99.9%	99.4%	0.0%	0.1M	172.8bp	99.9%
batch2_raw	0.0%	92.9%	10.6Mb	0.1M	43.4%	100.0%	0.6%
batch3								0.16%	0.0M	0.1M	99.9%	99.3%	0.0%	0.1M	168.8bp	99.8%
batch3_raw	0.0%	92.8%	8.1Mb	0.1M	43.6%	99.5%	1.4%
chem1								0.26%	0.0M	0.1M	99.9%	98.9%	0.0%	0.1M	166.1bp	99.9%
chem1_raw	0.0%	92.9%	6.7Mb	0.1M	43.4%	100.0%	0.7%
chem2								0.25%	0.0M	0.1M	99.9%	99.0%	0.0%	0.1M	172.7bp	99.9%
chem2_raw	0.0%	93.1%	8.4Mb	0.1M	43.3%	100.0%	0.5%
chem3								0.25%	0.0M	0.1M	99.9%	99.1%	0.0%	0.1M	172.3bp	99.9%
chem3_raw	0.0%	93.1%	10.1Mb	0.1M	43.4%	100.0%	0.6%

Uncheck the tick box to hide columns. Click and drag the handle on the left to change order. Table ID: general_stats_table_table

Sort	Group	Column	Description	ID	Scale
\|\|	fastp	% Duplication	Duplication rate before filtering	`fastp-pct_duplication`
\|\|	fastp	% > Q30	Percentage of reads > Q30 after filtering	`fastp-after_filtering_q30_rate`
\|\|	fastp	Mb Q30 bases	Bases > Q30 after filtering (millions)	`fastp-after_filtering_q30_bases`	base_count
\|\|	fastp	Reads After Filtering	Total reads after filtering (millions)	`fastp-filtering_result_passed_filter_reads`	read_count
\|\|	fastp	GC content	GC content after filtering	`fastp-after_filtering_gc_content`
\|\|	fastp	% PF	Percent reads passing filter	`fastp-pct_surviving`
\|\|	fastp	% Adapter	Percentage adapter-trimmed reads	`fastp-pct_adapter`
\|\|	Samtools: stats	Error rate	Error rate: mismatches (NM) / bases mapped (CIGAR)	`samtools_stats-error_rate`
\|\|	Samtools: stats	Non-primary	Non-primary alignments (millions)	`samtools_stats-non_primary_alignments`	read_count
\|\|	Samtools: stats	Reads mapped	Reads mapped in the bam file (millions)	`samtools_stats-reads_mapped`	read_count
\|\|	Samtools: stats	% Mapped	% Mapped reads	`samtools_stats-reads_mapped_percent`
\|\|	Samtools: stats	% Proper pairs	% Properly paired reads	`samtools_stats-reads_properly_paired_percent`
\|\|	Samtools: stats	% MapQ 0 reads	% of reads that are ambiguously placed (MapQ=0)	`samtools_stats-reads_MQ0_percent`
\|\|	Samtools: stats	Total seqs	Total sequences in the bam file (millions)	`samtools_stats-raw_total_sequences`	read_count
\|\|	Samtools: stats	Mean insert	Average insert size	`samtools_stats-insert_size_average`
\|\|	Bowtie 2 / HiSAT2	% Aligned	overall alignment rate	`bowtie_2_hisat2-overall_alignment_rate`

fastp

Version:


                      
                        0.23.4

Análisis de calidad de lecturas con Fastp.URL: https://github.com/OpenGene/fastpDOI: 10.1093/bioinformatics/bty560

Este módulo analiza la calidad de las lecturas utilizando Fastp, proporcionando estadísticas detalladas sobre la calidad de las secuencias.

Filtered Reads

Filtering statistics of sampled reads.

Created with MultiQC

Insert Sizes

Insert size estimation of sampled reads.

Created with MultiQC

Sequence Quality

Average sequencing quality over each base of all reads.

Created with MultiQC

GC Content

Average GC content over each base of all reads.

Created with MultiQC

N content

Average N content over each base of all reads.

Created with MultiQC

Samtools

Version:


                      
                        1.20

HTSlib:


                      
                        1.21

Análisis de archivos BAM con Samtools.URL: http://www.htslib.orgDOI: 10.1093/bioinformatics/btp352

Este módulo utiliza Samtools para analizar archivos BAM, proporcionando estadísticas sobre la alineación y cobertura de las lecturas.

Percent mapped

Alignment metrics from samtools stats; mapped vs. unmapped reads vs. reads mapped with MQ0.

For a set of samples that have come from the same multiplexed library, similar numbers of reads for each sample are expected. Large differences in numbers might indicate issues during the library preparation process. Whilst large differences in read numbers may be controlled for in downstream processings (e.g. read count normalisation), you may wish to consider whether the read depths achieved have fallen below recommended levels depending on the applications.

Low alignment rates could indicate contamination of samples (e.g. adapter sequences), low sequencing quality or other artefacts. These can be further investigated in the sequence level QC (e.g. from FastQC).

Reads mapped with MQ0 often indicate that the reads are ambiguously mapped to multiple locations in the reference sequence. This can be due to repetitive regions in the genome, the presence of alternative contigs in the reference, or due to reads that are too short to be uniquely mapped. These reads are often filtered out in downstream analyses.

Created with MultiQC

Alignment stats

This module parses the output from samtools stats. All numbers in millions.

Created with MultiQC

Sample Name	Total sequences	Mapped & paired	Properly paired	Duplicated	QC Failed	Reads MQ0	Mapped bases (CIGAR)	Bases Trimmed	Duplicated bases	Diff chromosomes	Other orientation	Inward pairs	Outward pairs
batch1	0.1M	0.1M	0.1M	0.0M	0.0M	0.0M	8.2Mb	0.0Mb	0.0Mb	0.0M	0.0M	0.0M	0.0M
batch2	0.1M	0.1M	0.1M	0.0M	0.0M	0.0M	11.4Mb	0.0Mb	0.0Mb	0.0M	0.0M	0.1M	0.0M
batch3	0.1M	0.1M	0.1M	0.0M	0.0M	0.0M	8.7Mb	0.0Mb	0.0Mb	0.0M	0.0M	0.0M	0.0M
chem1	0.1M	0.1M	0.1M	0.0M	0.0M	0.0M	7.2Mb	0.0Mb	0.0Mb	0.0M	0.0M	0.0M	0.0M
chem2	0.1M	0.1M	0.1M	0.0M	0.0M	0.0M	9.0Mb	0.0Mb	0.0Mb	0.0M	0.0M	0.0M	0.0M
chem3	0.1M	0.1M	0.1M	0.0M	0.0M	0.0M	10.8Mb	0.0Mb	0.0Mb	0.0M	0.0M	0.1M	0.0M

Uncheck the tick box to hide columns. Click and drag the handle on the left to change order. Table ID: samtools-stats-dp_table

Sort	Column	Description	ID	Scale
\|\|	Total sequences	Total sequences	`raw_total_sequences`	read_count
\|\|	Mapped & paired	Paired-end technology bit set + both mates mapped	`reads_mapped_and_paired`	read_count
\|\|	Properly paired	Proper-pair bit set	`reads_properly_paired`	read_count
\|\|	Duplicated	PCR or optical duplicate bit set	`reads_duplicated`	read_count
\|\|	QC Failed	QC Failed	`reads_QC_failed`	read_count
\|\|	Reads MQ0	Reads mapped and MQ=0	`reads_MQ0`	read_count
\|\|	Mapped bases (CIGAR)	Mapped bases (CIGAR)	`bases_mapped__cigar`	base_count
\|\|	Bases Trimmed	Bases Trimmed	`bases_trimmed`	base_count
\|\|	Duplicated bases	Duplicated bases	`bases_duplicated`	base_count
\|\|	Diff chromosomes	Pairs on different chromosomes	`pairs_on_different_chromosomes`	read_count
\|\|	Other orientation	Pairs with other orientation	`pairs_with_other_orientation`	read_count
\|\|	Inward pairs	Inward oriented pairs	`inward_oriented_pairs`	read_count
\|\|	Outward pairs	Outward oriented pairs	`outward_oriented_pairs`	read_count

Bowtie 2 / HiSAT2

Alineamiento de lecturas con Bowtie2.URL: http://bowtie-bio.sourceforge.net/bowtie2; https://ccb.jhu.edu/software/hisat2DOI: 10.1038/nmeth.1923; 10.1038/nmeth.3317; 10.1038/s41587-019-0201-4

Este módulo utiliza Bowtie2 para alinear las lecturas a un genoma de referencia, proporcionando estadísticas sobre el alineamiento.

Paired-end alignments

This plot shows the number of reads aligning to the reference in different ways.

Please note that single mate alignment counts are halved to tally with pair counts properly.

There are 6 possible types of alignment:

PE mapped uniquely: Pair has only one occurence in the reference genome.
PE mapped discordantly uniquely: Pair has only one occurence but not in proper pair.
PE one mate mapped uniquely: One read of a pair has one occurence.
PE multimapped: Pair has multiple occurence.
PE one mate multimapped: One read of a pair has multiple occurence.
PE neither mate aligned: Pair has no occurence.

Created with MultiQC

GffCompare

Version:


                      
                        0.12.9

Tool to compare, merge and annotate one or more GFF files with a reference annotation in GFF format.URL: https://ccb.jhu.edu/software/stringtie/gffcompare.shtmlDOI: 10.12688/f1000research.23297.1

Accuracy values

Displayed are the accuracy values precisiond and sensitivity for different levels of genomic features. The metrics are calculated for the comparison of a query and reference GTF file.

Accuracy metrics are calculated as described in Burset et al. (1996). Sensitivity is the true positive rate, Precision True Positives are query features that agree with features in the reference. The exact definition depends on the feature level:

Base: True positives are bases reported at the same coordinates.
Exon: Comparison units are exons that overlap in query and reference with same coordinates.
Intron chain: True positives are query transcripts for which all introns coordinates match those in the reference.
Transcript: More stringent then intron chain, all Exon coordinates need to match. Outer exon coordinates (start + end) can vary by 100 bases in default settings
Locus: Cluster of exons need to match.

A more in depth description can be found here.

Created with MultiQC

Novel features

Number of novel features, present in the query data but not found in the reference annotation.

Created with MultiQC

Missing features

False negative features, which are found in the reference annotation but missed (not present) in the query data.

Created with MultiQC

Matriz de distancia

Created with MultiQC

Mapa de calor de abundancias

Mapa de calor (heatmap) con los valores de abundancia de los genes/transcritos expresados diferencialmente (valores normalizados).

Created with MultiQC

Análisis PCA

Este gráfico muestra los resultados del Análisis de Componentes Principales (PCA) con valores normalizados (VSD) con colores por grupo.

Created with MultiQC

Gráfico Volcano

Gráfico de volcan los resultados del análisis de expresión diferencial.

Created with MultiQC

Software Versions

Software Versions lists versions of software tools extracted from file contents.

Group	Software	Version
GffCompare	GffCompare	`0.12.9`
Samtools	HTSlib	`1.21`
	Samtools	`1.20`
fastp	fastp	`0.23.4`

Toggle navigation Powered by: v1.29

MultiQC Toolbox

Apply Highlight Samples

Apply Rename Samples

Apply Show / Hide Samples

Explain with AI

Export Plots

Choose Plots

Save Settings

Load Settings

Tool Citations

About MultiQC

Resumen general

General Statistics: Columns

fastp

Filtered Reads

Insert Sizes

Sequence Quality

GC Content

N content

Samtools

Percent mapped Help

Alignment stats

Samtools: stats: Alignment Stats: Columns

Bowtie 2 / HiSAT2

Paired-end alignments Help

GffCompare

Accuracy values Help

Novel features

Missing features

Matriz de distancia

Mapa de calor de abundancias

Análisis PCA

Gráfico Volcano

Software Versions

Powered by:

v1.29

Highlight Samples

Rename Samples

Show / Hide Samples

Percent mapped

Paired-end alignments

Accuracy values