Select the most conserved/variable regions across the Ebola genome
The scatter plots describe a sliding window analysis (30 nt windows with 1 nt of overlap; regions with multiple gaps not considered)
of the following measures of sequence conservation for each
multiple sequence alignment used the database:
-PIS: percentage of identical sites
-PPI: percentage of pairwise identity
Select the alignment to display in the graph below:
Alignment info
Alig01
Complete or near complete genome sequences from
different species of the Ebolavirus genus (Bundibugyo,
Reston, Sudan, Tai Forest and Zaire ebolavirus). Sequences
retrieved from the NCBI Entrez Nucleotide database.
Alig02
Alignment of 101 Ebola virus complete genomes used
by Gire et al. Science 345, 1369 (2014). The alignment includes
81 sequences from the 2014 outbreak and 20 sequences from earlier
outbreaks (from 1976 to 2008).
Alig03
Alignment of 124 ebolavirus genomes used by Gire et al. Science 345, 1369 (2014).
Alig04
Multiple sequence alignment of 158 Ebola virus and 2 Marbug virus
sequences featured in the UCSC Genome Browser. June 2014 (eboVir3)
Ebola Portal.
Sequences identified with accession numbers. Downloaded from here.
Alig05
Multiple sequence alignment of 158 Ebola virus and 2 Marbug virus
sequences featured in the UCSC Genome Browser. June 2014 (eboVir3)
Ebola Portal. Sequences identified with strain names. Downloaded from here.
Alig06
Multiple sequence alignment of 240 Ebola virus genomes featured in VIPR Ebola database.
Alig07
1610 full Ebola virus (EBOV) genomes sampled between 17 March 2014 and 24 October 2015 in West Africa and reference genome. Sequences were aligned using MAFFT (Katoh et al., 2002) and edited manually. Further details at Gytis Dudas et al. 2016
Define below a window to see on the multiple sequence alignment:
or use a window midpoint (roll over a dot on the graph) to visualize the corresponding section of the alignment
Click and drag on the graphs to zoom in
List of the most conserved genomic regions considering the PIS (30 nt windows):
|
List of the most conserved genomic regions considering the PPI (30 nt windows):
|