U.S. flag

An official website of the United States government

Format

Send to:

Choose Destination

Download Assembly



idEpiBalt1.1

Organism name:
Episyrphus balteatus (marmalade hoverfly)
BioSample:
SAMEA7520035
BioProject:
PRJEB54851
Submitter:
WELLCOME SANGER INSTITUTE
Date:
2022/09/13
Assembly type:
haploid (principal pseudohaplotype of diploid)
Assembly level:
Chromosome
Genome representation:
full
RefSeq category:
representative genome
GenBank assembly accession:
GCA_945859705.1 (latest)
RefSeq assembly accession:
GCF_945859705.1 (latest)
RefSeq assembly and GenBank assembly identical:
no (hide details)
  • Only in GenBank: chromosome MT
  • Data displayed for RefSeq version
WGS Project:
CAMAOR01
Genome coverage:
28x
Linked assembly:
GCA_945859675.1 (alternate pseudohaplotype of diploid)

IDs: 13799901 [UID] 35905078 [GenBank] 42517398 [RefSeq]

See Genome Information for Episyrphus balteatus

There are 3 assemblies for this organism

See more

History (Show revision history)

Comment

The assembly idEpiBalt1.1 is based on 28x PacBio data, 10X Genomics Chromium data, and Arima Hi-C data generated by the Darwin Tree of Life Project (https://www.darwintreeoflife.org/). The assembly process included the following sequence of steps: initial PacBio assembly generation ... with Hifiasm, retained haplotig separation with purge_dups, short-read polishing using FreeBayes-called variants from 10X Genomics Chromium reads aligned with LongRanger, and Hi-C based scaffolding with YaHS. The mitochondrial genome was assembled using MitoHiFi. Finally, the primary assembly was analysed and manually improved using rapid curation. Chromosome-scale scaffolds confirmed by the Hi-C data have been named in order of size.  more

Global statistics

Total sequence length535,345,329
Total ungapped length535,311,729
Gaps between scaffolds0
Number of scaffolds18
Scaffold N50133,640,122
Scaffold L502
Number of contigs186
Contig N505,909,399
Contig L5026
Total number of chromosomes and plasmids5
Number of component sequences (WGS or clone)18

Supplemental Content

Recent activity

Your browsing activity is empty.

Activity recording is turned off.

Turn recording back on

See more...

Global assembly definition

Download the full sequence report
Click on the table row to see sequence details in the table to the right
Assembly Unit Name
Primary Assembly
Assembly Unit: Primary Assembly (GCF_945859704.1)
Molecule nameGenBank sequenceRefSeq sequenceUnlocalized
sequences count
Chromosome 1OX244017.1=NC_079134.12
Chromosome 2OX244018.1=NC_079135.10
Chromosome 3OX244019.1=NC_079136.10
Chromosome 4OX244020.1=NC_079137.10
Chromosome XOX244021.1=NC_079138.11
unplacedn/an/an/a10

Assembly statistics

MoleculeSequence RoleTotal
Length
Scaffold
Count
Ungapped
Length
Scaffold
N50
Spanned
Gaps
Unspanned
Gaps
AllAssembled molecule535,345,32918535,311,729133,640,1221680
Chromosome 1AllAssembled moleculeUnlocalized scaffolds176,634,976176,573,26561,711312176,619,776176,558,06561,711176,573,265176,573,26535,06976760000
Chromosome 2Assembled molecule133,640,1221133,635,122133,640,122250
Chromosome 3Assembled molecule130,410,0381130,402,638130,410,038370
Chromosome 4Assembled molecule83,506,989183,501,98983,506,989250
Chromosome XAllAssembled moleculeUnlocalized scaffolds10,092,7019,603,207489,49421110,091,7019,602,207489,4949,603,2079,603,207489,494550000
unplacedAssembled molecule1,060,503101,060,503161,84500