Description
- Assembly Name: Linum_grandiflorum_primary_assembly_n_2025_03
- Assembly Type: haploid
- Assembly Level: contig
- Genome Representation: full
- Accession: GCA_965225045.1
Assembly Statistics
| Statistic | Value |
|---|---|
| Assembly length (Mbp) | 809.67 |
| GC % | 41.86 |
| Contig # | 646 |
| Contig N50 (Mbp) | 69.92 |
| Contig L50 | 5 |
| Contig N90 (Mbp) | 13.12 |
| Contig L90 | 12 |
| Scaffold # | 646 |
| Scaffold N50 (Mbp) | 69.92 |
| Scaffold L50 | 5 |
| Scaffold N90 (Mbp) | 13.12 |
| Scaffold L90 | 12 |
| Scaffolds >= 10 kb | 646 |
| BUSCO % (eudicots_odb10) | C:89.7% [S:85.6%, D:4.1%], F:1.3%, M:9.0%, n:2326 |
Annotation Statistics
| Statistic | Value |
|---|---|
| Gene # | 42070 |
| Transcript # | 176281 |
| Avg exons per transcript | 5.2 |
| Avg gene length (bp) | 2717 |
| Avg transcript length (bp) | 2860 |
| Avg exon length (bp) | 276 |
| Avg intron length (bp) | 319 |
Notes: Assembly statistics were calculated for the primary genome assembly (Lgrand_primary_v1.fasta.gz) using Quast (v5.2.0; Mikheenko et al. 2018). Busco statistics (Manni et al., 2021) were provided by the authors. Annotation statistics were calculated for Lgrand_primary_v1_genes.gff.gz using AGAT (v1.4.1; Dainat, 2024).
BUSCO notation: C: Complete; S: Single-copy; D: Duplicated; F: Fragmented; M: Missing; n: Total BUSCO genes included in the dataset (here: eudicots_odb10). See also the official BUSCO manual.
BUSCO statistics for the haplotype assemblies:
Linum grandiflorum hap1 assembly: C:89.7% [S:85.9%, D:3.8%], F:1.7%, M:8.6%, n:2326
Linum grandiflorum hap2 assembly: C:84.0% [S:79.6%, D:4.4%], F:5.0%, M:11.0%, n:2326.
Publication(s)
The data for Linum grandiflorum displayed in the genome portal comes from:
Zervakis, P.-I., Postel, Z., Losvik, A., Fracassetti, M., Solér, L., Proux-Wéra, E., Bunikis, I., Churcher, A., & Slotte, T. (2025). Genomic studies in Linum shed light on the evolution of the distyly supergene and the molecular basis of convergent floral evolution. New Phytologist, 247(6), 2964–2981. https://doi.org/10.1111/nph.70392The tools used to calculate the statistics shown on top of this page are described in:
Dainat J. (2024). AGAT: Another Gff Analysis Toolkit to handle annotations in any GTF/GFF format. (Version v1.4.1). Zenodo. https://www.doi.org/10.5281/zenodo.3552717
Manni, M., Berkeley, M. R., Seppey, M., Simão, F. A., & Zdobnov, E. M. (2021). BUSCO Update: Novel and Streamlined Workflows along with Broader and Deeper Phylogenetic Coverage for Scoring of Eukaryotic, Prokaryotic, and Viral Genomes. Molecular Biology and Evolution, 38(10), 4647–4654. https://doi.org/10.1093/molbev/msab199
Mikheenko, A., Prjibelski, A., Saveliev, V., Antipov, D., & Gurevich, A. (2018). Versatile genome assembly evaluation with QUAST-LG. Bioinformatics, 34(13), i142–i150. https://doi.org/10.1093/bioinformatics/bty266
Funding
The study in which the genome data was generated (Zervakis et al., 2025) acknowledges funding from:
- Supergene evolution in a classic plant system - bringing the study of distyly into the genomic era (Grant agreement ID: 757451) European Research Council
- AdaptiveTarget: Accessing haplotype variation at complex loci with optimized targeting and adaptive sampling (Grant agreement ID: 101132305) European Research Council
- The role of structural variation for the origin and evolution of a classic supergene (Grant number: 2019-04452_VR) Swedish Research Council
- Convergent evolution and loss of a classic floral polymorphism – genetic causes and genomic consequences (Grant number: 2023-04532_VR) Swedish Research Council
- Carl Tryggers foundation grant (CTS21:1471)
- Erik Philip-Sörensen foundation
- Nilsson-Ehle foundation
Page last updated: 16/03/2026
