Description
- Assembly Name: Linum_perenne_primary_assembly_n_2025_03
- Assembly Type: haploid
- Assembly Level: contig
- Genome Representation: full
- Accession: GCA_965231395.1
Assembly Statistics
| Statistic | Value |
|---|---|
| Assembly length (Mbp) | 773.53 |
| GC % | 41.45 |
| Contig # | 807 |
| Contig N50 (Mbp) | 23.40 |
| Contig L50 | 10 |
| Contig N90 (Mbp) | 4.61 |
| Contig L90 | 34 |
| Scaffold # | 807 |
| Scaffold N50 (Mbp) | 23.40 |
| Scaffold L50 | 10 |
| Scaffold N90 (Mbp) | 4.61 |
| Scaffold L90 | 34 |
| Scaffolds >= 10 kb | 807 |
| BUSCO % (eudicots_odb10) | C:91.6% [S:86.2%, D:5.4%], F:1.2%, M:7.2%, n:2326 |
Annotation Statistics
| Statistic | Value |
|---|---|
| Gene # | 45373 |
| Transcript # | 139586 |
| Avg exons per transcript | 5.3 |
| Avg gene length (bp) | 2803 |
| Avg transcript length (bp) | 3074 |
| Avg exon length (bp) | 279 |
| Avg intron length (bp) | 352 |
Notes: Assembly statistics were calculated for the primary genome assembly GCA_965231395.1 (CBDDTU01.fasta.gz) using Quast (v5.2.0; Mikheenko et al. 2018). Busco statistics (Manni et al., 2021) were provided by the authors. Annotation statistics were calculated for GCA_965231395.1_Linum_perenne_primary_assembly_n_2025_03_genomic.gff.gz using AGAT (v1.4.1; Dainat, 2024).
BUSCO notation: C: Complete; S: Single-copy; D: Duplicated; F: Fragmented; M: Missing; n: Total BUSCO genes included in the dataset (here: eudicots_odb10). See also the official BUSCO manual.
BUSCO statistics for the haplotype assemblies:
Linum perenne hap1 assembly: C:91.2% [S:86.3%, D:4.9%], F:1.4%, M:7.4%, n:2326
Linum perenne hap2 assembly: C:84.8% [S:80.7%, D:4.1%], F:5.2%, M:10.0%, n:2326
Publication(s)
The data for Linum perenne displayed in the genome portal comes from:
Zervakis, P.-I., Postel, Z., Losvik, A., Fracassetti, M., Solér, L., Proux-Wéra, E., Bunikis, I., Churcher, A., & Slotte, T. (2025). Genomic studies in Linum shed light on the evolution of the distyly supergene and the molecular basis of convergent floral evolution. New Phytologist, 247(6), 2964–2981. https://doi.org/10.1111/nph.70392The tools used to calculate the statistics shown on top of this page are described in:
Dainat J. (2024). AGAT: Another Gff Analysis Toolkit to handle annotations in any GTF/GFF format. (Version v1.4.1). Zenodo. https://www.doi.org/10.5281/zenodo.3552717
Manni, M., Berkeley, M. R., Seppey, M., Simão, F. A., & Zdobnov, E. M. (2021). BUSCO Update: Novel and Streamlined Workflows along with Broader and Deeper Phylogenetic Coverage for Scoring of Eukaryotic, Prokaryotic, and Viral Genomes. Molecular Biology and Evolution, 38(10), 4647–4654. https://doi.org/10.1093/molbev/msab199
Mikheenko, A., Prjibelski, A., Saveliev, V., Antipov, D., & Gurevich, A. (2018). Versatile genome assembly evaluation with QUAST-LG. Bioinformatics, 34(13), i142–i150. https://doi.org/10.1093/bioinformatics/bty266
Funding
The study in which the genome data was generated (Zervakis et al., 2025) acknowledges funding from:
- Supergene evolution in a classic plant system - bringing the study of distyly into the genomic era (Grant agreement ID: 757451) European Research Council
- AdaptiveTarget: Accessing haplotype variation at complex loci with optimized targeting and adaptive sampling (Grant agreement ID: 101132305) European Research Council
- The role of structural variation for the origin and evolution of a classic supergene (Grant number: 2019-04452_VR) Swedish Research Council
- Convergent evolution and loss of a classic floral polymorphism – genetic causes and genomic consequences (Grant number: 2023-04532_VR) Swedish Research Council
- Carl Tryggers foundation grant (CTS21:1471)
- Erik Philip-Sörensen foundation
- Nilsson-Ehle foundation
Page last updated: 16/03/2026
