Banner background

Genome assembly

Species

Linum perenne L.

Blue flax

Description

  • Assembly Name: Linum_perenne_primary_assembly_n_2025_03
  • Assembly Type: haploid
  • Assembly Level: contig
  • Genome Representation: full
  • Accession: GCA_965231395.1

Assembly Statistics

StatisticValue
Assembly length (Mbp)773.53
GC %41.45
Contig #807
Contig N50 (Mbp)23.40
Contig L5010
Contig N90 (Mbp)4.61
Contig L9034
Scaffold #807
Scaffold N50 (Mbp)23.40
Scaffold L5010
Scaffold N90 (Mbp)4.61
Scaffold L9034
Scaffolds >= 10 kb807
BUSCO % (eudicots_odb10)C:91.6% [S:86.2%, D:5.4%], F:1.2%, M:7.2%, n:2326

Annotation Statistics

StatisticValue
Gene #45373
Transcript #139586
Avg exons per transcript5.3
Avg gene length (bp)2803
Avg transcript length (bp)3074
Avg exon length (bp)279
Avg intron length (bp)352

Notes: Assembly statistics were calculated for the primary genome assembly GCA_965231395.1 (CBDDTU01.fasta.gz) using Quast (v5.2.0; Mikheenko et al. 2018). Busco statistics (Manni et al., 2021) were provided by the authors. Annotation statistics were calculated for GCA_965231395.1_Linum_perenne_primary_assembly_n_2025_03_genomic.gff.gz using AGAT (v1.4.1; Dainat, 2024).

BUSCO notation: C: Complete; S: Single-copy; D: Duplicated; F: Fragmented; M: Missing; n: Total BUSCO genes included in the dataset (here: eudicots_odb10). See also the official BUSCO manual.

BUSCO statistics for the haplotype assemblies:

  • Linum perenne hap1 assembly: C:91.2% [S:86.3%, D:4.9%], F:1.4%, M:7.4%, n:2326

  • Linum perenne hap2 assembly: C:84.8% [S:80.7%, D:4.1%], F:5.2%, M:10.0%, n:2326

Publication(s)

The data for Linum perenne displayed in the genome portal comes from:

Zervakis, P.-I., Postel, Z., Losvik, A., Fracassetti, M., Solér, L., Proux-Wéra, E., Bunikis, I., Churcher, A., & Slotte, T. (2025). Genomic studies in Linum shed light on the evolution of the distyly supergene and the molecular basis of convergent floral evolution. New Phytologist, 247(6), 2964–2981. https://doi.org/10.1111/nph.70392

The tools used to calculate the statistics shown on top of this page are described in:

  • Dainat J. (2024). AGAT: Another Gff Analysis Toolkit to handle annotations in any GTF/GFF format. (Version v1.4.1). Zenodo. https://www.doi.org/10.5281/zenodo.3552717

  • Manni, M., Berkeley, M. R., Seppey, M., Simão, F. A., & Zdobnov, E. M. (2021). BUSCO Update: Novel and Streamlined Workflows along with Broader and Deeper Phylogenetic Coverage for Scoring of Eukaryotic, Prokaryotic, and Viral Genomes. Molecular Biology and Evolution, 38(10), 4647–4654. https://doi.org/10.1093/molbev/msab199

  • Mikheenko, A., Prjibelski, A., Saveliev, V., Antipov, D., & Gurevich, A. (2018). Versatile genome assembly evaluation with QUAST-LG. Bioinformatics, 34(13), i142–i150. https://doi.org/10.1093/bioinformatics/bty266

Funding

The study in which the genome data was generated (Zervakis et al., 2025) acknowledges funding from:

  • Supergene evolution in a classic plant system - bringing the study of distyly into the genomic era (Grant agreement ID: 757451) European Research Council
  • AdaptiveTarget: Accessing haplotype variation at complex loci with optimized targeting and adaptive sampling (Grant agreement ID: 101132305) European Research Council
  • The role of structural variation for the origin and evolution of a classic supergene (Grant number: 2019-04452_VR) Swedish Research Council
  • Convergent evolution and loss of a classic floral polymorphism – genetic causes and genomic consequences (Grant number: 2023-04532_VR) Swedish Research Council
  • Carl Tryggers foundation grant (CTS21:1471)
  • Erik Philip-Sörensen foundation
  • Nilsson-Ehle foundation

Page last updated: 16/03/2026