A Dictionary based Informational Genome Analysis

Alberto Castellini, Giuditta Franco, and Vincenzo Manca

 
 

Homo sapiens chr. 1

  • Length: 247.000.000 bp

 

  • Multiplicity-Comultiplicity 6-distribution: "multiplicity is the number of occurrences of single 6-words" while "comultiplicity is the number of different 6-words having a given occurrence"



  • Cardinality trends of genomic k-dictionaries: all k-mers, k-hapaxes, and k-repeats