A Dictionary based Informational Genome Analysis

Alberto Castellini, Giuditta Franco, and Vincenzo Manca


Homo sapiens chr. 1

  • Length: 247.000.000 bp


  • Multiplicity-Comultiplicity 6-distribution: "multiplicity is the number of occurrences of single 6-words" while "comultiplicity is the number of different 6-words having a given occurrence"

  • Cardinality trends of genomic k-dictionaries: all k-mers, k-hapaxes, and k-repeats