作者
Akash Nag, Sunil Karforma
发表日期
2017/9/8
期刊
International Journal of Education and Management Engineering
卷号
7
期号
5
页码范围
1-6
出版商
MECS Press, Hongkong
简介
This paper introduces a simple and fast lossless compression algorithm, called CAD, for the compression of protein sequences. The proposed algorithm is specially suited for compressing proteomes, which are the collection of all proteins expressed by an organism. Maintaining a changing dictionary of actively used aminoacid residues, the algorithm uses the adaptive dictionary together with Huffman coding to achieve an average compression rate of 3.25 bits per symbol, better than most other existing protein-compression and generalpurpose compression algorithms known to us. With an average compression ratio of 2.46: 1 and an average compression rate of 1.32 M residues/sec, our algorithm outperforms every other compression algorithm for compressing protein sequences in terms of the balance in compression-time and compression rate.
引用总数
201720182019202020211111
学术搜索中的文章