Bit DNA Squeezer (BDNAS): A Unique Technique for DNA Compression
Keywords:
DNA Sequences, Bit-Based Model, Position Map, Compression, Decompression, Datasets
Abstract
Data compression plays a vital role in analyzing, reducing the size of, and transferring DNA sequences, which has driven the creation of DNA compression techniques for storing and transferring tremendous amounts of genomic data. Renewed interest in novel algorithms and tools for storing and managing genomic sequences highlights the increasing demand for efficient DNA compression methods. This demand is the motivating force behind the development of high-performance compression tools designed specifically for genomic data. Most earlier DNA compression methods were essentially dictionary-based or statistical. Recently, 2-bit coding methods have become prominent, in which the four nucleotide bases {A, C, G, T} of a DNA sequence are assigned the values 00, 01, 10 and 11 respectively. This paper presents an approach in which a single bit (0 or 1) is assigned to each nucleotide base {A, C, G, T} in a DNA sequence, depending on the count of each nucleotide. The proposed technique compresses large DNA sequences with an average compression ratio of approximately 1.4 bits per base.
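For orientation, the conventional 2-bit coding mentioned in the abstract can be sketched as follows. This is a minimal illustration of that baseline only, not of the BDNAS method, whose bit assignment and position map are not specified here. The function names `pack_2bit`, `unpack_2bit` and `bits_per_base` are hypothetical, and the sketch assumes the input contains only the characters A, C, G and T.

```python
# Illustrative sketch of the conventional 2-bit baseline (A=00, C=01, G=10, T=11).
# This is NOT the BDNAS algorithm; all names here are hypothetical.
CODES = {"A": 0b00, "C": 0b01, "G": 0b10, "T": 0b11}
BASES = "ACGT"  # index in this string equals the 2-bit code


def pack_2bit(seq: str) -> bytes:
    """Pack a DNA string into bytes, four bases per byte."""
    out = bytearray()
    for i in range(0, len(seq), 4):
        chunk = seq[i:i + 4]
        byte = 0
        for base in chunk:
            byte = (byte << 2) | CODES[base]
        # Left-align a short final chunk; the unused low bits stay zero.
        byte <<= 2 * (4 - len(chunk))
        out.append(byte)
    return bytes(out)


def unpack_2bit(data: bytes, n_bases: int) -> str:
    """Recover the first n_bases bases from packed bytes."""
    bases = []
    for byte in data:
        for shift in (6, 4, 2, 0):
            bases.append(BASES[(byte >> shift) & 0b11])
    # The original length must be stored separately to drop the padding.
    return "".join(bases[:n_bases])


def bits_per_base(compressed_size_bytes: int, n_bases: int) -> float:
    """Compression ratio expressed in bits per base."""
    return 8 * compressed_size_bytes / n_bases


if __name__ == "__main__":
    seq = "ACGTACGT"
    packed = pack_2bit(seq)
    assert unpack_2bit(packed, len(seq)) == seq
    print(bits_per_base(len(packed), len(seq)))
```

Plain 2-bit packing of this kind yields 2 bits per base when the sequence length is a multiple of four, so the reported average of approximately 1.4 bits per base is below that baseline.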
License
Copyright (c) IJSRCSEIT

This work is licensed under a Creative Commons Attribution 4.0 International License.