Bit DNA Squeezer (BDNAS) : A Unique Technique for Dna Compression

Authors

  • Alam Jahaan  Computer Science, PERIYAR EVR College, Trichy, Tamil Nadu, India
  • Dr. T. N. Ravi  Assistant Professor in Computer Science, PERIYAR EVR College, Trichy, Tamil Nadu, India
  • Dr. S. Panneer Arokiaraj  Associate Professor, Computer Science, PERIYAR EVR College, Trichy, Tamil Nadu, India

Keywords:

DNA Sequences, Bit-Based Model, Position Map, Compression, Decompression, Datasets

Abstract

Data compression plays a vital role in analyzing, decreasing and transferring DNA sequences, hence escalating the creation of DNA compression techniques in order to store and transfer tremendous amount of genomic data. A fresh flow of interest in the development of novel algorithms and tools for storing and managing genomic sequences highlights the increasing demand for efficient methods for DNA compression. This is basically, the motivating force behind the development of high-performance compression tools designed specifically for genomic data. Most of the earlier DNA compression methods were essentially dictionary-based methods or statistical methods. Recently 2-bit coding methods have become prominent where the 4 nucleotide bases {A, C, G, T} in DNA sequences are assigned values 00, 01, 10 and 11 respectively. In this paper an attempt has been made to present an approach where a single bit (0 or 1) is assigned for each nucleotide base {A,C,G,T} in a DNA sequence depending on the count of each nucleotide. This proposed technique compresses large bytes of DNA sequences with the average compression ratio of approximately 1.4 bits per base.

References

  1. Alam Jahaan, Dr T.N. Ravi, Dr. S. Panneer Arokiaraj, A Comparative Study and Survey on Existing DNA Compression Techniques, IJARCS, p-ISSN: 0976-5697, volume8,No.3, March-April2017. Online:www.ijarcs.info.
  2. Satyanvesh, D., Balleda, K., Padyana, A., et al., 2012, GenCodex - A Novel Algorithm for Compressing DNA sequences on Multi-cores and GPUs, Proc. IEEE, 19th International Conf. on High Performance Computing (HiPC), Pune, India, No 37
  3. Nour S. Bakr et al.: "DNA Lossless Compression Algorithms: Review", American Journal of Bioinformatics Research, p-ISSN: 2167-6992    e-ISSN: 2167-6976, 2013;  3(3): 72-81, doi:10.5923/j.bioinformatics.20130303.04
  4. Rajeswari, P. R., and Apparao, A., 2010, Genbit Compress Tool (GBC): A Java-Based Tool To Compress DNA Sequences and Compute Compression Ratio (BITS/BASE) Of Genomes, International Journal of Computer Science and Information Technology, 2(3), 181-191
  5. Afify, H., Islam, M., Abdel-Wahed, M., et al., 2010, Genomic Sequences Differential Compression Model, Proc., 27th National Radio Science Conf., Egypt
  6. Rajeswari, P. R., and Apparao, A., 2011, DNABIT Compress - Genome compression algorithm, Bioinformation, 5(8), 350-360
  7. Prasad, V. H., and Kumar, P. V., 2012, A New Revised DNA Cramp Tool Based Approach of Chopping DNA Repetitive and Non-Repetitive Genome Sequences, International Journal of Computer Science Issues (IJCSI), 9(6), 448-454.
  8. Prasad, V. H., 2013, A new revisited compression technique through innovation partition group binary compression: a novel approach, International Journal of Computer Engineering & Technology (IJCET), 4(2), 94-101.
  9. Alam Jahaan ,Dr T.N. Ravi, "Scrutiny Of Lossless Compression Techniques Using A Few Quality Measures", International Journal Of Advanced Research In Computer Science And Applications Issn 2321- 872x, Volume 4, Issue 3, March 2016.
  10. S. R. Kodituwakku Et. Al. "Comparison Of Lossless Data Compression Algorithms For Text Data", Indian Journal Of Computer Science And Engineering, Vol 1 No 4 416-425
  11. S. Grumbach and F. Tahi, "Compression of DNA Sequences," in Proc. of the Data Compression Conf., (DCC '93), 1993, 340-350.

Downloads

Published

2017-08-31

Issue

Section

Research Articles

How to Cite

[1]
Alam Jahaan, Dr. T. N. Ravi, Dr. S. Panneer Arokiaraj, " Bit DNA Squeezer (BDNAS) : A Unique Technique for Dna Compression , IInternational Journal of Scientific Research in Computer Science, Engineering and Information Technology(IJSRCSEIT), ISSN : 2456-3307, Volume 2, Issue 4, pp.512-517, July-August-2017.