A Comprehensive Study on Application and Future Trends in Data Science

Authors

  • Manohar Vemula  MCA,MCPD,CSM, Software Engineer, Hyderabad, Telangana, India
  • Shaik Balasaidulu  MCA,Osmania University, Hyderabad, Telangana, India

Keywords:

Information, Data, Unstructured Data, Visualization, Management, Preservation

Abstract

Data Science refers to an emerging area of work concerned with the collection, preparation, analysis, visualization, management, and preservation of large collections of information. Although the name Data Science seems to connect most strongly with areas such as databases and computer science, many different kinds of skills including non-mathematical skills are also needed here. Data Science is much more than simply analyzing data. There are many people who enjoy analyzing data who could happily spend all day looking at histograms and averages, but for those who prefer other activities, data science offers a range of roles and requires a range of skills. Data science includes data analysis as an important component of the skill set required for many jobs in the area, but is not the only skill. Data scientists play active roles in the design and implementation work of four related areas such as data architecture, data acquisition, data analysis and data archiving. In the present paper the authors will try to explore the different issues, implementation and challenges in area called Data science.

References

  1. Dhar, V. (2013). "Data science and prediction". Communications of the ACM 56.
  2. Jeff Leek (2013-12-12). "The key word in 'Data Science' is not Data, it is Science". Simply Statistics.
  3. Hal Varian on how the Web challenges managers. http://www.mckinsey.com/insights/innovation/hal_varian_on_how_the_web_challenges_managers
  4. Parsons, MA, MJ Brodzik, and NJ Rutter. 2004. Data management for the cold land processes experiment: improving hydrological science.HYDROL PROCESS. 18:3637-653. http://www3.interscience.wiley.com/cgi-bin/jissue/109856902
  5. Data Munging with Perl. DAVID CROSS. MANNING. Chapter 1 Page 4.
  6. What is Data Science? http://www.datascientists.net/what-is-data-science
  7. The Data Science Venn Diagram. http://drewconway.com/zia/2013/3/26/the-data-science-venn-diagram
  8. Tukey, John W. The Future of Data Analysis. Ann. Math. Statist. 33 (1962), no.1,1--67. doi:10.1214/aoms/1177704711. http://projecteuclid.org/euclid.aoms/1177704711.
  9. Tukey, John W. (1977). Exploratory Data Analysis. Pearson. ISBN 978-0201076165.
  10. Peter Naur: Concise Survey of Computer Methods, 397 p. Studentlitteratur, Lund, Sweden, ISBN 91-44-07881-1, 1974
  11. KDD-89: IJCAI-89 Workshop on Knowledge Discovery in Databases. August 20, 1989, Detroit MI, USA
  12. Database Marketing Business Week. September 04, 1994
  13. From Data Mining to Knowledge Discovery in Databases. Usama Fayyad, Gregory Piatetsky-Shapiro, Padhraic Smyth. AI Magazine Volume 17 Number 3 (1996)
  14. "Statistics=Data Science?" C.F.Jeff Wu. University of Michigan, Ann Arbor. http://www2.isye.gatech.edu/~jeffwu/presentations/datascience.pdf
  15. Data Mining and Knowledge Discovery. ISSN: 1384-5810 (print version). ISSN: 1573-756X (electronic version). Journal no. 10618
  16. Mining Data for Nuggets of Knowledge Dec 10, 1999 Mining Data for Nuggets of Knowledge. http://knowledge.wharton.upenn.edu/article/mining-data-for-nuggets-of-knowledge/
  17. Data Science: An Action Plan for Expanding the Technical Areas of the Field of Statistics William S. Cleveland Statistics Research, Bell Labs. http://www.stat.purdue.edu/~wsc/papers/datascience.pdf
  18. Statistical Modelling: the two cultures Leo Breiman. Statistical Science. Vol. 16 No.3 (August 2001) 199-215.
  19. Davenport, Thomas H. (January 1, 2006). "Competing on Analytics". Harvard Business Review
  20. Glossary of Terms. Machine Learning - Special issue on applications of machine learning and the knowledge discovery process archive. Volume 30 Issue 2-3, Feb/March, 1998. Pages 271-274
  21. Bolton, R. & Hand, D. (2002). Statistical Fraud Detection: A Review (With Discussion). Statistical Science 17(3): 235–255.
  22. Neural data mining for credit card fraud detection. Brause, R.; Langsdorf, T.; Hepp, M. Tools with Artificial Intelligence, 1999. Proceedings. 11th IEEE International Conference on. Publication Year: 1999, Pages: 103-106
  23. ReinhardKlette (2014). Concise Computer Vision. Springer. ISBN 978-1-4471-6320-6.
  24. Hutchins, W. John; Somers, Harold L. (1992). An Introduction to Machine Translation. London: Academic Press. ISBN 0-12-362830-X.
  25. Akshi Kumar, Teeja Mary Sebastian. Sentiment Analysis on Twitter. IJCSI International Journal of Computer Science Issues, Vol. 9, Issue 4, No 3, July 2012. ISSN (Online): 1694-0814
  26. Raul Isea. The Present-Day Meaning of the Word Bioinformatics, Global Journal of Advanced Research, 2015. Vol-2, Issue-1 PP. 70-73. ISSN: 2394-5788.
  27. Brumfiel, Geoff (19 January 2011). "High-energy physics: Down the petabyte highway". Nature 469. pp. 282–83. doi:10.1038/469282a
  28. Matthew Francis. Future telescope array drives development of exabyte processing. http://arstechnica.com/science/2012/04/future-telescope-array-drives-development-of-exabyte-processing/
  29. "Supercomputing the Climate: NASA's Big Data Mission". CSC World. Computer Sciences Corporation. http://www.csc.com/cscworld/publications/81769/81773-supercomputing_the_climate_nasa_s_big_data_mission
  30. Hype Cycle for Emerging Technologies, 2013.

Downloads

Published

2018-10-30

Issue

Section

Research Articles

How to Cite

[1]
Manohar Vemula, Shaik Balasaidulu, " A Comprehensive Study on Application and Future Trends in Data Science, IInternational Journal of Scientific Research in Computer Science, Engineering and Information Technology(IJSRCSEIT), ISSN : 2456-3307, Volume 3, Issue 7, pp.239-246, September-October-2018.