Transforming Unstructured Data into Conceptual Representation Using WORDNET

Authors(2) :-C. Bhargavi, Dr. A. Brahmananda Reddy

Transcript evacuation is an expanding current field with the aim of exercises toward accumulate essential in grouping as of typical words preparing term. It may be there uncertainly prominent in light of the fact that the way of investigative writings toward brings out in grouping with the expectation to be reasonable occurrence demanding purposes. For this situation, the mining portrayal fit for confine arrangements that distinguish the ideas of the decision or archive, which inclines toward see the topic of the report. In an empty employment, the idea based taking out portrayal be used only expected for common transcript accreditations grouping in amassing to bunched the transcript parts of the certifications in tally to competently finds vital similar ideas between qualifications, agreeing toward the semantics sentence. However the negative part of the activity be with the expectation of the open occupation can't subsist associated toward net qualifications bunching alongside the transcript classification planned for the accreditations be an undependable solitary. Idea Based illustration out portrayal utilized for appealing transcript Clustering.

Authors and Affiliations

C. Bhargavi
PG Scholar (M.TECH), Department of Computer Science and Engineering, VNR Vignan Jyothi Institute of Engineering and Technology, Hyderabad, Telangana, India
Dr. A. Brahmananda Reddy
Associate Professor, Department of Computer Science and Engineering, VNR Vignan Jyothi Institute of Engineering and Technology, Hyderabad, Telangana, India

Concept-based drawing out form, Concept-based similarity, Text clustering, Document clustering

  1. Shady Shehata, Fakhri Karray and Mohamed S. Kamel, "An Efficient Concept-Based Mining Model for Enhancing Text Clustering", IEEE Transactions on Knowledge and Data Engineering, Vol. 22, No.10, pp. 1360 – 1371, October 2010.
  2. B. Frakes and R. Baeza-Yates, Information Retrieval: Data Structures and Algorithms. Prentice Hall, 1992.
  3. K. Aas and L. Eikvil, "Text Categorisation: A Survey," Technical Report 941, Norwegian Computing Center, June 1999.
  4. G. Salton, A. Wong, and C.S. Yang, "A Vector Space Model for Automatic Indexing," Comm. ACM, vol. 18, no. 11, pp. 112-117, 1975.
  5. G. Salton and M.J. McGill, Introduction to Modern Information Retrieval. McGraw-Hill, 1983.
  6. S. Pradhan, W. Ward, K. Hacioglu, J. Martin, and D. Jurafsky, "Shallow Semantic Parsing Using Support Vector Machines," Proc. Human Language Technology/North Am. Assoc. for Computational Linguistics (HLT/NAACL), 2004.
  7. C. Fillmore, "The Case for Case," Universals in Linguistic Theory, Holt, Rinehart and Winston, 1968.
  8. S.Y. Lu and K.S. Fu, "A Sentence-to-Sentence Clustering Procedure for Pattern Analysis," IEEE Trans. Systems, Man, and Cybernetics, vol. 8, no. 5, pp. 381-389, May 1978.
  9. T. Honkela, S. Kaski, K. Lagus, and T. Kohonen, "WEBSOM—Self-Organizing Maps of Document Collections," Proc. Workshop Self- Organizing Maps (WSOM ’97), 1997.
  10. D. Jurafsky and J.H. Martin, Speech and Language Processing. Prentice Hall, 2000.
  11. U.Y. Nahm and R.J. Mooney, "A Mutually Beneficial Integration of Data Mining and Information Extraction," Proc. 17th Nat’l Conf. Artificial Intelligence (AAAI ’00), pp. 627-632, 2000.
  12. L. Talavera and J. Bejar, "Generality-Based Conceptual Clustering with Probabilistic Concepts," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 23, no. 2, pp. 196-206, Feb. 2001.
  13. H. Jin, M.-L. Wong, and K.S. Leung, "Scalable Model-Based Clustering for Large Databases Based on Data Summarization," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 27, no. 11, pp. 1710-1719, Nov. 2005.
  14. T. Hofmann, "The Cluster-Abstraction Model: Unsupervised Learning of Topic Hierarchies from Text Data," Proc. 16th Int’l Joint Conf. Artificial Intelligence (IJCAI ’99), pp. 682-687, 1999.
  15. M. Steinbach, G. Karypis, and V. Kumar, "A Comparison of Document Clustering Techniques," Proc. Knowledge Discovery and Data Mining (KDD) Workshop Text Mining, Aug. 2000.
  16. K. Aas and L. Eikvil. Text categorisation: A survey. technical report 941. Technical report, Norwegian Computing Center, June 1999.
  17. M. Collins. Head-Driven Statistical Model for Natural Language Parsing. PhD thesis, University of Pennsylvania, 1999.
  18. R. Feldman and I. Dagan. Knowledge discovery in textual databases (kdt). In Proceedings of First International Conference on Knowledge Discovery and Data Mining, pages 112 - 117, 1995.
  19. S. Shehata, F. Karray, and M. Kamel. Enhancing text clustering using conceptbased mining model. In ICDM, pages 1043{1048, 2006.
  20. W. Francis and H. Kucera. Manual of information to accompany a standard corpus of present-day edited americanenglish, for use with digital computers, 1964.
  21. T. Joachims. Text categorization with support vector machines: learning with many relevant features. Proceedings of ECML-98, 10th European Conference on Machine Learning, number 1398, pages 137-142, Chemnitz, DE, 1998. Springer Verlag, Heidelberg, DE.
  22. S. Pradhan, W. Ward, K. Hacioglu, J. Martin, and D. Jurafsky. Shallow semantic parsing using support vector machines. In Proceedings of the Human Language Technology/North American Association for Computational Linguistics (HLT/NAACL),2004.

Publication Details

Published in : Volume 3 | Issue 1 | January-February 2018
Date of Publication : 2017-12-31
License:  This work is licensed under a Creative Commons Attribution 4.0 International License.
Page(s) : 389-395
Manuscript Number : CSEIT1726309
Publisher : Technoscience Academy

ISSN : 2456-3307

Cite This Article :

C. Bhargavi, Dr. A. Brahmananda Reddy, "Transforming Unstructured Data into Conceptual Representation Using WORDNET", International Journal of Scientific Research in Computer Science, Engineering and Information Technology (IJSRCSEIT), ISSN : 2456-3307, Volume 3, Issue 1, pp.389-395, January-February-2018.
Journal URL :

Article Preview

Follow Us

Contact Us