Medical Papers Cluster Supported Medication-Symptom Names Using Multi Read Nonnegative Matrix Factorization

T. Gowthami

doi:10.32628/CSEIT1833410

Authors

T. Gowthami MCA Sri Padmathi College of Computer Sciences and Technology Tiruchanoor, Andhra Pradesh, India

Keywords:

Nonnegative Matrix Factorization, multi-view NMF, medication symptom, clinical documents

Abstract

Clinical documents are rich free-text knowledge sources containing valuable medication and symptom data, that have a great potential to enhance health care. Existing system, a brand new convolutional neural network primarily based multimodal disease risk prediction algorithmic rule mistreatment structured and unstructured knowledge from hospital. To the simplest of our information, none of the existing work focused on each knowledge types within the space of medical big knowledge analytics. Compared to many typical prediction algorithms, the prediction accuracy of our proposed algorithm reaches with a convergence speed. Proposed system, we tend to build an integrating system for extracting medication names and symptom names from clinical notes. Then we apply nonnegative Matrix factorization (NMF) and multi-view NMF to cluster clinical notes into purposeful clusters supported sample-feature matrices. Our experimental results show that multi-view NMF could be a preferred methodology for clinical document cluster. Moreover, we discover that using extracted medication symptom names to cluster clinical documents outperforms simply using words.

References

J.M. Kleinberg. Authoritative sources in a hyperlinked environment. Journal of the ACM, 46(5):604-632, 1999.
C. Li, J. Han, G. He, X. Jin, Y. Sun, Y. Yu, and T. Wu. Fast computation of simrank for static and dynamic information networks. In EDBT, pages 465-476, 2010.
D. Lizorkin, P. Velikhov, M.N. Grinev, and D. Turdakov. Accuracy estimate and optimization techniques for simrank computation. PVLDB, 1(1):422-433, 2008.
P.A. McKee, W.P. Castelli, P.M. McNamara, and W.B. Kannel. The natural history of congestive heart failure: The framingham study. N Engl J Med., 285:1441-1446, 1971.
S. Meystre, G. Savova, K.K. Schuler, and J. Hurdle. Extracting information from textual documents in the electronic health record: A review of recent research. IMIA Yearbook of Medical Informatics Methods Inf Med 2008, 2008. 47 Suppl 1:128-44.
L. Page, S. Brin, R. Motwani, and T. Winograd. The pagerank citation ranking: Bringing order to the web. Technical report, Stanford Digital Library Technologies Project, 1998.
A. Pathak, S. Chakrabarti, and M.S. Gupta. Index design for dynamic personalized pagerank. In ICDE, pages 1489-1491, 2008.
Y. Sun, Y. Yu, and J. Han. Ranking-based clustering of heterogeneous information networks with star network schema. In John F. Elder IV, Françoise Fogelman-Soulié, Peter A. Flach, and Mohammed Javeed Zaki, editors, KDD, pages 797-806. ACM, 2009
H. Tong, C. Faloutsos, and J.Y. Pan. Random walk with restart: fast solutions and applications. Knowl. Inf. Syst., 14:327-346, March 2008.
H. Tong, S. Papadimitriou, P.S. Yu, and C. Faloutsos. Proximity tracking on time-evolving bipartite graphs. In SDM, pages 704-715, 2008.
Y. Wang. Annotating and recognising named entities in clinical notes. In ACL/AFNLP (Student Research Workshop), pages 18-26. The Association for Computer Linguistics, 2009.
H.Xu, S.P. Stenner, S. Doan, K.B. Johnson, L.R. Waitman, and J.C. Denny. Medex: a medication information extraction system for clinical narratives. Journal of American Medical Informatics Association, 17(1):19-24, Jan-Feb 2010.
Aronson, A.R., Metamap: Mapping text to the umls metathesaurus. Bethesda, MD: NLM, NIH, DHHS, 2006.
Sondhi, P., et al. SympGraph: a framework for mining clinical notes through symptom relation graphs. in Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining. 2012. ACM.
Williamson, D.P., The primal-dual method for approximation algorithms. Mathematical Programming, 2002. 91(3): p. 447-478.
Mitchell, J.E., Branch-and-cut algorithms for combinatorial optimization problems. Handbook of Applied Optimization, 2002: p. 65-77.
Makhorin, A., GNU linear programming kit. Moscow Aviation Institute, Moscow, Russia, 2001. 38.
RinnooyKan, A. and J. Telgen, The complexity of linear programming. Statistica Neerlandica, 1981. 35(2): p. 91-107.
Uzuner, O., I. Solti, and E. Cadag, Extracting medication information from clinical text. Journal of the American Medical Informatics Association, 2010. 17(5): p. 514-518.
Davis, J. and M. Goadrich. The relationship between Precision-Recall and ROC curves. in Proceedings of the 23rd international conference on Machine learning. 2006. ACM.
Kim, M.-Y., et al., Patient Information Extraction in Noisy Tele-health Texts, in In the IEEE International Conference on Bioinformatics and Biomedicine (BIBM13)2013: Shanghai, China.

Medical Papers Cluster Supported Medication-Symptom Names Using Multi Read Nonnegative Matrix Factorization

Authors

Keywords:

Abstract

References

Downloads

Published

Issue

Section

License

How to Cite