• A System For Health Document Classification Using Machine Learning

  • REFRENCES -- [Total Page(s) 1]

    Page 1 of 1

    • REFERENCE
      Russell Power, Jay Chen, Trishank Karthik and Lakshminarayanan Subramanian (2018),“Document Classification for Focused Topics” https://cs.nyu.edu/~jchen/publications/aaai4d-power.pdf.
      Hull D., J. Pedersen, and H. Schutze (1996), “Document routing as statistical classification,” in AAAI Spring Symp. On Machine Learning in Information Access Technical Papers, Palo Alto.
      Fox C. (1992), “Lexical analysis and stoplist,” in Information Retrieval Data Structures and Algorithms, W. Frakes and R. Baeza-Yates, Eds. Prentice Hall, , pp. 102–130.
      Geisser S. (1992), Predictive Inference. NY: Chapman and Hall.
      Liu H. and Motoda (1998), Feature Extraction, construction and selection: A Data Mining Perspective. Boston, Massachusetts: Springer.
      Wang Y. and X. Wang (2005), “A new approach to feature selection in text classification,” in Proceedings of 4th International Conference on Machine Learning and Cybernetics, vol. 6, pp. 3814–3819.
      Montanes E. (2003), J. Ferandez, I. Diaz, E. F. Combarro, and J. Ranilla, “Measures of rule quality for feature selection in text categorization,” in 5th international Symposium on Intelligent data analysis. Germany: Springer-Verlag, , pp. 589–598.
      Aurangzeb K., B. Baharum, L. H. Lee, and K. Khairullah (2010), “A review of machine learning algorithms for text-documents classification,” Journal of Advances in Information Technology, vol. 1, no. 1.
      Wang Z.-Q., X. Sun, D.-X. Zhang, and X. Li (2006), “An optimal svm based text classification algorithm,” in Fifth International Conference on Machine Learning and Cybernetics, pp. 13–16.
      PAZIENZA, M. T., ed. 1997. Information Extraction. Lecture Notes in Computer Science, Vol. 1299. Springer, Heidelberg, Germany. RILOFF.
      E. 1995. Little words can make a big difference for text classification. In Proceedings of SIGIR-95, 18th ACM International Conference on Research and Development in Information Retrieval (Seattle, WA, 1995), 130–136.
      Leopold, Edda & Kindermann, Jörg (2002), "Text Categorization with Support Vector Machines. How to Represent Texts in Input Space?", Machine Learning 46, pp. 423 - 444.
      Brank J., Grobelnik M., Milic-Frayling N., Mladenic D. (2002), "Interaction of Feature Selection Methods and Linear Classification Models", Proc. of the 19th International Conference on Machine Learning, Australia.
      Forman, G., An Experimental Study of Feature Selection Metrics for Text Categorization. Journal of Machine Learning Research, 3 2003, pp. 1289-1305.
      LI Y. H. AND A. K. JAIN (1998), Classification of Text Documents, THE COMPUTER JOURNAL, Vol. 41, No. 8.
      Goller C., J. Löning, T. Will, W. Wolff (2000), Automatic Document Classification: A thorough Evaluation of various Methods, International Symposiums for Information swissenschaft (ISI 2000), Darmstadt, 8. –
      10.    November 2000.
      Ankit Basarkar (2017), DOCUMENT CLASSIFICATION USING
      MACHINE LEARNING, San Jose State University SJSU ScholarWorks http://scholarworks.sjsu.edu/?utm_source=scholarworks.sjsu.edu Retrieved 5 March 2018.
      Kim S. B. , Rim H. C. , Yook D. S. and Lim H. S. (2002),"Effective Methods for Improving Naive Bayes Text Classifiers", LNAI 2417, 2002, pp. 414-423.
      Klopotek M. and Woch M. (2003), "Very Large Bayesian Networks in Text Classification", ICCS 2003, LNCS 2657, 2003, pp. 397-406
      Shanahan J. and Roma N. (2003), Improving SVM Text Classification Performance through Threshold Adjustment, LNAI 2837, 2003, 361-
  • REFRENCES -- [Total Page(s) 1]

    Page 1 of 1

    • ABSRACT - [ Total Page(s): 1 ]ABSTRACTDue to the massive increase in medical documents every day (including books, journals, blogs, articles, doctors' instructions and prescriptions, emails from patients, etc.), it is becoming very challenging to handle and to categorize them manually. One of the most challenging projects in information systems is extracting information from unstructured texts, including medical document classification. The discovery of knowledge from medical datasets is important in order to make effective ... Continue reading---

         

      APPENDIX A - [ Total Page(s): 2 ]APPENDIX A ... Continue reading---

         

      APPENDIX C - [ Total Page(s): 1 ]APPENDIX Cen-diseases.trainMalaria is a life-threatening mosquito-borne blood disease caused by a Plasmodium parasite Malaria was eliminated from the U.S. in the early 1950sMalaria is typically spread by mosquitoesMalaria symptoms can be classified into two categoriesMalaria happens when a bite from the female Anopheles mosquito infects the body with PlasmodiumMalaria is a mosquito-borne infectious disease affecting humans and other animals caused by parasitic protozoansMalaria is a mosquito-bor ... Continue reading---

         

      APPENDIX B - [ Total Page(s): 11 ]APPENDIX B ... Continue reading---

         

      CHAPTER ONE - [ Total Page(s): 2 ]CHAPTER ONE1.0    INTRODUCTIONThis chapter introduces the topic of the project work A System for Health Document Classification Using Machine Learning. In this chapter, we will consider the background of the study, statement of the problem, aims and objectives, methodology used to design the system, scope of the study, its significance, definition of terms, and we conclude with the project layout or organization of the project work.1.1    BACKGROUND OF THE STUDYContemporarily, most hospita ... Continue reading---

         

      CHAPTER TWO - [ Total Page(s): 3 ]CHAPTER TWOLITERATURE REVIEW2.0    DOCUMENT CLASSIFICATIONClassification can be divided in two principal phases. The first phase is document representation, and the second phase is classification. The standard document representation used in text classification is the vector space model. The difference of classification systems is in document representation models. The more relevant the representation is, the more relevant the classification will be. The second phase includes learning from tr ... Continue reading---

         

      CHAPTER THREE - [ Total Page(s): 3 ]3.4    SEQUENCE DIAGRAMSequence diagrams are simple subsets of interaction diagrams. They map out sequential events in an engineering or business process in order to streamline activities. Sequence diagrams are used to show how objects interact in a given situation. An important characteristic of a sequence diagram is that time passes from top to bottom: the interaction starts near the top of the diagram and ends at the bottom (i.e. Lower equals Later).3.5    CLASS DIAGRAMSWe begin our OOD ... Continue reading---

         

      CHAPTER FOUR - [ Total Page(s): 5 ]CHAPTER FOUR SYSTEM IMPLEMENTATION4.0    INTRODUCTIONAfter careful requirement gathering, analysis and design, the system is implemented. Implementation involves testing the system with required data and observing the results to see if the system has been properly deigned or if it contains bugs. This is usually done with data which has known results. In this chapter we will implement the system designed.4.1    SYSTEM REQUIREMENTSTo implement the application, the computer on which it will r ... Continue reading---

         

      CHAPTER FIVE - [ Total Page(s): 1 ]CHAPTER FIVE SUMMARY AND CONCLUSION5.0    INTRODUCTIONThis chapter summarizes and concludes the project work; it also gives recommendations and insight to future work.5.1    SUMMARYIn this project work we were able to succeed in applying Natural Language Processing which is a branch of Machine Learning to Classifying Health related documents. We made use of the OpenNLP Application Programming Interface which is a Java API for training a model and classifying the documents. We make use of M ... Continue reading---