A Novel Semi Supervised Algorithm for Text Classification Using BPNN by Active Search
Demand of Text Classification is increasing with the evolution of huge amount of text data available in internet, news, institutes , To make an effective text classifier we need large amount of labeled data in the form of training samples, to get labeled data is not only expensive but also time consuming, tedious task, whereas unlabelled data is easily available inexpensive. This paper proposes an algorithm that just makes use of some root words from expert followed by active search. Our algorithm also makes use of a very effective Term weighting method based on relevance factor that is used for feature representation, this text is train by BPNN. The proposed algorithm is compared on test data and on standard data 20 Newsgroup and mini Newsgroup on the basis of micro-average and macro-averaged F1 measure The Experimental results depicts the best micro averaged F1 measure of 0.95 at 2400 epochs for test data, 0.67 for 20 news group and is 0.95 for Mini Newsgroup which are comparable with the well known supervised Text classification.
Keywords: Semi Supervised, text classification, Active search, term weighting method, Neural network
Download Full-Text
ABOUT THE AUTHORS
Mahak Motwani
Mahak Motwani received B.E Degree in Computer science & Engineering from Ravi Shankar University, M.Tech in Computer science & engineering from RGPV, Bhopal. She is currently Pursuing PhD in the field of Data Mining from RGPV, Bhopal. She has been working from 2008 to 2013 as Assistant Professor in Computer Science Department of Truba institute of Engineering & Information Technology, Bhopal. Currently she is working as Assistant Professor in Computer science department of Truba College of Science and Technology, Bhopal, India
Aruna Tiwari
Aruna Tiwari received her B.E..and M.E. degree in computer science from SGSITS, Indore . PhD degree in Computer Science from RGPV, Bhopal. She worked as Lecturer in Shri Vaishanav Inst. Of Tech. & Sc., Indore from 1997 to 2001, she was working with SGSITS, Indore from 2001 to 2012 as Associate Professor, Currently She is working in Computer science department of Indian Institute of Technology, Indore, India.
Mahak Motwani
Mahak Motwani received B.E Degree in Computer science & Engineering from Ravi Shankar University, M.Tech in Computer science & engineering from RGPV, Bhopal. She is currently Pursuing PhD in the field of Data Mining from RGPV, Bhopal. She has been working from 2008 to 2013 as Assistant Professor in Computer Science Department of Truba institute of Engineering & Information Technology, Bhopal. Currently she is working as Assistant Professor in Computer science department of Truba College of Science and Technology, Bhopal, India
Aruna Tiwari
Aruna Tiwari received her B.E..and M.E. degree in computer science from SGSITS, Indore . PhD degree in Computer Science from RGPV, Bhopal. She worked as Lecturer in Shri Vaishanav Inst. Of Tech. & Sc., Indore from 1997 to 2001, she was working with SGSITS, Indore from 2001 to 2012 as Associate Professor, Currently She is working in Computer science department of Indian Institute of Technology, Indore, India.