Sunday 21st of January 2018

Document Segmentation And Region Classification Using Multilayer Perceptron

Priyadharshini N and Vijaya Ms

A document holds a great deal of knowledge, and documents are a common means of sharing information with others. Retrieving information from documents manually requires considerable human effort, is time consuming, and can severely restrict the usage of information systems. Automatic information retrieval from documents has therefore become a significant issue. It has been shown that document segmentation can help to overcome such issues. Document segmentation is the process of splitting a document into distinct regions. This paper proposes a new approach to segment a document and classify its regions as text, image, graphics, or table. The document image is segmented into blocks using the run length smearing algorithm, and features are extracted from each block. A multilayer perceptron, a supervised learning technique, is used to construct the classifier, which achieved a classification accuracy of 97.49%.
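
The run length smearing step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (smear_row, rlsa), the threshold values, and the choice to fill only white gaps bounded by ink on both sides are assumptions made for illustration. The image is taken to be a list of rows in which 1 marks an ink pixel and 0 a background pixel. The sketch smears horizontally and vertically and combines the two results with a logical AND, as in the commonly described form of the algorithm.

```python
def smear_row(row, threshold):
    """Fill runs of 0s of length <= threshold that lie between two 1s.

    Assumption: background runs touching the start or end of the row
    are left unchanged.
    """
    out = list(row)
    last_ink = None  # index of the most recent ink pixel seen so far
    for i, pixel in enumerate(row):
        if pixel == 1:
            if last_ink is not None:
                gap = i - last_ink - 1
                if 0 < gap <= threshold:
                    # Short gap: merge the two ink pixels into one run.
                    for j in range(last_ink + 1, i):
                        out[j] = 1
            last_ink = i
    return out


def rlsa(image, h_threshold, v_threshold):
    """Smear rows and columns separately, then AND the two results."""
    horizontal = [smear_row(row, h_threshold) for row in image]

    # Transpose so that each column can be smeared like a row.
    columns = [list(col) for col in zip(*image)]
    smeared_columns = [smear_row(col, v_threshold) for col in columns]
    vertical = [list(row) for row in zip(*smeared_columns)]

    return [
        [h & v for h, v in zip(h_row, v_row)]
        for h_row, v_row in zip(horizontal, vertical)
    ]
```

Connected regions of 1s in the result would then serve as candidate blocks for feature extraction. Suitable thresholds depend on the scan resolution and layout, and the abstract does not state which values were used.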

Keywords: Document analysis, Information retrieval, Classification, Feature extraction, Document segmentation


Priyadharshini N
She is pursuing a Master of Philosophy in Computer Science at PSGR Krishnammal College for Women under the guidance of MS. Vijaya. Her research interests are data mining, image processing, and pattern recognition.

Vijaya Ms
She is presently working as an Associate Professor in the GR Govindarajulu School of Applied Computer Technology, PSGR Krishnammal College for Women, Coimbatore, India. She has 22 years of teaching experience and 8 years of research experience. She completed her doctoral programme in the area of Natural Language Processing. Her areas of interest include Data Mining, Support Vector Machines, Machine Learning, Pattern Recognition, Natural Language Processing, and Optimization Techniques. She has presented 22 papers at national conferences and has to her credit 17 publications in international conference proceedings and journals. She is a member of the Computer Society of India, the International Association of Engineers (Hong Kong), and the International Association of Computer Science and Information Technology (IACSIT, Singapore). She is also a reviewer for the International Journal of Computer Science and Information Security.

IJCSI is a refereed open access international journal for scientific papers dealing in all areas of computer science research...