Saturday 20th of April 2024
 

Validation of Architecture of Migrating Parallel Web Crawler using Finite State Machine


Md Faizan Farooqui, Md Rizwan Beg and Md Qasim Rafiq

The process of downloading web pages is known as web crawling. In this paper we validate the architecture of Migrating parallel web crawler using finite state machine. The method for Migrating Parallel Web Crawling approach will detect changes in the content and structure. Also Domain specific crawling will yield high quality pages. The crawling process will migrate to host or server with specific domain and start downloading pages within specific domain. Incremental crawling will keep the pages in local database fresh thus increasing the quality of download-ed pages. The crawling strategy makes web crawling system more effective and efficient. Test cases are generated for the validation of the architecture. The approach for generating the test cases through FSM is very reliable and efficient and does not support for the invalid test cases. Valid input strings are generated as test cases.

Keywords: Web crawling, parallel migrating web crawler, search engine, validation

Download Full-Text


ABOUT THE AUTHORS

Md Faizan Farooqui
Department of Computer Application Integral University Lucknow

Md Rizwan Beg
Department of Computer Science and Engineering Integral University Lucknow

Md Qasim Rafiq
Department of Computer Engineering Aligarh Muslim University Aligarh


IJCSI Published Papers Indexed By:

 

 

 

 
+++
About IJCSI

IJCSI is a refereed open access international journal for scientific papers dealing in all areas of computer science research...

Learn more »
Join Us
FAQs

Read the most frequently asked questions about IJCSI.

Frequently Asked Questions (FAQs) »
Get in touch

Phone: +230 911 5482
Email: info@ijcsi.org

More contact details »