Ranking Search Engine Result Pages based on Trustworthiness of Websites
The World Wide Web (WWW) is the repository of large number of web pages which can be accessed via Internet by multiple users at the same time and therefore it is Ubiquitous in nature. The search engine is a key application used to search the web pages from this huge repository, which uses the link analysis for ranking the web pages without considering the facts provided by them. A new algorithm called Probability of Correctness of Facts(PCF)-Engine is proposed to find the accuracy of the facts provided by the web pages. It uses the Probability based similarity function (SIM) which performs the string matching between the true facts and the facts of web pages to find their probability of correctness. The existing semantic search engines, may give the relevant result to the user query but may not be 100% accurate. Our algorithm computes trustworthiness of websites to rank the web pages. Simulation results show that our approach is efficient when compared with existing Voting and Truthfinder[1] algorithms with respect to the trustworthiness of the websites.
Keywords: Data Quality, Page Rank, Search Engine, Trustworthiness, Web Content Mining, Web Mining
Download Full-Text
ABOUT THE AUTHORS
Srikantaiah K C
Srikantaiah K C is an Associate Professor in the Department of Computer Science and Engineering at S J B Institute of Technology, Bangalore, India. He obtained his B.E and M.E degrees in Computer Science and Engineering from Bangalore University, Bangalore. He is presently pursuing his Ph.D programme in the area of Web mining in Bangalore University. His research interest is in the area of Data mining, Web mining and Semantic Web.
Srikanth P L
Department of Computer Science and Engineering University Visvesvaraya College of Engineering, Bangalore University, Bangalore-560 001, India
Tejaswi V
Department of Computer Science and Engineering University Visvesvaraya College of Engineering, Bangalore University, Bangalore-560 001, India
Shaila K
Department of Computer Science and Engineering University Visvesvaraya College of Engineering, Bangalore University, Bangalore-560 001, India
Venugopal K R
Department of Computer Science and Engineering University Visvesvaraya College of Engineering, Bangalore University, Bangalore-560 001, India
L M Patnaik
Honorary Professor, Indian Institute of Science, Bangalore, India
Srikantaiah K C
Srikantaiah K C is an Associate Professor in the Department of Computer Science and Engineering at S J B Institute of Technology, Bangalore, India. He obtained his B.E and M.E degrees in Computer Science and Engineering from Bangalore University, Bangalore. He is presently pursuing his Ph.D programme in the area of Web mining in Bangalore University. His research interest is in the area of Data mining, Web mining and Semantic Web.
Srikanth P L
Department of Computer Science and Engineering University Visvesvaraya College of Engineering, Bangalore University, Bangalore-560 001, India
Tejaswi V
Department of Computer Science and Engineering University Visvesvaraya College of Engineering, Bangalore University, Bangalore-560 001, India
Shaila K
Department of Computer Science and Engineering University Visvesvaraya College of Engineering, Bangalore University, Bangalore-560 001, India
Venugopal K R
Department of Computer Science and Engineering University Visvesvaraya College of Engineering, Bangalore University, Bangalore-560 001, India
L M Patnaik
Honorary Professor, Indian Institute of Science, Bangalore, India