Tuesday 23rd of April 2024
 

The effect of N-gram indexing on Arabic documents retrieval


Emad Fawzi Al-Shalabi

This article presents a comparison between 3-gram and 4-gram term indexing in Arabic document retrieval. The calculation of similarity between query and documents is performed using single term and two term query, based on corpora of Arabic language documents collected from Arabic news websites available online.

Keywords: n-gram, Arabic text indexing, information retrieval, text similarity.

Download Full-Text


ABOUT THE AUTHOR

Emad Fawzi Al-Shalabi
Department of Information Technology, AL-BALQA Applied University, Al-Huson University College, Irbid, Al-Huson, 50, Jordan


IJCSI Published Papers Indexed By:

 

 

 

 
+++
About IJCSI

IJCSI is a refereed open access international journal for scientific papers dealing in all areas of computer science research...

Learn more »
Join Us
FAQs

Read the most frequently asked questions about IJCSI.

Frequently Asked Questions (FAQs) »
Get in touch

Phone: +230 911 5482
Email: info@ijcsi.org

More contact details »