Cross-language Plagiarism Detection (CLPD) automatically identifies and extracts plagiarized passages among documents written in different languages. The main challenge of CLPD is the language gap: the suspect text may be a translated version of the original source. This book proposes an Arabic-English cross-language plagiarism detection method that automatically detects the semantic relatedness between the words of two suspect files. The proposed method consists of six phases. The first phase is pre-processing. The second phase extracts and translates keyphrases. The third phase retrieves the candidate documents that match the keyphrases of the suspected plagiarized text. The fourth phase measures the similarity between the keyphrases of the original text and those of the suspected text. The fifth phase classifies the results using a Linear Logistic Regression (LLR) approach. The last phase evaluates the method using Precision, Recall and F-measure on a dataset of Wikipedia articles. The experimental implementation was done in C# and achieved excellent results.
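
The evaluation phase relies on Precision, Recall and F-measure. As a rough illustration of how these three scores relate, here is a minimal Python sketch. It is not the book's C# implementation: the function name precision_recall_f1 and the sample labels are assumptions made up for this example, and it uses the balanced F-measure (F1).

```python
def precision_recall_f1(predicted, actual):
    """Compute precision, recall and balanced F-measure for binary labels.

    predicted, actual: equal-length sequences of booleans,
    where True means "flagged as plagiarized".
    """
    if len(predicted) != len(actual):
        raise ValueError("predicted and actual must have the same length")

    # Count true positives, false positives and false negatives.
    tp = sum(1 for p, a in zip(predicted, actual) if p and a)
    fp = sum(1 for p, a in zip(predicted, actual) if p and not a)
    fn = sum(1 for p, a in zip(predicted, actual) if not p and a)

    # Guard against division by zero when nothing is flagged or nothing is positive.
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    return precision, recall, f1


# Hypothetical labels, invented for illustration only (not data from the book).
predicted = [True, True, False, True, False]
actual = [True, False, False, True, True]
p, r, f = precision_recall_f1(predicted, actual)
print(f"precision={p:.3f} recall={r:.3f} f1={f:.3f}")
```

Precision is the share of flagged passages that are truly plagiarized, recall is the share of truly plagiarized passages that were flagged, and the F-measure is their harmonic mean.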

Book Details:

ISBN-13:

978-3-330-84467-4

ISBN-10:

3330844671

EAN:

9783330844674

Book language:

Arabic

By (author):

Mohammed Hasan Abdulameer Almayali
Zaid Alaa
Sabrina Tiun

Number of pages:

92

Published on:

2017-01-23

Category:

Internet