Text Mining for Plagiarism Detection: Multivariate Pattern Detection for Recognition of Text Similarities

Keywords: plagiarism detection, text mining, ARPaD, LERP-RSA
Abstract: The problem of plagiarism has intensified in recent years with the availability of information in digital form and the accessibility of electronic libraries through the Internet. As a result, plagiarism detection has been transformed into a big data analysis problem: the number of digital sources is vast, and each new document must be compared with millions of existing documents. Searching and comparing documents in a large database to detect plagiarism is primarily a retrieval and indexing task. The article proposes a text mining method that can detect all common patterns between documents and reference databases. The technique is based on pattern detection algorithms and their corresponding data structures, which together enable the detection of all common patterns.
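As a rough illustration of what "detecting all common patterns between a document and a reference database" can mean in practice, the following is a minimal sketch. It is not the ARPaD algorithm or the LERP-RSA data structure named in the keywords. It substitutes a much simpler technique, an in-memory index of word k-grams, and assumes naive whitespace tokenization. The function names (`tokenize`, `build_index`, `common_patterns`) and the minimum pattern length `k` are hypothetical choices made for this example.

```python
from collections import defaultdict


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately naive tokenizer)."""
    return text.lower().split()


def build_index(references, k=3):
    """Map every k-word sequence to the (doc_id, position) pairs where it occurs."""
    index = defaultdict(list)
    for doc_id, text in references.items():
        words = tokenize(text)
        for i in range(len(words) - k + 1):
            index[tuple(words[i:i + k])].append((doc_id, i))
    return index


def common_patterns(document, references, k=3):
    """Return maximal shared word sequences of at least k words, per reference."""
    index = build_index(references, k)
    ref_words = {doc_id: tokenize(text) for doc_id, text in references.items()}
    words = tokenize(document)
    found = defaultdict(set)
    for i in range(len(words) - k + 1):
        for doc_id, j in index.get(tuple(words[i:i + k]), []):
            ref = ref_words[doc_id]
            # Skip matches that merely continue a longer match starting earlier.
            if i > 0 and j > 0 and words[i - 1] == ref[j - 1]:
                continue
            # Extend the match to the right for as long as both texts agree.
            n = k
            while (i + n < len(words) and j + n < len(ref)
                   and words[i + n] == ref[j + n]):
                n += 1
            found[doc_id].add(" ".join(words[i:i + n]))
    return dict(found)


if __name__ == "__main__":
    refs = {
        "a": "the quick brown fox jumps over the lazy dog",
        "b": "a completely unrelated sentence about databases",
    }
    print(common_patterns("Yesterday the quick brown fox jumps over a fence", refs))
```

A sketch like this rebuilds its index on every call and keeps everything in memory, so it does not address the scale described in the abstract, where a new document is compared against millions of existing ones. That scale is precisely what motivates dedicated pattern detection algorithms and data structures.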