Questions tagged [plagiarism-detection]

74 questions
3
votes
2 answers

Designing a noise filter for a Plagiarism Detection Engine in Ruby

I have been working on an implementation of a plagiarism detection engine based on the academic paper behind MOSS (Measure of Software Similarity). For designing a noise filter for a language like C/C++/Java, I have some decisions to…
Nitish Upreti
3
votes
2 answers

How can I mirror the results of MOSS plagiarism detection?

MOSS is a well-known server for checking software plagiarism. It allows teachers to send homework submissions, calculates the similarity between different submissions, and colors code blocks that are very similar. Here is an example of the results…
Erel Segal-Halevi
3
votes
0 answers

Block specific domain (qaru.site) from google search results

Recently I started seeing results from qaru.site in the first 10 search results on Google. This is kind of annoying, as I hate plagiarism and it's just machine-translated questions from Stack Overflow, e.g. java jaxb simple parsing is requiring…
skwisgaar
3
votes
1 answer

Fast String Searching Algorithm for large strings

I'm trying to implement plagiarism detection software using pattern-matching algorithms. I came across the KMP algorithm and tried out the C# implementation. I can see that it's not as fast for actual documents (not strings, I uploaded two…
3
votes
1 answer

Plagiarism detection with JPlag for different languages

I have a bunch of code submissions from students for a coding question. I am using JPlag to find the similarities between them. java -jar jplag-yourVersion.jar -l java17 -r /tmp/jplag_results_exerise1/ -s /path/to/exercise1 This kind of syntax works…
Siva Praveen
3
votes
1 answer

How to use Google API for plagiarism detection in PHP

I am building a small plagiarism detection system in PHP for practice. I did some research on Google and figured that I could use a Google API (the Custom Search API) to build plagiarism detection software. I found this question very helpful [How…
Bikal Basnet
3
votes
1 answer

Find similar source code on the Internet

I suspect that one of my students has downloaded his solution from the Internet, as it is a very common basic task. The assignment was to implement merging of sorted sequences into a single sorted sequence in C++. I would like to find the source of the…
Notinlist
2
votes
1 answer

Rabin-Karp implementation too slow in Ruby

I have been working on a small plagiarism detection engine which uses ideas from MOSS. I need a rolling hash function, inspired by the Rabin-Karp algorithm. The code I wrote: #!/usr/bin/env ruby #Designing a rolling hash function. #Inspired from…
Nitish Upreti
2
votes
1 answer

If someone copies your GitHub repository, creates a similar repository, and claims that they coded it, is that considered plagiarism?

Someone copied my GitHub repository and created another GitHub repository similar to my own. That person replaced my name with his codename and didn't even give credit. Is that considered plagiarism? If no, can you explain why? And if yes, can…
Zech
2
votes
2 answers

What are the alternatives to using the iThenticate service for content comparison?

What are the alternatives to using the iThenticate service for content comparison?
curious
2
votes
1 answer

Running Stanford MOSS behind an institution proxy

I am running MOSS by Stanford. I am using the Perl script sent by Stanford at the time of registration, but it gives this error: Checking files . . . OK Could not connect to server moss.stanford.edu: Connection refused I tried adding the institute proxy…
KrunalParmar
1
vote
1 answer

How to tokenize a C++ source code file (convert it to a sequence of tokens)?

Possible Duplicate: How to create a plagiarism detector of c++ files; a simple lexer.cpp to convert a simple c++ file to a sequence of tokens. Hi, I have a project, "plagiarism detector for C++ files", and I need to know how to convert a C++ file to…
moradpro
1
vote
0 answers

Highlight similar sentences in two documents, not just display a similarity score

I am working on a problem where I need to find exact or similar sentences in two or more documents. I read a lot about cosine similarity and how it can be used to detect similar text. Here is the code that I tried: my_file = open("test.txt",…
1
vote
2 answers

Check similarity/plagiarism between articles in MySQL via Python

I have a MySQL database where I add news articles, and before adding one I try to compare it with the last 100 articles to see if it has any similarity. So if it is 95% similar I can tag it as the same as article 122, or if it is 70-95% similar I can tag…
artodm
1
vote
0 answers

Elasticsearch: the best way to compare large texts

I have several thousand texts in Elasticsearch which I have to compare with text segments to detect plagiarism (exact matches). I plan to take several large segments from different parts of the checked texts and find them in documents loaded…
Kadimon