-3

Hi, I am a student and I have a project, "plagiarism detector for C++ and Java files", and I am in trouble.

I would like to see an existing example: a project with that title, or documentation in this field (on the practical side).

Name is Nilay
moradpro
  • I don't know what plagiarism detection is, because the tag has no wiki summary. What is it? – Mr Lister Mar 13 '12 at 16:12
  • 2
    @Mr. Lister: It's... the detection of plagiarism... – Niklas B. Mar 13 '12 at 16:12
  • 14
    So you want to copy the answer of how to create a plagiarism detector from someone else on SO? – Martin Beckett Mar 13 '12 at 16:14
  • 1
    Are you trying to use c++/java to detect plagiarism in text files, or are you trying to detect plagiarism *in the code itself*? – tmpearce Mar 13 '12 at 16:14
  • And that has a tag of its own? Ah... OK. – Mr Lister Mar 13 '12 at 16:15
  • The top FAQ in `plagiarism-detection`: http://stackoverflow.com/questions/5294447/how-can-i-find-source-code-copying – tmpearce Mar 13 '12 at 16:16
  • To start, similar file size and line count, then do a checksum of every line and compare to around the same line in the "original" file. Fuzzy text comparison algorithms might be good too. – Some programmer dude Mar 13 '12 at 16:18
  • This is an open research topic. Google for 'software clones' or 'clone detection'. – alfa Mar 14 '12 at 12:30
  • possible duplicate of [a simple lexer.cpp to convert a simle c++ file to a sequence of tokens](http://stackoverflow.com/questions/9969836/a-simple-lexer-cpp-to-convert-a-simle-c-file-to-a-sequence-of-tokens) – bmargulies Apr 02 '12 at 01:38
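
The line-by-line comparison suggested in the comments above could be sketched roughly as follows. This is only an illustration under assumptions: `lineHashes`, `matchingLineFraction`, and the `window` parameter are hypothetical names, and `std::hash` stands in for whatever checksum one would actually choose.

```cpp
#include <algorithm>
#include <cstddef>
#include <functional>
#include <sstream>
#include <string>
#include <vector>

// Hash every line of the text. std::hash is a convenience choice here
// (an assumption), not a cryptographic checksum.
std::vector<std::size_t> lineHashes(const std::string& text) {
    std::istringstream in(text);
    std::vector<std::size_t> hashes;
    std::string line;
    while (std::getline(in, line)) {
        hashes.push_back(std::hash<std::string>{}(line));
    }
    return hashes;
}

// Fraction of lines in `suspect` whose hash also occurs in `original`
// within `window` lines of the same position.
double matchingLineFraction(const std::string& suspect,
                            const std::string& original,
                            std::size_t window) {
    const std::vector<std::size_t> s = lineHashes(suspect);
    const std::vector<std::size_t> o = lineHashes(original);
    if (s.empty()) return 0.0;

    std::size_t matches = 0;
    for (std::size_t i = 0; i < s.size(); ++i) {
        const std::size_t lo = (i > window) ? i - window : 0;
        const std::size_t hi = std::min(o.size(), i + window + 1);
        for (std::size_t j = lo; j < hi; ++j) {
            if (o[j] == s[i]) {
                ++matches;
                break;  // count each suspect line at most once
            }
        }
    }
    return static_cast<double>(matches) / static_cast<double>(s.size());
}
```

Exact line hashes are defeated by trivial edits such as renamed variables or changed whitespace, so in practice this would probably be combined with normalization or tokenization first.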

1 Answer

3

I think you could first create a `double originality()` measurement function that returns a floating-point value between 0 and 1, and then use it in your plagiarism detector via the formula plagiarism = 1 - originality(). Then define a threshold level, and voilà.
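
As a minimal sketch of that idea, assuming originality is measured as one minus the Jaccard similarity of whitespace-separated tokens (that metric, the two-argument signatures, and the names `tokenSet` and `isSuspicious` are illustrative assumptions, not part of the suggestion above):

```cpp
#include <cstddef>
#include <set>
#include <sstream>
#include <string>

// Split source text into a set of whitespace-separated tokens.
// A crude stand-in for a real lexer (assumption for illustration).
std::set<std::string> tokenSet(const std::string& text) {
    std::istringstream in(text);
    std::set<std::string> tokens;
    std::string tok;
    while (in >> tok) tokens.insert(tok);
    return tokens;
}

// Returns a value in [0, 1]: 1 means no shared tokens,
// 0 means the two token sets are identical.
double originality(const std::string& candidate, const std::string& reference) {
    const std::set<std::string> a = tokenSet(candidate);
    const std::set<std::string> b = tokenSet(reference);
    if (a.empty() && b.empty()) return 1.0;  // nothing to compare

    std::size_t shared = 0;
    for (const std::string& t : a) shared += b.count(t);
    const std::size_t unionSize = a.size() + b.size() - shared;
    return 1.0 - static_cast<double>(shared) / static_cast<double>(unionSize);
}

// plagiarism = 1 - originality(), as in the formula above.
double plagiarism(const std::string& candidate, const std::string& reference) {
    return 1.0 - originality(candidate, reference);
}

// Flag a pair of files once the score reaches a chosen threshold.
bool isSuspicious(const std::string& candidate, const std::string& reference,
                  double threshold) {
    return plagiarism(candidate, reference) >= threshold;
}
```

For example, `isSuspicious(fileA, fileB, 0.8)` would flag two files that share most of their tokens; the threshold value itself has to be tuned on real submissions.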

Dmitriy Kachko