Questions tagged [plagiarism-detection]

74 questions
1
vote
0 answers

How can TF-IDF be used for programming source code plagiarism detection?

I briefly understand how TF-IDF works; for detecting plagiarism in articles, it makes sense. Now I was told to use it against programming source code. How can this work? In an article, most words are natural-language words, say English; you can…
user3552178
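One way TF-IDF is sometimes adapted to source code is to replace natural-language words with lexical tokens, normalize identifiers and literals so that renaming variables does not change the result, and weight token n-grams instead of single words. The sketch below is an illustration under those assumptions, not the method the asker was given: the keyword set, the `ID`/`NUM` placeholders, the trigram size, and the smoothed IDF formula are all choices made for this example.

```python
import math
import re
from collections import Counter

# Assumed keyword subset for illustration; a real tool would use the language's lexer.
KEYWORDS = {"def", "return", "for", "in", "if", "else", "while", "import"}
TOKEN_RE = re.compile(r"[A-Za-z_]\w*|\d+|[^\s\w]")


def tokenize(source):
    """Split source into tokens, collapsing identifiers to ID and numbers to NUM."""
    tokens = []
    for tok in TOKEN_RE.findall(source):
        if tok in KEYWORDS:
            tokens.append(tok)
        elif tok[0].isalpha() or tok[0] == "_":
            tokens.append("ID")
        elif tok[0].isdigit():
            tokens.append("NUM")
        else:
            tokens.append(tok)
    return tokens


def ngrams(tokens, n=3):
    """Return overlapping token n-grams joined into strings."""
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def tfidf_vectors(docs):
    """Build one sparse TF-IDF vector (a dict) per document over token trigrams."""
    counts = [Counter(ngrams(tokenize(d))) for d in docs]
    n_docs = len(docs)
    doc_freq = Counter()
    for c in counts:
        doc_freq.update(c.keys())
    vectors = []
    for c in counts:
        total = sum(c.values()) or 1
        vectors.append({
            term: (freq / total) * (math.log((1 + n_docs) / (1 + doc_freq[term])) + 1)
            for term, freq in c.items()
        })
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)
```

With this normalization, two submissions that differ only in identifier names produce the same token stream and therefore the same vector, while structurally different code shares few trigrams. Boilerplate that appears in every submission receives the lowest IDF weight, which is the property that makes TF-IDF useful here.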
1
vote
1 answer

Will Word2Vec be more efficient in text-based plagiarism detection than WordNet or other word embeddings like GloVe, fastText, etc.?

I am a beginner with Word2Vec and have just started studying it online. I have gone through almost all the questions on Quora and Stack Overflow but didn't find my answer in any of the previous questions. So my…
1
vote
0 answers

Plagiarism of common code

I'm currently a student at Rutgers University. I've taken Data Structures before, and now I've been assigned a program that must add/multiply polynomials with a singly linked list. It's really simple; my issue is that I must use a sorting technique…
1
vote
0 answers

Chunking documents to test for plagiarism

I am building a plagiarism checker for text files. I did all the preprocessing (stop-word removal, stemming, etc.), built my index, and filtered the results. The system is almost done. I chunked the corpus and the user document by sentences (sentence…
ahmad
1
vote
1 answer

Web design: how to prevent leeching? Flash anti-leeching methods

I am designing a website with Flash video content. If I pass in the URL to a .flv file as a Flash parameter to the embedded video player, someone can easily extract the URL and download the .flv video file. How do I prevent this? Can someone refer…
user515
1
vote
1 answer

Using sherlock to check for plagiarism through Python's subprocess module

When I run ./sherlock *.txt in the terminal, it works, but when I try to do the same using Python's subprocess module, like: import subprocess subprocess.call(['./sherlock','*.txt']) import subprocess subprocess.call('./sherlock','*.txt',shell=True) import…
Vipul
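A likely cause of the difference described above is that `*.txt` is expanded by the shell, not by the program. When `subprocess.call` receives a list and no shell is involved, the literal string `*.txt` is passed as a single argument. In the second attempt, `'*.txt'` is the second positional argument of `subprocess.call`, which is `bufsize`, so it never reaches the program at all. A minimal sketch of one workaround is to expand the pattern in Python with `glob`; the helper names `build_command` and `run_on_matches` are made up for this example, and `./sherlock` is only the path quoted in the question.

```python
import glob
import subprocess


def build_command(executable, pattern):
    """Return an argument list with the glob pattern expanded by Python."""
    files = sorted(glob.glob(pattern))
    return [executable, *files]


def run_on_matches(executable, pattern):
    """Run the executable on every file matching pattern, without a shell."""
    command = build_command(executable, pattern)
    if len(command) == 1:
        raise FileNotFoundError(f"no files match {pattern!r}")
    return subprocess.call(command)


# Hypothetical usage, assuming ./sherlock exists in the current directory:
# exit_code = run_on_matches("./sherlock", "*.txt")
```

The alternative is to pass the whole command as one string with `shell=True`, for example `subprocess.call('./sherlock *.txt', shell=True)`, which lets the shell do the expansion at the cost of the usual shell-quoting concerns.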
1
vote
2 answers

Compare the textual content of websites

I'm experimenting a bit with textual comparison/basic plagiarism detection, and want to try this on a website-to-website basis. However, I'm a bit stuck in finding a proper way to process the text. How would you process and compare the content of…
Sune Rievers
1
vote
1 answer

Plagiarism detection using the Damerau-Levenshtein algorithm

How would I simulate the Damerau-Levenshtein distance algorithm so as to detect plagiarism in documents? Thanks!
user188881
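As a rough illustration of the idea in this question, the sketch below computes the restricted variant of the Damerau-Levenshtein distance (often called optimal string alignment), which counts insertions, deletions, substitutions, and transpositions of adjacent items, and then turns it into a similarity score. The restricted variant is used here because it is shorter to write; it differs from the unrestricted distance on some inputs. The normalization by the longer length is an assumption of this example, not a standard plagiarism threshold.

```python
def osa_distance(a, b):
    """Optimal string alignment distance between two sequences.

    This is the restricted Damerau-Levenshtein variant: no substring
    may be edited more than once.
    """
    rows, cols = len(a) + 1, len(b) + 1
    d = [[0] * cols for _ in range(rows)]
    for i in range(rows):
        d[i][0] = i  # delete all of a[:i]
    for j in range(cols):
        d[0][j] = j  # insert all of b[:j]
    for i in range(1, rows):
        for j in range(1, cols):
            cost = 0 if a[i - 1] == b[j - 1] else 1
            d[i][j] = min(
                d[i - 1][j] + 1,         # deletion
                d[i][j - 1] + 1,         # insertion
                d[i - 1][j - 1] + cost,  # substitution or match
            )
            if i > 1 and j > 1 and a[i - 1] == b[j - 2] and a[i - 2] == b[j - 1]:
                d[i][j] = min(d[i][j], d[i - 2][j - 2] + 1)  # adjacent transposition
    return d[-1][-1]


def similarity(a, b):
    """Similarity in [0, 1]: 1 minus the distance divided by the longer length."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0
    return 1.0 - osa_distance(a, b) / longest
```

Because the functions only index and compare items, they also accept lists of words, for example `similarity(doc1.split(), doc2.split())`, which tends to be more practical for documents than comparing character by character. The table is quadratic in the input lengths, so long documents are usually compared chunk by chunk.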
1
vote
3 answers

How was Google Books' Popular passages feature developed?

I'm curious if anyone understands, knows or can point me to comprehensive literature or source code on how Google created their popular passage blocks feature. However, if you know of any other application that can do the same please post your…
0
votes
0 answers

401 Unauthorized when calling the Copyleaks scan URL API

I am trying to call a Copyleaks API endpoint using Postman, https://api.copyleaks.com/v3/scans/submit/url/my-special-id, but I am getting this error: { "type": "https://tools.ietf.org/html/rfc7235#section-3.1", "title": "Unauthorized", "status":…
0
votes
0 answers

How to get the percentage of matched text using the KMP and naive string algorithms in JS, Node, and Express JS?

I'm finding it difficult to retrieve the percentage of matched text for a plagiarism project I'm working on. The project is being built with Node and Express JS. The matched text comes from comparing text against a pattern to find any similarities. With…
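The excerpt does not say how "percentage of matched text" is defined, so the sketch below assumes one simple reading: split the suspect text into sentences (or other chunks), search for each one in the source with KMP, and report the share that occurs verbatim. It is written in Python rather than JavaScript purely for illustration, and the function names are invented for this example.

```python
def build_failure(pattern):
    """KMP failure table: length of the longest proper prefix that is also a suffix."""
    fail = [0] * len(pattern)
    k = 0
    for i in range(1, len(pattern)):
        while k > 0 and pattern[i] != pattern[k]:
            k = fail[k - 1]
        if pattern[i] == pattern[k]:
            k += 1
        fail[i] = k
    return fail


def kmp_contains(text, pattern):
    """Return True if pattern occurs in text, scanning text once."""
    if not pattern:
        return True
    fail = build_failure(pattern)
    k = 0
    for ch in text:
        while k > 0 and ch != pattern[k]:
            k = fail[k - 1]
        if ch == pattern[k]:
            k += 1
            if k == len(pattern):
                return True
    return False


def matched_percentage(source, suspect_chunks):
    """Percentage of suspect chunks that appear verbatim in source."""
    if not suspect_chunks:
        return 0.0
    hits = sum(1 for chunk in suspect_chunks if kmp_contains(source, chunk))
    return 100.0 * hits / len(suspect_chunks)
```

A naive search would give the same percentage; KMP only changes the running time, since it never re-reads characters of the text. Exact matching also misses reworded sentences, so the number is a lower bound on overlap under this definition.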
0
votes
0 answers

Bing exact match not working (how to find plagiarized content)?

We used to use Bing to find plagiarized content, but they stopped supporting exact match in search "". Do you know of any solution to arrive at results similar to exact match in Bing or another way of checking who copied our content to some other…
Forlis
0
votes
1 answer

Integrating Copyleaks SDK with Angular

I am trying to integrate the Copyleaks SDK with Angular to be able to check for plagiarism on two text area fields in an HTML form. On the HTML form, I am trying to integrate two buttons, one of which checks for plagiarism on one text area field, and the…
Hans P
0
votes
1 answer

Returned properties for AI detected text

Please provide an example of the returned object for AI-generated text. I'm providing some text to scan using the /v3/scans/submit/file/{scanId} endpoint with the properties.aiGeneratedText.detect flag enabled. I was expecting to receive an…
learntheropes
0
votes
0 answers

I have an error ImportError: cannot import name 'PlagiarismChecker' from 'plagcheck'

I want to write code to check for plagiarism. I chose the library plagcheck (https://github.com/codeclassroom/PlagCheck). I downloaded and installed the library and wrote the code, but when I run it, I get the error ImportError: cannot import name…