Plagiarism in the hiring process compromises the integrity of evaluations, making it difficult to assess a candidate’s ability. Plagiarism detection tools help maintain fairness, creating a level playing field that leads to better hiring decisions.
HackerRank employs two primary models to detect plagiarism specifically in coding questions:
-
MOSS (Measure of Software Similarity): This is enabled by default for all tests. The model compares the candidate code against others in our database. This method is effective in identifying plagiarism from external sources or copied test cases.
-
AI Plagiarism Detection: To detect sophisticated and AI-aided plagiarism attempts, HackerRank leverages an advanced AI-based model.
This article talks in detail about the MOSS model.
Plagiarism Detection Using MOSS
Moss tokenizes candidate code and compares these tokens to detect substantial overlaps. Common strategies to bypass plagiarism detection, such as changing variable names or introducing white spaces, are generally ineffective, as Moss focuses on the overall structure of the code.
Plagiarism detection is conducted for the same question across different candidates and versions of programming languages. The system flags a candidate’s recent submission based on behavioral signals against previous ones, also considering overlapping attempt durations.
MOSS plagiarism detection is turned on by default for all HackerRank tests.
Verifying Plagiarism in the Candidate Report
To review flagged plagiarism activity:
- Go to the Tests page and select the relevant test.
- Navigate to the Candidates tab and select the flagged candidate (indicated by a red triangle).
- The Suspicious Activity tile in the candidate’s test summary will provide plagiarism details.
- Scroll down to the question that is flagged for plagiarism and click on it to view the detailed report.
- The Attempt Activity section will display the flagged code. Review it and confirm or dismiss the plagiarism flag.
-
Click Yes or No based on the review:
- Yes retains the plagiarism flag.
- No removing it.
- Use the View Matches option to see code similarities, and click View in the Output Diff column for detailed differences.
Code Matching and Output Diff
The Match Percentage shows the degree of similarity between candidates’ code. Use the View Matches option to see highlighted differences in the codes, which helps in identifying potential plagiarism. Based on the percentage, flags are categorized as follows:
- High: ≥ 90%
- Medium: 80%–90%
- Low: < 80%
Plagiarism Indicators in Reports
In reports, plagiarism detection is presented as follows:
- Excel Report: The Attempt Plagiarism column indicates whether plagiarism was detected during a candidate's attempt. The result is displayed as either Yes or No.
- PDF Report: If plagiarism is detected in any of the candidate's responses, those specific question scores are flagged.
Limitations:
The limitation of this model is that it cannot determine the exact source of plagiarism. Hence it is recommended that you review the code playback for the flagged attempt before making a final decision.
Related Articles
- For more details, see the Plagiarism Best Practices Guide.
- Tab Proctoring and Copy-paste tracking can also be used for questions without plagiarism detection capabilities.
- For further guidance, refer to the Downloading PDF and Test Reports article.