First, the principle of duplicate checking.
1. hownet dissertations are tested as a whole article, and the format may affect the test results, so it is necessary to submit the final submission format for testing.
To minimize the impact, the impact of dozens of small segments may not be detected. Papers over 30,000 words can be ignored.
Yes
The comparative databases are: online publishing database of China academic journals, full-text database of Chinese doctoral dissertations/excellent master's dissertations of China.
Full-text database, full-text database of national important conference papers, full-text database of national important newspapers and periodicals, full-text database of China patents, personal comparison.
Library, other comparison libraries. Some books are not in the HowNet Library and cannot be detected.
2. After uploading the paper, the system will automatically detect the chapter information of the paper. If there is automatically generated directory information, the system will discuss it.
Text is detected by section, otherwise it will be automatically detected by section.
Some students reported that they explicitly quoted or copied paragraphs or sentences from other documents in their own paragraphs. Why didn't they detect them?
Please, it's normal. China Knowledge Network has set a threshold for the sensitivity of this detection system, which is 5%, which is lower than 5% in terms of paragraphs.
Plagiarism or quotation cannot be detected, which is common in small concepts such as clauses or large paragraphs. For example, if the paragraph 1 is detected.
There are 10000 words, so if a single document is cited with less than 500 words, it will not be detected. In fact, there is also a revised version for students.
The method is not to choose an article to quote from paragraph plagiarism, but to choose as many documents as possible and intercept a few words from an article. This is the case.
It won't be noticed.
4. How to detect plagiarism in a paper? The condition of hownet paper detection is that 13 consecutive similar or plagiarized words will be red.
Note, but you must meet the prerequisite in 3: that is, the total number of words in a document you quoted or plagiarized should reach each test paragraph.
To 5%.
Second, seven methods of quick duplicate checking
Method 1: Translation of foreign documents
Consult foreign literature in the research field, especially those in high-level journals, such as Science, Nature and Water Resources. , and put
Translate the theoretical explanation into Chinese and put it in your own paper.
Advantages: 1, everyone's language habits are different, and the translated Chinese is bound to be different. Therefore, even the same paragraph is translated by different people.
There was no plagiarism after that. Reading foreign literature can improve your English and broaden your professional horizons.
Disadvantages: Students with poor English, especially those with poor professional English, are more difficult to implement.
Method 2: Change the wording method
Rewrite the words in other people's papers, or change the sentence structure, active voice and passive voice, or change the key words, or pass
Excessive increase or decrease. Of course, if it is a classic sentence, it should be quoted in a classic way.
Advantages: 1 After the text is modified, according to HowNet program and algorithm, as long as there are no repeated 13 continuous words and keywords.
, it will not be marked as red. I know every word and sentence of the paper like the palm of my hand, and I know it by heart, and I will be like a duck to water when I reply.
Disadvantages: word-for-word revision is time-consuming and laborious.
Method 3: Change it into a passive sentence.
Advantages: convenient and quick, and can be modified in large sections.
Disadvantages: Sometimes it needs to be translated several times, and it has to be translated from Chinese into English, then into Algerian, and then into Chinese.
Method 4: Transform picture method.
Cut the words in other people's papers into pictures and put them in your own papers. Because at present, the duplicate checking system of HowNet can only look up words, but not.
Look for pictures and tables, so as to avoid double checking.
Advantages: It is more convenient and faster than Google Translation.
Disadvantages: If it is convenient to use, it is easy to see that the whole page is full of pictures, which will affect the number of words in the whole paper.
Method 5: Insert document method
Insert some quoted words into the paper in the form of word documents.
Advantages: This method is even better than method 4, because it can be re-edited in the inserted document later and the picture is changed.
After changing the law, it is not convenient to modify it.
Disadvantages: not found yet.
Method 6: Insert space method
Insert spaces between all the words in the article, and then adjust the spacing between empty words to a minimum. Because the basis of repeated checking is word-based,
Spaces cut off words and naturally skip the double-check system.
Advantages: Based on the principle of duplicate checking system, it has high reliability.
Disadvantages: the workload is huge, and the course can be completed through macros, but you need to learn the compilation of macros.
Method 7: Self-original method
Write your own paper, or don't copy and paste the original text when writing; Please add the quotation correctly.
Advantages: Basically, you will never worry about not passing the duplicate check, even if the threshold of the duplicate check system is lowered.
Disadvantages: If there are advantages and disadvantages, it is that after writing the graduation thesis, more brain cells may die. Ha ha. . .
Detailed description of hownet system calculation standard:
1. After reading the introduction of this system, I have a question. This system is good for text copy recognition, but what about other contents, such as data and charts? Isn't it useless to detect it?
Among all kinds of academic misconduct, plagiarism is the most common and serious. At present, the detection system has reached a high level. The detection of plagiarism and tampering of charts, formulas and data is currently under development and has made great progress. You are welcome to continue to pay attention to the progress of this detection system and put forward more critical and constructive opinions and suggestions.
2. According to this system, less than 39% is displayed in yellow, so does it mean that it is within the tolerable limit? Recently, I read the news that the national social science fund project of a teacher in Shanghai University was cancelled because two papers he published were plagiarized, accounting for 25% and 30% respectively. Please specify what the warning line is.
Percentage only describes the proportion of overlapping words in the detected documents, and does not refer to the plagiarism of the documents. It can only be said that the greater the percentage, the more overlapping words, and the greater the possibility of plagiarism. Whether it is plagiarism or not and the severity of plagiarism need to be decided by experts after review.
3. How to prevent the academic misconduct detection system of dissertations from becoming a platform for personal revenge?
This is something we are seriously considering. At present, this detection system is only used by users at the institutional level. We have established a strict management process. At the same time, technically, we have also taken various measures to prevent malicious acts as much as possible, including a series of strict identity authentication and login.
4. The minimum detection unit is one sentence, so you can't detect one or two words in each sentence?
We also deal with sentences accordingly, and have an algorithm of sentence similarity. It is not the same sentence that is judged to be the same. Sentences have sentence-level similarity algorithms, and paragraphs have paragraph-level similarity algorithms. Calculating whether a document or paragraph is similar to other documents is based on this.
5. If the original word is taken from relevant books, but the word has been copied from relevant documents in the database, that is to say, the previous article also took the same word from relevant books, but the words marked in my paper are from relevant books, is this academic plagiarism?
The detection system can't draw a conclusion, whether it is plagiarism or not, and finally there is manual review. So if it is the situation you describe, experts will make corresponding judgments. Our system only provides all kinds of clues and basis, so that people can quickly grasp the information of test literature.
6. The authority of HowNet detection system?
The detection system of academic misconduct documents has not reached a conclusion, that is, the detection system does not characterize the detection documents, but only shows the similarities between the detection documents and other published documents and lists objective facts, and whether such detection documents belong to academic misconduct needs the final examination and confirmation of experts.
How to detect plagiarism in papers? The condition of hownet paper detection is that 13 words with similar or plagiarized words will be marked in red, but the precondition in 3 must be met: that is, the sum of A documents you quoted or plagiarized should reach 5% in each detection paragraph.