1, all the carriers of knowledge are recorded-the definition of literature. These four elements are: knowledge content, information symbols, carrier materials and recording methods.
2. Original documents, secondary documents and tertiary documents-original documents, ordered documents and carefully classified secondary documents.
3. Document carriers: printed, miniature, audio-visual and electronic.
4. Literature publishing types and identification: books, periodicals, conference documents, scientific reports, patents, standard documents, dissertations and product materials.
5. Types and identification of electronic documents: electronic books, electronic periodicals and dynamic electronic information.
6. Document communication methods: direct communication (informal communication) and indirect communication (formal communication)
7. The law of literature distribution: the distribution of publication time, the latest information obtained through the Internet and the latest newspapers, magazines, radio and television media; 2 months to 10 years should be mainly solved through online books and periodicals database; In addition to periodicals, materials from 5 to 20 years ago can also be found through printed books and manual reference books.
Subject distribution, a subject-related literature is divided into three categories: core literature, related literature and marginal literature.
8. The internal and external characteristics of literature. Content characteristics and appearance characteristics: information irrelevant to the theme of document information is called appearance characteristics of document information. The information closely related to the subject content of document information is called the content characteristics of document information.
9. Search languages and types: Search languages mainly include classified languages characterized by code languages and subject languages characterized by object names and terms.
10 retrieval system type: catalog, title record, abstract, index.
1 1 classification language: Hypotaxis reflects the inclusion and subordination of conceptual extension.
12. Multi-concept topic classification: apply single concept topic classification, and pay attention to the juxtaposition of concept topic and application topic.
13. At present, the commonly used theme languages are keyword language and narrative language. Forbidden words: words with no retrieval meaning.
14, keywords and narrative language p43-44
Detailed: Keyword is a natural theme language, which refers to words frequently used by authors in literary works.
15, search steps: 1, and specify the subject requirements; 2. Understand the subject background knowledge and summarize the subject content of the course; 3. Analyze and extract the concepts involved in the topic; 4. Select the retrieval system; 5. Search and obtain the original text.
16, retrieval methods: direct method, indirect method: backtracking method, common method and comprehensive method.
17. Literature retrieval objects: literature, data and fact retrieval. Literature retrieval: bibliography and full-text retrieval
18. Necessary conditions for machine inspection: material conditions and personnel conditions. Material conditions include database, communication system and retrieval terminal. The quality of personnel includes: 1, the degree of understanding of the subject; 2. Master the retrieval system; 3, the degree of language mastery
19. Definition and composition of database: documents, records and fields. Three types: text type, digital type and multimedia type. The database includes at least one embedded document and one reverse document.
20. Inline documents: All records in the database are arranged according to the size of record numbers. Reverse document: record-based feature identification.
2 1, record expression: in a full-text database, a record is equivalent to a complete document; In bibliographic database, records are equivalent to abstracts or bibliographies.
22. Fields: basic fields and auxiliary fields.
23. Retrieval concept expansion methods: based on the same concept, based on content analysis and based on retrieval results.
24. Word cutting method: Word cutting characters generally include "?" 、"*"、"$"、"%"。 There are three problems to be paid attention to in word truncation: First, the word truncation is followed by the stem, and there can be no space between the word truncation and the stem. The second is to avoid cutting the stems of search words too short, which should generally be more than three letters. If the deadline is too short, not only the result is irrelevant, but also the system overhead is very high. Third, the rational use of word-cutting. Fourth, extract as many public letters as possible from the words you want to appear, so as to improve the accuracy.
25. Boolean logic operator: OR/AND/NOT
26. Priority Operator: Just add ().
27. position operators: w, n. With, Near .p69-70.
Details: W (band) and N (near), as we all know, we won't introduce them!
S (Emotion): It means that the two searched words must appear in the same S (sentence, but the order of the two words can be reversed!
F (field): refers to the search on both sides must appear in the same sentence, but the order of the two words can be reversed.
28. Domain restrictions: basic domain restrictions and auxiliary domain restrictions.
29. Adjustment methods of retrieval results: recall and precision. Recall rate: 1, decrease and increase or; 2. Use word breakers (? ); 3. Delete the existing field restrictions and position operator restrictions. Accuracy: 1, reducing synonyms or related words of the same family; 2. Add the concept of restriction AND connect them with AND; 3. Use field restrictions