How to narrow the retrieval scope of network information resources retrieval?

Question and answer \ x0d \ x0d \ 1. Briefly describe the concepts of information, knowledge and literature and their relationships. \x0d\ 1。 A: Information: It is the reflection of the way things exist, the state of motion and their characteristics, and it is the signal and news sent by things. \x0d\ knowledge: it is the result of thinking analysis, processing and refining, systematization and theorization of information reflection of various phenomena and laws in nature and human society. \x0d\ Literature: All carriers of knowledge are recorded \x0d\ Conceptually, information is greater than knowledge and literature, knowledge is a part of information, it is theoretical and systematic information, and literature is the recorded part of knowledge. \x0d\\x0d\2。 According to the different carrier forms, what kinds of documents are divided into, and examples are given. \x0d\2。 Answer: Literature can be divided into \x0d\ handwritten literature, such as Oracle Bone Inscriptions, Jinwen \x0d\ printed literature, such as books, periodicals \x0d\ microfilm, microfilm \x0d\ audio-visual literature, such as video tapes, audio tapes and sci-tech movies. \x0d\3。 A: According to the degree of processing, documents are divided into zero-level, first-level, second-level and third-level documents. \x0d\ Zero-order documents refer to unpublished experimental records, original recordings (images), letters, manuscripts, oral communication information or objects; The zero-order documents are published after being processed by the author, and become documents when they enter the field of social circulation; According to the characteristics of primary documents, the documents are sorted, processed and concentrated according to certain rules and methods, and the obtained documents are secondary documents; A document formed by synthesizing, analyzing, processing and refining the contents of a large number of first-class documents by using the clues of second-class documents is a third-class document. \x0d\\x0d\4。 How does computer search narrow the search scope? \x0d\4。 Answer: In computer retrieval, the methods to narrow the search range are: \x0d\( 1) adding search words connected by and, or \x0d\(2) defining \ x0d \ with specific sub-keywords; (3) defining the search with fields, such as titles. (4) The retrieval of literature types, languages, important journals, clinical core journals and years is limited. \x0d\(5) Search the input word \x0d\\x0d\5 in a more specific category. What are the methods to obtain the full text of literature? \x0d\5。 Answer: The methods of obtaining full-text are: \x0d\( 1) searching online full-text database \x0d\(2) using online publishing houses and magazines \x0d\(3) using library collection catalogue (joint collection catalogue) \x0d\(4) using Internet \x0d\6. \ x0d \x0d\7。 Briefly describe the principle of information retrieval \ x0d \ 7. A: The principle of information retrieval is: in essence, it is a process of comparing and selecting the information needs of users with the information stored in the information set, that is, matching. \x0d\\x0d\8。 What steps does computer retrieval usually include? \x0d\8。 Answer: The steps of information retrieval include: \x0d\( 1) analyzing the retrieval subject, and defining the purpose and requirements \x0d\(2) selecting the appropriate retrieval tool \x0d\(3) selecting the retrieval approach and determining the retrieval mark. \x0d\(4) Looking for literature clues. \x0d\(5) Browse the search results and get the original document. \x0d\\x0d\9。 Briefly describe the arrangement rules of IM subject index. \x0d\9。 Answer: The arrangement rules of subject index are as follows \x0d\( 1). The whole index is arranged by subject words \x0d\(2) Under the same subject word, it is arranged by sub-subject words \x0d\(3) The relevant contents of document titles are included in the corresponding subject words or subject words/sub-subject words respectively. General documents are directly under the subject words, and specific documents are under the corresponding sub-subject words; You can enter \ x0d \ under multiple subject words for the same document title. (4) Arrange the document titles under the same subject word or sub-subject word first, and then arrange the non-English documents. English translation titles of non-English documents are enclosed by "[]" to show the difference \x0d\(5) English documents are arranged in alphabetical order of publication names \x0d\(6) Non-English documents. \x0d\\x0d\ 10。 What kinds of reference systems are there in the word order table of medical thesaurus (MeSH)? Illustrate its significance with examples. \x0d\ 10。 A: The first group: substitute reference, which is used to deal with the equivalence between words. In the network thesaurus, there is only one scientific and commonly used synonym as the standard subject word, and other words as non-subject words (that is, entry words). It is manifested as "the item words see the subject words" and "the subject words x item words". For example, cancer see neo-plasmas neo-plasmas x cancer \ x0d \ group II: a generic reference, which is used to deal with the hierarchical relationship between words, that is, the relationship between the whole and the parts, the upper concept and the lower concept. For example, under some keywords with larger categories, some keywords with smaller categories are included, and the words with wider content range are keywords, while the words with narrower content range are not used as keywords, but only as secondary keywords. It is represented by "sub-subject words see under the subject words" and "subject words make multiple subject words". For example, please refer to Health Planning. Under Health Planning, Xu Health Priority cancels this reference relationship from 199 1, and all secondary keywords are upgraded to keywords. \x0d\ The third group: related references, which are used to deal with the correlation between words, thus expanding the search and improving the recall rate. It is represented by "keywords see related keywords" and "keywords XR keywords". For example, for population control, please refer to related family planning xr population control \ x0d \ In order to improve the recall rate and facilitate retrieval, "still consider" and "keyword/sub-topic phrase matching reference" have been added in 199 1 992 respectively. \x0d\\x0d\ 1 1。 According to the different storage contents, what types of databases are usually classified? \x0d\ 1 1。 Answer: Bibliographic database, fact database, numerical database, full-text database, image database \ x0d \ x0d \ 12. What is extended keyword retrieval and what are its advantages? \x0d\ 12。 Answer: By selecting the extended function of the subject word, the subject word and all its subordinate words can be retrieved at the same time, so that all documents indexed by the selected subject word and more specific subject words can be detected, and the retrieval recall rate can be better improved. Without expansion, only the documents indexed by the selected subject words will be detected, but the subordinate words, that is, those documents indexed by the subject words, will not be detected. \ x0d \ x0d \ 13。 What are recall and precision, and what is the relationship between them? \x0d\ 13。 A: The recall rate refers to the ratio of the amount of relevant documents detected by the system to the total amount of relevant documents in the system document library, which reflects the degree to which the actual amount of relevant documents in the system document library has been retrieved. \x0d\ recall ratio = number of related documents detected/total number of related documents in the document library × 100%\x0d\ precision ratio refers to the ratio of the number of related documents detected by the system to the total number of detected documents in a certain retrieval, reflecting how many documents are actually detected from the system document library at a time. \x0d\ precision ratio = number of related documents detected/total number of detected documents × 100%\x0d\ recall ratio and precision ratio are reciprocal. To search completely, it is necessary to gradually relax the scope and restrictions of retrieval, and as a result, many irrelevant documents will be brought in, which will affect the precision rate. It is not easy to try to improve recall and precision at the same time. On the one hand, it is inappropriate to emphasize and on the other hand, it is inappropriate to ignore. The recall and precision should be adjusted reasonably according to the requirements of specific topics to ensure the retrieval effect. \ x0d \ x0d \ 14。 What are the factors that affect the recall and precision? \x0d\ 14。 A: (1) Factors affecting the recall rate \x0d\ From the perspective of document storage, the main factors affecting the recall rate are: incomplete documents contained in the document library; Index vocabulary lacks control and specificity; The lexical structure is incomplete; The relationship between words is vague or incorrect; Unknown index; Inconsistent index; The indexer missed important concepts and inappropriate words in the original text. In addition, from the perspective of information retrieval, there are mainly: the retrieval strategy is too simple; Improper choice of words and logical combination; There are too few retrieval ways and methods; Retrieval personnel are unfamiliar with the business and lack patience; The retrieval system does not have the function of word segmentation and feedback, so it can't fully describe the retrieval requirements. \x0d\(2) Factors affecting the precision ratio \x0d\ The main factors affecting the precision ratio are: index words can't accurately describe the literature subject and retrieval requirements; The assembly rules are not strict; The choice of words and the relationship between words are incorrect; Index is too detailed; Assembly error; The specificity of the search words used in retrieval is not enough, and the retrieval range is wider than the retrieval requirements; The retrieval system does not have logical negation function and feedback function; The number of words allowed in the search formula is limited; Improper cut-off position, improper use of logical "or" in retrieval formula, etc. \x0d\\x0d\ 15。 What is the working principle of independent search engine? \x0d\ 15。 A: The working principle of an independent search engine is: \x0d\ Run search software regularly (or manually) to collect network information. \x0d\ Use indexing software to automatically index and establish a database. \x0d\ provides users with relevant information resource navigation, directory index and retrieval interface in the form of web pages, and users can input retrieval formulas. \x0d\ Search for matches through search software, and output them in the order of relevance. \x0d\\x0d\ 16。 How to improve the recall of retrieval? \x0d\ 16。 A: The way to improve the recall rate: \x0d\( 1) Reduce the specificity of search words, and select some superordinate words and related words to add to the search formula. \x0d\(2) Adjust the trap degree of the search formula and reduce the combination of logical AND. \x0d\(3) For family search, we can use classified search, or use a group of synonyms, synonyms and related words, and connect them with "or" in the search formula. \x0d\(4) Use word truncation technology to cancel some restrictive characters. \x0d\\x0d\ 17。 How to improve the accuracy of retrieval? \x0d\ 17。 Answer: The way to improve the precision rate: \x0d\( 1) Improve the specificity of the search term, and use standard word or free words with strong specificity. \x0d\(2) Add "and" to further define the theme concept. \x0d\(3) Define the detectable domain where the search words are located, and use position operators to control the order and position between words of the search words. \x0d\(4) Restrict the external characteristics of the exported document, such as the year, language and document type. \x0d\(5) Use logical negation to restrict the output of documents irrelevant to the problem. \x0d\(6) Search by subject words, not free words or less free words. \ x0d \ x0d \ 18。 According to China's patent law, what are the types of patents and how long are they valid? \x0d\ 18。 A: According to the patent law, the objects of protection are divided into three categories: invention patents, utility model patents and design patents \x0d\ invention patents, which are the main objects of protection in the patent law. The validity period of the invention patent right varies from country to country, and in China it is 20 years from the date of application. \x0d\ utility model patent. The validity period of utility model patent in China is 10 year from the date of application. \x0d\ design patent, the patent right of design in China is valid for 10 year from the date of application. \ x0d \ x0d \ 19。 What are the basic elements of an information retrieval system? \x0d\ 19。 A: The information retrieval system consists of \x0d\( 1) retrieval documents, that is, an organic collection of document retrieval marks. \x0d\(2) Technical equipment, that is, various technical equipment that can store information and its identification and realize storage and retrieval operations. \x0d\(3) People who act on the system, such as document processors, users, system maintainers and administrators. \x0d\(4) Search tools such as search language, document indexing rules and input/output standards. \x0d\\x0d\20。 What are inline documents and inline documents of a database? \x0d\20。 A: Sequential documents are directory documents with document records as the information storage unit and arranged in descending order according to the access times of document records. Because it stores the most complete information about each document, it is usually called a master file, which is equivalent to the text of a printed retrieval tool. Reverse document is a document obtained by extracting all detectable fields or attribute values from records and reorganizing them in a certain order. Reverse gear is derived from the main file, so it is also called auxiliary file. \x0d\\x0d\2 1。 What are the main tools for obtaining network information resources? \x0d\2 1。 Answer: The main tools for obtaining network information resources are: \ x0d \ (1) e-mail \ x0d \ (2) FTP \ x0d \ (3) Archie \ x0d \ (4) telnet \ x0d \ (. \x0d\22。 A: A few journals that publish a large number of high-quality professional papers in a certain discipline are called core journals. Their main features are: \x0d\( 1) professional documents have high density and large amount of information; \x0d\(2) has a high level, which represents the latest development level of this discipline; \x0d\(3) The publication is relatively stable and the published literature has a long life; \x0d\(4) High utilization rate and cited rate. \x0d\\x0d\23。 Try to describe the concept and types of Internet search engines. \x0d\23。 A: Search engines are the general term for web pages with query function on the Internet, that is, they allow users to submit queries, retrieve the results list of web pages related to queries, and sort them. \x0d\ can be divided into comprehensive search engines and professional search engines according to the retrieval contents; \x0d\ Classification by retrieval function: directory search engine, full-text search engine and intelligent search engine; \x0d\ Classification by search scope: independent search engines and meta-search engines. \x0d\\x0d\24。 Briefly describe the concept and characteristics of conference papers, and what conference paper retrieval systems are there at home and abroad? \x0d\24。 A: Conference documents refer to papers, reports, speeches and other conference-related documents published at various scientific and technological conferences. \x0d\ conference documents have the characteristics of timely information dissemination, focused theme, novel content, strong professionalism, high quality, large quantity and various publishing types, which often represent the latest academic research achievements in a certain discipline or professional field and basically reflect the academic level, research trends and development trends of the discipline or profession. Conference literature is one of the important information sources for sci-tech novelty retrieval. \x0d\ Foreign conference papers retrieval tools include: ISI's proceedings(ISTP+is shp);); Papers first (sub-library of OCLC first search); IEEE/IEEE electronic library; AIPCofference Proceedings\x0d\ The domestic systems that can retrieve conference papers are: China full-text academic conference papers database (Wan Fang data); China hownet \ x0d \ x0d \ 25. Hong Zhou found 172 documents for the first time when searching for a topic. What method do you suggest to narrow down the search? \x0d\25。 A: There are ways to narrow the search scope: \x0d\( 1) to improve the specificity of search words and replace them with standard word or free words with strong specificity. \x0d\(2) Add "and" to further define the theme concept. \x0d\(3) Limit the detectable domain where the search term is located. \x0d\(4) Use positional operators to control the order and position between words. \x0d\(5) Restrict the external characteristics of the exported document, such as the year, language and document type. \x0d\(6) Use logical negation to restrict the output of documents irrelevant to the problem. \x0d\(7) Search by subject words, not free words or less free words. \ x0d \ \ x0d \ 26。 What's the difference between MEDLINE and Pubmed? \x0d\26。 A: The difference between Pubmed and MEDLINE is mainly reflected in the scope of \x0d\( 1) literature: Pubmed includes not only MEDLINE database, but also PREMEDLINE database and data directly provided to Pubmed by publishers. \x0d\(2) Database update time: MEDLINE online database is updated weekly, MEDLINE CD reports the time difference of 1-3 months, and Pubmed updates it daily \x0d\(3) Retrieval mechanism: MEDLINE only has two topics and authors, and mastering the Mesh table is the first condition for retrieval, while Pubmed has the functions of automatic vocabulary conversion and matching and automatic keyword expansion. \x0d\(4) Link function: Pubmed has the link function with the original text, related literature, related books and related molecular biology databases \x0d\(5) Free full text: Pubmed provides free full-text browsing and retrieval function for more than 60 journals. \x0d\(6) Fees: MEDLINE CD needs to rent the database every year, and Pubmed is free. \ x0d \ x0d \ 27。 What are the concepts and characteristics of patent documents? \x0d\27。 A: Patent documents are the general names of official documents and their publications produced by countries and international organizations that implement the patent system in the process of patent examination and approval. Patent literature has the following characteristics: \x0d\( 1) Patent literature is a huge and extensive strategic information resource, which integrates technical, legal and economic information; \x0d\(2) Facilitate the dissemination of the latest technical information in the literature; \x0d\(3) The patent document format is standardized, with a high degree of standardization and a unified classification system, which is convenient for retrieval and reading; \x0d\(4) The disclosure of inventions in patent documents is complete and detailed, and the technical content is relatively reliable. \x0d\\x0d\28。 What databases can I choose to search for Chinese and foreign literature on "Hepatitis B Epidemiology"? Please select a database and briefly retrieve the policy. \x0d\28。 Answer: The Chinese retrieval systems that can be used to search for "hepatitis B epidemiology" literature are: CNKI, VIP, Wanfang, CBM, etc. \x0d\ Optional foreign language retrieval systems include MEDLINE, Pubmed, EDSCO, springerlink, Open Access, etc. Among them, the retrieval strategy in CNKI is: \x0d\ Keywords: hepatitis B; Logic: or \x0d\ Search term: article name; Keywords: hepatitis B; Logic: and \x0d\ search term: subject; Key words: the retrieval strategies in epidemiology \ x0d \ CBM are: hepatitis, B/[ extended whole tree] epidemiology \ x0d \ x0d \ 29. What retrieval system can be used to find out whether there is a periodical of Computer Communication in the library of Jining Medical College? \x0d\29。 Answer: The available retrieval systems are: library online retrieval system, superstar digital books and scholar digital library \ x0d \ x0d \ 30. Please list five advanced Google search grammars and illustrate their usage with examples. \x0d\30。 Answer: You can choose five examples of the \ x0d \ (1) "link" operator in the following syntax: find all the webpages linked to a specific webpage, or realize the \ x0d \ (2) "correlation" operator in advanced search mode: automatically find similar webpages at the same level as a webpage, or. X0d \ (3) "Info" operator: you can get all the information of a webpage in Google's inventory \ x0d \ (4) "Site" operator: you can limit the search results to a certain domain, or \ x0d \ (5) you can use the "domain" option in the advanced search interface to carry out "All Intitle" operator: all questions are required. (6) "Allinurl" operator: Require all questions to appear in URL \ x0d \ (7) "inurl" operator: Require questions to appear in URL \ x0d \ (8) "filetype:" operator is a very powerful and practical search syntax developed by Google. The second is the document type, which can restrict access to a certain document type's web pages, and then take the document type as an example. \x0d\\x0d\3 1。 How many ways can I find the literature published by an author in CBMWin4.0 version 4.0? What is the difference? \x0d\3 1。 A: There are four \x0d\ "basic search \" and three ways to search for authors: \x0d\ First, select the "Author" field in the "Search Entry" drop-down menu, and then enter the author's name in the search question box; \x0d\ Second, enter the author's name directly in the query box, followed by "in au"; \x0d\ Third, directly enter "au=" in the search question box, followed by the author's name. \x0d\ Author search mode: enter the author's name directly in the search question box. \x0d\\x0d\32。 What are the main types of evidence in evidence-based medicine? \x0d\32。 According to the quality and reliability, it can be roughly divided into the following five grades (the reliability is correspondingly reduced): \x0d\ Grade 1: According to the specific treatment of specific diseases, all reliable randomized controlled trials are collected for systematic evaluation or meta-analysis; \ x0d \ level: the results of randomized controlled trials with sufficient single sample size; \x0d\ Level 3: Studies with control group but not randomly grouped; \x0d\ level 4: observation of a series of cases without control; \ x0d \ level: expert opinions or evidence based on physiology, pathophysiology and basic research. \x0d\\x0d\33。 List six databases that can retrieve citations. \x0d\33。 A: The databases that can retrieve citations are: (any of the following six can be written) \ x0d \ (1) sci \ x0d \ (2) ssci \ x0d \ (3) h&ACI \ x0d \ (4) China scientific papers and citations. X0d\(7) Citation Database of China Sci-tech Journals \x0d\(8) Chinese Citation Database of China Knowledge Foundation Engineering \x0d\(9) Chinese Biomedical Literature Database \x0d\34. Types and characteristics of information retrieval. \x0d\34。 Answer (1) Personal information source: A large number of personal information about a certain field and dynamic information about the development of various things are gathered. \x0d\(2) Organization information sources: mainly refer to the internal information sources within the organization. \x0d\(3) Physical information sources: information resources in the form of cultural relics, product samples, models, sculptures and other physical objects. \x0d\(4) Literature information sources: knowledge and information resources recorded on various carriers in various ways such as words, pictures and symbols. \x0d\ 1) Books: Books, also known as books, refer to publications with relatively mature contents, complete data systems and finalized binding forms. \x0d\2) Journals: Journals, also known as magazines, refer to those continuous publications that are published regularly or irregularly and collect works of many authors. \x0d\3) Dissertation: It is a professional research paper written by undergraduate and graduate students to obtain degree qualification. \x0d\4) Science and technology report: it is a formal report as a scientific research achievement. \x0d\5) Patent document: A legal document issued by a special government agency, which describes in detail the design and manufacturing process of an invention and indicates that the owner of the invention has the right to manufacture, use and sell it within a certain period of time. \x0d\6) Standard documents: normative documents of technical specifications approved by authoritative organizations and available for people to implement. \x0d\7) Product samples: documents issued by manufacturers to introduce and promote their products. \x0d\8) Conference documents: refer to papers and reports published at international and domestic professional academic conferences. \x0d\9) Government publications: documents produced by government agencies and published by designated publishing institutions. \x0d\(5) Electronic information source: mainly refers to information that can be stored and disseminated by electronic technology. \x0d\35。 Composition of the database. \x0d\35。 A: A database consists of documents, records and fields. \x0d\( 1) document: it is related records and combinations organized according to a certain structure. \x0d\(2) Record: it is the basic data unit that constitutes a document. \x0d\(3) field: it is a smaller unit than a record, and it is the basic unit that constitutes a record. \x0d\36。 Classification language. \x0d\36。 Answer: Classification language gives corresponding classification numbers according to the subject attributes of documents, expresses the subject concept of documents with the classification numbers, and arranges them into a classification index according to the order in the classification table, which provides a classification method for searching documents. \x0d\ classification language uses the principle of logical classification to collect documents according to the theme and specialty of document content, and reveals the differences and connections of various documents from the perspective of knowledge classification. \x0d\ classification language well reflects the systematicness of disciplines and specialties. \x0d\ classification language can be divided into system classification language and assembly language, and the former is widely used in China at present. \x0d\36。 Topic language. \x0d\37。 A: Topic language is a language that uses highly generalized words to describe the theme of a document and uses it as a symbol to organize the retrieval system. \x0d\ Topic languages are divided into headline words, meta words, keywords and descriptors. \x0d\( 1) Title words: words, phrases or phrases selected from natural languages and standardized to express the concept of things. \x0d\(2) Meta-words refer to the smallest and most basic words that can be used to describe the topics discussed in the literature. \x0d\(3) Description: Use words that can express literary themes and be rigorous.