In today's information age, with computers and networks developing rapidly, do traditional document retrieval institutions (such as libraries) still have a reason to exist? Why?

The full text exceeds 10,000 characters. When you see this, send me your email address through Baidu messages.

Indexing services are the main way for the China Index Society to reach out to society.

Editorial Department of China Index

At its meeting, the Society's new leadership decided to strengthen indexing services from 2004 onward and to strive to contribute more to the development of indexing in China.

The purpose of our Society and the principles guiding its members' activities are sincerity, truth-seeking, pioneering spirit and dedication. Developing indexing services is the main way for the Society to reach out to society, integrate into the information service industry and advance the cause of indexing in China.

In the 12 years since the Society was founded, we have done some work in indexing services, mainly by holding more than ten training courses on indexing. A few members have been commissioned to compile indexes for various organizations, have provided indexing and design consulting for some databases, and have publicized and popularized knowledge of indexing and databases through articles and conference exchanges. On the whole, however, this service work has been piecemeal, extremely limited and far from reaching any scale.

In terms of the number of historical documents and contemporary books and periodicals, China can be called "the country of literature". However, the scale of China's indexing industry lags far behind that of developed countries. Promoting the rapid development of indexing is the unshirkable responsibility of our professional academic society and all its members. Strengthening indexing services is therefore urgent.

Carrying out indexing services in various forms will not only do more practical good for the development of indexing in China, but also promote the Society's academic research and exchange activities and raise the professional level of its members.

The scope and methods of indexing services are extensive and diverse, which can be summarized as the following aspects:

(1) Publicity and popularization of index and database knowledge;

(2) Professional training for indexing and database personnel;

(3) Organizing members to compile various indexes and databases in their spare time;

(4) Develop and provide special software for compiling various indexes and databases, and promote the commercialization of such software;

(5) Consulting on the design and compilation of indexes and databases;

(6) Evaluation and certification of members' indexing skill levels;

(7) Intermediary activities for indexing services;

(8) Establishment by the Society of index and database enterprises, either independently or jointly with relevant units (such as library and information institutions, publishing institutions, websites and database companies).

In order to organize the indexing service, two surveys should be conducted at present: the survey of the object of indexing service and the survey of the members who are willing to participate in indexing service.

The survey of indexing service clients mainly covers: the types and carrier forms of the literature resources (manuscripts, printed or digital resources), their disciplines or specialties, language style (classical or vernacular Chinese), years and quantity; the type, retrieval performance and quality requirements of the index or database to be compiled; the deadline for completion; acceptable payment standards; and so on.

The survey of members willing to take part in indexing services mainly covers: whether they have experience in compiling indexes or databases or have received professional training in this area; when they are available for indexing work; their discipline or specialty; whether they can read classical Chinese; whether they can use a computer; acceptable remuneration standards; and so on.

To do a good job in training, it is necessary to formulate a teaching syllabus and to compile the corresponding teaching materials and reference materials.

It is necessary to develop several kinds of indexing and database compiling software to determine which indexes and databases are worth popularizing.

For indexing services to proceed smoothly, the Society's professional committees, publications and website should cooperate closely, with a clear division of responsibilities.

We should also strive for the help of library and information teaching units and other relevant units.

Index and database forum

The Indexable Content of Documents

Zhang Qiyu

(Department of Information Management, Nanjing University of Politics, Shanghai Campus, 200433)

Accurately determining the indexable content of documents is an important link in controlling the quality of document indexing. This paper discusses: the criteria for determining the indexable content of documents; the hierarchical, inclusive, implicit and multi-purpose nature of indexable content; the relativity of indexable content; examples of the indexable content of documents in various subjects; and examples of how the characteristics of various document types determine their indexable content.

Keywords: indexable literature content, literature subject analysis

1. Accurately determining the indexable content of documents is the first checkpoint in controlling the quality of document indexing

Document indexing is the process of finding out for which disciplines, specialties and readers (users) all or part of the content of a document has reference value, and of expressing that content with retrieval identifiers (search terms, classification numbers, index headings, etc.) so as to provide accurate and effective access points.

Determining the indexable content of a document is generally called subject analysis. It is the process of finding out for which disciplines, specialties and readers (users) the content of the document has reference value, and it is the prerequisite for then expressing that content accurately with retrieval identifiers. Accurately determining the indexable content and accurately expressing it with retrieval identifiers are the preconditions for a retrieval system to achieve high retrieval efficiency (mainly recall and precision). Accurately determining the indexable content of documents is therefore the first checkpoint in controlling the quality of document indexing.
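The two efficiency measures named above can be made concrete with a small calculation. The following is a minimal sketch under the usual definitions (recall is the share of relevant documents that were retrieved, precision is the share of retrieved documents that are relevant); the function name and the document identifiers are illustrative assumptions, not part of the original article.

```python
def recall_and_precision(retrieved, relevant):
    """Return (recall, precision) for one search.

    retrieved: identifiers of the documents the search returned
    relevant:  identifiers of the documents that actually meet the need
    """
    retrieved, relevant = set(retrieved), set(relevant)
    hits = retrieved & relevant  # relevant documents that were found
    recall = len(hits) / len(relevant) if relevant else 0.0
    precision = len(hits) / len(retrieved) if retrieved else 0.0
    return recall, precision


# Hypothetical search: four documents returned, three are actually relevant.
retrieved_docs = {"d1", "d2", "d3", "d4"}
relevant_docs = {"d1", "d2", "d5"}
recall, precision = recall_and_precision(retrieved_docs, relevant_docs)
print(f"recall={recall:.2f}, precision={precision:.2f}")
```

In this example one relevant document ("d5") was never assigned a matching identifier, which lowers recall, while two irrelevant documents were returned, which lowers precision.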

2. Criteria for determining the indexable content of documents

2.1 The reference-value criterion

This is the most important criterion for determining the indexable content of a document. Indexable content must be: ① discussed fairly specifically in the document; ② of some reference value; ③ capable of being the object of a search.

Content that has no reference value should not be indexed, because retrieving such content only wastes the reader's valuable time, and even money, without giving anything in return.

For paper retrieval tools, the contents that have no reference value include:

① Content that is neither stated nor implied in the document, or that is only mentioned without specific discussion;

② Content that is discussed specifically in the document but has no reference value, such as well-known or retrospective material;

③ Introductions, theses, supporting arguments, reasoning, materials, equipment, methods, means, experiments and conclusions;

④ Hypothetical content;

⑤ Items or content used only for comparison;

⑥ Quotations, metaphors and figurative content;

⑦ Work that has not been carried out;

⑧ For a special piece of equipment and its method, only one of the two should be extracted, depending on the case; extracting both is redundant;

⑨ Unnecessary search features. For example, for "charging equipment at the top of a high-pressure blast furnace", if the indexable content is determined as "blast furnace + top + charging equipment", then "top" is redundant;

⑩ Content that has reference value but falls outside the subject scope of the particular retrieval tool or retrieval system.

For a monograph (book) index, the criterion for indexable content can be relaxed slightly. Specifically, the names of various items that are not discussed specifically but have retrieval significance can also be treated as indexable content.

The so-called reference value standard should be adjusted according to the authority of literature, the amount of information and the needs of readers.

2.2 Priority selection criteria for key content and novel content

When the indexing depth of a retrieval tool is limited, priority should be given to the key content and novel content of the document as indexable content.

2.3 The appropriate-quantity criterion

The indexable content of a document is usually called the document's topic. In theory there is no limit to the number of topics that can be analyzed out of a document. In practice, however, many factors affect the amount of indexable content:

① The amount of information in a specific document is not unlimited. If less indexable content is identified than the document actually contains, searches will miss the document; if more is identified than it actually contains, searches will return it as a false hit;

② The restrictions imposed by the type of information retrieval language used;

③ The limits of the subject scope of the retrieval tool being compiled;

④ The requirements of readers (users), such as general readers and pure research readers;

⑤ The manpower and time available.

2.4 The accurate-generalization criterion

Once the indexable content of a document has been determined, any generalization of it into a subject concept must be accurate.

3. Indexable content is hierarchical, inclusive, implicit and multi-purpose

3.1 The hierarchical nature of indexable content

Regardless of whether a document has a chapter structure or not, its indexable content can generally be divided into the following three levels:

The highest level covers the whole content of the document, that is, the overall theme;

The middle level covers only part of the document, that is, the local themes;

The lowest level is content that cannot form a theme on its own, that is, the theme factors: the names of places, people, institutions, events, organisms, minerals, products, equipment, methods, processes, formulas, data, works and other items mentioned in the document. Although these names are not discussed specifically in the document, they can lead to relevant knowledge and information and therefore have some retrieval significance.

The three levels correspond to the extent to which the document discusses a given piece of knowledge. In terms of importance, the overall theme outranks the local themes, and the local themes outrank the theme factors.

When determining indexable content, title retrieval tools and monograph indexes weigh the three levels differently.

3.2 The inclusive nature of indexable content

As the relationships described in section 3.1 show, a local theme is a refinement of the overall theme, so the overall theme contains the content of its local themes; a theme factor is a refinement of a local theme, so a local theme contains the content of its theme factors. In document indexing it is necessary to break out local themes and theme factors in addition to the overall theme, because doing so provides more access points, lets a document match more retrieval needs, and lets readers looking for specialized content find it directly and conveniently.

When the indexing depth is limited, some or all local topics can be omitted for the title retrieval tool, and some theme factors can be omitted for the monograph index.
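The three levels and the effect of a limited indexing depth can be sketched as a small data structure. This is only an illustration under stated assumptions: the names `Level`, `Candidate` and `select_entries`, the sample terms, and the rule "keep the most important levels first" are hypothetical, and a real retrieval tool would apply its own policy for each level.

```python
from dataclasses import dataclass
from enum import IntEnum


class Level(IntEnum):
    """Levels of indexable content; a lower value means higher importance."""
    OVERALL_THEME = 1
    LOCAL_THEME = 2
    THEME_FACTOR = 3


@dataclass(frozen=True)
class Candidate:
    term: str
    level: Level


def select_entries(candidates, max_entries):
    """Keep at most max_entries candidates, most important level first.

    sorted() is stable, so candidates on the same level keep their
    original order.
    """
    ranked = sorted(candidates, key=lambda c: c.level)
    return ranked[:max_entries]


# Hypothetical candidates from one document.
candidates = [
    Candidate("charging equipment", Level.LOCAL_THEME),
    Candidate("blast furnace", Level.OVERALL_THEME),
    Candidate("pressure data", Level.THEME_FACTOR),
]

selected = select_entries(candidates, max_entries=2)
print([c.term for c in selected])
```

With a depth of two entries, the theme factor is the first thing to be dropped, which mirrors the order of importance given in section 3.1.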

3.3 The implicit nature of indexable content

Implicitness means that some documents contain indexable content that is not stated directly and explicitly in words. For example, in the article "Ideal varieties to replace high-residue fungicides", the indexable content appears to be "fungicides, (new) varieties". The words "high-efficiency, low-toxicity pesticides" are never used, yet the article in fact implies the indexable content "high-efficiency, low-toxicity pesticides".

3.4 The same document content can meet various needs

The same document can serve as a reference for various research topics and can meet various needs, so the same document content can yield multiple indexable contents. For example, an article commemorating the 50th anniversary of the War of Resistance against Japanese Aggression can be indexed under both the War of Resistance against Japanese Aggression (1937-1945) and the July 7th Incident (1937). The article "Coronary Heart Disease Caused by Vasopressin Treatment of Shock" can be indexed under: shock, drug therapy; vasopressin, clinical application; coronary heart disease, chemically induced; vasopressin, side effects; and so on.
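One way to picture a document serving several needs is an index that maps each heading to every document filed under it. The sketch below is an assumption-laden illustration: the function `build_index`, the identifiers "article-1" and "article-2", and the second article are hypothetical, and the headings paraphrase the vasopressin example above.

```python
from collections import defaultdict


def build_index(assignments):
    """Invert {document: [headings]} into {heading: set of documents}."""
    index = defaultdict(set)
    for doc_id, headings in assignments.items():
        for heading in headings:
            index[heading].add(doc_id)
    return index


# "article-1" stands for the vasopressin article; "article-2" is a
# hypothetical second article that shares only one heading with it.
assignments = {
    "article-1": [
        "shock, drug therapy",
        "vasopressin, clinical application",
        "coronary heart disease, chemically induced",
        "vasopressin, side effects",
    ],
    "article-2": ["shock, drug therapy"],
}

index = build_index(assignments)
print(sorted(index["shock, drug therapy"]))
print(sorted(index["vasopressin, side effects"]))
```

A reader interested in the treatment of shock and a reader interested in drug side effects reach the same article through different headings.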

4. The relativity of indexable content

(1) The content expressed by a given retrieval identifier may be the overall theme of one document, a local theme of another and only a theme factor in a third. Identical identifiers that summarize the indexable content of different documents can therefore stand for very different amounts of information;

(2) A given piece of knowledge may count as indexable content in one retrieval tool but not in another.

5. Examples of the indexable content of documents in various subjects

Various subject documents have some unique indexable contents, such as:

Historical documents: countries, regions, dynasties (historical periods), nationalities, figures, institutions and groups, events (such as revolutions, uprisings and wars), political, economic, military, cultural and educational systems, policies, reform measures, meetings, regulations, works, etc.;

Local documents: historical changes, economic (industrial) conditions, nature, climate, mineral resources, products, nationalities, customs, languages, culture, education, people, administration, institutions and groups, events, mountains and rivers, transportation, places of interest, etc.;

Medical literature: diseases, diagnosis, treatment, nursing, prevention, pathology, etiology, diagnostic and treatment methods and instruments, drugs, etc.;

Agricultural literature: crops, varieties, breeding, cultivation techniques, field management, pests and diseases, agrometeorology, soil, fertilizers, pesticides, etc.;

Industrial technical documents: industrial product varieties, structures and parts, design, manufacturing materials, manufacturing equipment, manufacturing technology, etc.;

Educational literature: educational theory, educational thought, educational systems, educational institutions, educational figures, school management, teaching methods, the various levels of education, etc.

The indexable content of documents in various subjects is the main basis for selecting the categories of terms in thesauri and classification tables.

6. Examples of determining indexable content from the characteristics of various document types

Academic monographs: the indexable content as a whole should be at least equivalent to a detailed outline of the monograph, and no theme factor with retrieval value (concepts with retrieval significance, especially new concepts) should be omitted;

Encyclopedias: every entry should be indexable, and specific content under some entries can also be treated as indexable content;

Dictionaries: in a specialized or defining dictionary, each entry can be treated as indexable content. In a bilingual dictionary, every word can generally be treated as indexable content (as in the Chinese-English index attached to an English-Chinese dictionary);

Yearbooks: in addition to all chapters, the local themes under the chapters and the theme factors (concepts) with retrieval significance should also be indexed;

Textbooks: indexable content should be defined broadly, and technical terms should be treated as indexable content as far as possible, so that learners can look them up easily;

Journal papers: the overall theme must be indexed, and whether local themes are indexed depends on the specific requirements of the retrieval tool. For the literature of adjacent specialties, only the local themes useful to the tool's own specialty are selected as indexable content. Theme factors are generally not treated as indexable content.

References

1 Zhang Qiyu. On the index entry. Library Journal, 1994(5).

2 Qian Qilin. China thesaurus indexing manual. Science and Technology Literature Publishing House, 1985: 296.

Zhang Qiyu, Professor of Information Management Department, Shanghai Branch of Nanjing University of Political Science, Vice Chairman of China Index Society.

International conference news and calls for papers

1. 10th anniversary conference of the Association of Southern African Indexers and Bibliographers

The Association of Southern African Indexers and Bibliographers will hold its 10th anniversary conference in Johannesburg on September 6-7, 2004. The theme of the conference is "Book Indexing and Beyond". The conference warmly welcomes indexers, librarians, publishers, researchers and archivists from all over the world to take part and to discuss the future of indexing. Contact information is as follows:

Marlene Burger (for papers): burgem@unisa.ac.za

Tel: +27 12 4296585

Fax: +27 12 4293221

2. 2005 annual conference of the Australian Society of Indexers

The next annual conference of the Australian Society of Indexers will be held in Melbourne from 6 to 9 March 2005. The theme of the conference is "Indexing: Engage, Enlighten, Enrich". The conference is not limited to the traditional back-of-book index and will discuss the compilation of indexes of all kinds across disciplines. Contact information is as follows:

Australian Society of Indexers Conference Committee

PO Box 1251

Melbourne 3001

Telephone/fax: +61 3 9500 8715

Email: mindexer@optusnet.com.au

Theory and Application of Citation Index (Ⅱ)

By Eugene Garfield; translated by Liu Yu; revised by Hou Hanqing

Chapter X Citation Analysis of Scientists (Continued)

Merits and Demerits

Any fair appraisal of citation analysis as a tool for evaluating scientists must admit that much about the meaning of citation frequency is still unclear. We still cannot say precisely what quality of scientific performance it measures. We know little about how sociological factors affect citation frequency. We know little about all the possible reasons for a low citation frequency. And there is still much to learn about how citation patterns vary from field to field. That is one side of the matter.

On the other hand, we know that citation frequency says something about the contribution made by a person's scientific work, at least in terms of the usefulness and interest that colleagues in the scientific community find in it. We know that a high citation frequency correlates with peer experts' judgments of the excellence and importance of scientific contributions. And we now know enough about overall citation patterns and the variables that affect them to design a useful statistical model for predicting a scientist's lifetime citation frequency (expressed as the average citation frequency per paper).

This model was developed and tested by N. L. Geller, J. S. de Cani and R. E. Davies, and it is based on what is known about overall citation patterns and the annual growth of the scientific and technical literature. The input to the model is the citation history of a scientist's existing papers (covering at least 4 years). From this, the model projects the total citation frequency of each paper over a 40-year period (40 years being taken as the lifetime of a paper). The average lifetime citation frequency per paper is then calculated from the 40-year totals. The model also includes an identification technique for picking out papers that deserve special attention because their citation histories show a pattern clearly different from that of ordinary papers.
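The arithmetic of the final step, averaging projected 40-year totals over a scientist's papers, can be sketched as follows. This is not the Geller, de Cani and Davies model: the text does not give its projection formula, so `project_lifetime_total` below uses a deliberately naive stand-in (it assumes each paper keeps being cited at its recent average rate), and all names and sample figures are hypothetical.

```python
LIFETIME_YEARS = 40      # lifetime of a paper, as stated in the text
MIN_HISTORY_YEARS = 4    # minimum citation history the model requires


def project_lifetime_total(yearly_citations):
    """Project a paper's 40-year citation total from its history so far.

    ASSUMPTION: a naive placeholder that extends the average of the last
    four observed years unchanged; the published model instead relies on
    overall citation patterns and literature growth.
    """
    if len(yearly_citations) < MIN_HISTORY_YEARS:
        raise ValueError("at least 4 years of citation history are required")
    observed = sum(yearly_citations)
    remaining_years = max(LIFETIME_YEARS - len(yearly_citations), 0)
    recent_rate = sum(yearly_citations[-MIN_HISTORY_YEARS:]) / MIN_HISTORY_YEARS
    return observed + recent_rate * remaining_years


def lifetime_average(histories):
    """Average projected lifetime citation total per paper."""
    totals = [project_lifetime_total(h) for h in histories]
    return sum(totals) / len(totals)


# Hypothetical citation histories (citations per year) for two papers.
histories = [[2, 4, 3, 3], [0, 1, 1, 2, 0]]
print(lifetime_average(histories))
```

Whatever projection is used, the quantity compared between scientists is the per-paper average rather than the raw total, so an author with many papers is not favored simply for publishing more.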

Developing such a model is an important step towards using citation statistics systematically and reducing the influence of methodological errors. However, we still need to be cautious and to understand the limitations of citation statistics as a relative measure of scientific performance. As with any method, the validity of the results of citation analysis depends heavily on the skill with which the analysis is carried out. The apparent simplicity of citation counting masks many subtle issues in citation statistics. Superficial citation studies that ignore these subtleties are often seriously misleading. Sound citation research requires a full understanding of the complications involved in comparing citation frequencies, especially when the frequencies are low.

The arguments for and against using citation frequency as an evaluation tool were discussed earlier. Two basic facts stand out in this debate. First, as scientific research institutions grow larger and play an increasingly important role in society, it becomes more difficult, more expensive and more necessary to evaluate and identify the people and groups within them who have made the greatest contributions. Second, citation measures have proved to be a valid form of peer judgment, one that introduces a useful element of objectivity into the evaluation process and costs only a small fraction of what survey techniques cost. Citation analysis may sometimes take more time and effort than purely intuitive judgment, but professional evaluation is certainly important enough to justify that investment.

Conclusion: The future of the citation index method

The main content of this book is the past and present of the citation index method. What will its future be?

All signs suggest that the future of the citation index will be as strong as its present. As a tool for retrospective searching of the scientific and technical literature, the citation index will continue to spread, and its spread may well accelerate.

The accelerated popularization of citation index method will depend on the following factors:

The popularity of online retrieval in the world will make the SCI retrieval system mastered by most scientists and scholars. The improvement of online retrieval software will make people familiar with SCI make more use of this retrieval system.

More and more people realize that citation index is an integral part of research evaluation, which will guide more scientists to learn to use SCI. Excellent "referees" will hopefully stop those seemingly just and legal practices that are actually compromises.

It will become more and more convenient to retrieve databases such as SCI and SSCI. Micro-storage technology or microcomputer will eventually make it economically feasible for individual scientists or scientific research institutions and groups to have their own databases. At the same time, the decreasing cost of online retrieval and remote connection to the database stored in the central computer will increase the convenience of retrieval.

In addition, ISI is seriously considering the idea of compiling a professional citation index based on the periodical literature of a certain discipline. However, citing documents and cited documents will reflect the extensiveness and dispersion of this subject. This professional citation index will include all the citations of the authors of the subject and those of non-authors pointing to the subject.