How to use retrieval tools to find historical documents

Literature classification and retrieval

The study and compilation of indexes is closely tied to changes in the way documents circulate. The wide application of computer-based technology to document circulation has challenged index theory and compilation practice and created pressure for reform. The rise and fall of index research in China over the past ten years shows how important it is for index research to meet society's need for document circulation. Trends in bibliography compilation, database construction, document indexing and retrieval languages likewise reveal the new characteristics and requirements that new technology brings to document processing. Index theory and compilation should adapt to the new situation so that indexes serve society better.

An index is a tool for using documents, and it is closely tied to changes in the way documents circulate. Over the past ten years theoretical research on indexing has fluctuated, which shows that our concept of indexing is changing profoundly as document processing is modernized. In the past, documents were mainly carried on paper and indexes were compiled by hand. Compilation techniques changed little for more than 100 years, and theoretical research was relatively stable. Once computers and related electronic technology were applied to documents and became a new carrier for their circulation, indexing methods and forms changed greatly. This has had a strong impact on traditional concepts and is the fundamental reason for the fluctuation in indexing research. This paper analyzes index research over the ten years from 1993 to 2002 in order to offer some views and proposals for its further development.

1. Research on index theory is at a low ebb, while research in related fields is developing strongly.

Index compilation and index theory have a long history in China. In the 1920s and 1930s, modern indexing theories and methods were introduced into China, setting off a wave of index compilation and research that produced major achievements. After the founding of New China, the stable social environment and advanced political system provided a good atmosphere for academic research, and both theoretical research and index compilation reached a high point. Although the Cultural Revolution set index work back, research and compilation flourished again once order was restored. In particular, the China Index Society was established in the late 1980s; it has led index research, organized index development and carried out academic exchanges, putting indexing in China on a standardized footing.

The early 1990s were the peak of academic research on index theory in China, and the number of studies fell after 1996. The items reported in the National Newspaper Index (Philosophy and Social Sciences Edition) show this trend (see Table 1). The fall does not mean that index research in China is in decline. It means that traditional theoretical research is decreasing, which marks a turning point as index research matures. The evidence is that theoretical research in fields related to indexing is growing stronger.

Table 1. Research on index theory (by year of report in the National Newspaper Index)

Year                    1993  1994  1995  1996  1997  1998  1999  2000  2001  2002  Total
Total                     38    44    36    25    20    31    29     0     1     2    226
Traditional compilation   37    42    32    25    20    29    28     0     1     2    216
Automatic compilation      1     2     4     0     0     2     1     0     0     0     10

1.1 Research on bibliography

Bibliography has a long history in China. Ever since Liu Xiang gathered a large number of books into the Qilue in the Han dynasty, the bibliography has been an important tool for consulting documents. Its method of dividing books into six groups and arranging them accordingly, although somewhat rudimentary, set a precedent for indexing. Over more than 2000 years, bibliography has had a great influence on the preservation and use of documents. Although document retrieval is not the main function of a bibliography, people long relied mainly on bibliographies to retrieve documents, so much so that many scholars blame the underdevelopment of indexing theory in China on the influence of bibliography. Research on bibliography compilation continues to this day. There is a large body of writing on bibliography compilation and databases (see Table 2, which excludes bibliography theory, individual bibliographies, library cataloging and catalogue organization). Research on compiling bibliographies by automatic means, in particular, has gradually grown stronger.

Table 2. Research on bibliography compilation theory and databases (by year of report in the National Newspaper Index)

Year 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 Total

Total 25 17 32 10 27 51 49 36 56 51 374

Bibliography1161251649121288.

Traditional organization11510248468664

Automatically compile 01118034624.

Research on databases 3 5 10 6 17 49 41 18 32 27 208

1.2 Research on Bibliographic Database Construction

A database is the form documents take once computer technology is applied to their storage, and most databases are electronic books. Their many retrieval methods make them multifunctional indexes. Little research on databases appeared in the early 1990s, but it peaked in the late 1990s and has remained at a high level since (see Table 2).

1.3 Research on document indexing

Document indexing is a way of revealing documents, and the compilation of both bibliographies and indexes depends on it. Indexing for a bibliography reveals the contents of a body of books, while indexing for an index reveals individual knowledge points, including document titles. There is no essential difference in the indexing techniques, and research on them has always been a shared topic. Many research articles have appeared over the past ten years (see Table 3, which excludes the indexing problems involved in library classification and cataloging). Since computer technology was applied to bibliography and indexing on a large scale, more and more articles have discussed automatic indexing, which shows that research on document processing technology in China has kept up with world trends.

Table 3. Research on document indexing (by year of report in the National Newspaper Index)

Year                    1993  1994  1995  1996  1997  1998  1999  2000  2001  2002  Total
Total                     28    39    34    26    46    64    48    28    50    39    402
Traditional theory        22    36    29    21    42    49    36    16    33    29    313
Automatic indexing         6     3     5     5     4    15    12    12    17    10     89

1.4 Research on Document Retrieval Language

Retrieval language is the medium of dialogue between people and documents. Without it, documents cannot be standardized or communicated with. Since the application of computer technology in particular, retrieval language has become a means of man-machine dialogue. A retrieval language is an artificial language with defined norms and standards, for example the Chinese Library Classification, the Chinese Thesaurus, and the various "keyword lists" and "author number lists". People have now proposed indexing and retrieving computer-compiled documents in natural language. The discussion has been lively and has produced many insightful articles, which point to the future direction of retrieval language. Discussion of how to retrieve documents in a network environment is also increasing (see Table 4; the data exclude the classification and subject methods used by libraries).

Table 4. Research on document retrieval language (by year of report in the National Newspaper Index)

Year                    1993  1994  1995  1996  1997  1998  1999  2000  2001  2002  Total
Total                     15    37    25    16    41    29    25    35    37    19    279
Traditional theory        15    35    24    15    38    25    21    27    29    11    240
Automated language         0     2     1     1     3     4     4     8     8     8     39

The strong momentum of the related fields is a natural outcome of index research and compilation. It reflects the practical character of index research and the cross-fertilization of index science with other disciplines. It also shows that discussion of theory in an applied science should pay more attention to practice.

2. The development curve of index research traces the application of electronic technology to document circulation.

Over the ten years, research on indexes, bibliography compilation, bibliographic databases, document indexing and retrieval languages all rose from a low level to a high one and then fell back. The hump runs from 1995 to 1999, with the peak in 1997-1998. This was the period when modern electronic technology, with the computer at its core, came into wide use in every aspect of document publishing, storage and circulation. Computer networks have become a document form that people actually use. They offer complete functions, fast transmission, large storage capacity, rich holdings, convenient retrieval and high accuracy, none of which paper documents can match, and they tend to replace printed books and documents. Psychologically (accepting new things) and practically (adapting to and mastering operating techniques), people want to understand, master and apply the new technology, so study and discussion of its application are inevitable. Such research inevitably challenges traditional theory. It must, however, move from shallow to deep and from general to specialized. As the new technology becomes widespread and runs stably, research in the field declines, and a curve takes shape. This curve records the application of computers and network technology to document circulation.
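The rise-and-fall pattern described above can be checked with a few lines of Python. This is only an illustrative sketch: it assumes the yearly totals as read from Table 3, Table 4 and the database row of Table 2, with the year columns taken to run from 1993 to 2002, and the names `YEARS`, `series` and `peak_year` are hypothetical.

```python
# Yearly article counts as read from the tables (assumed column order 1993-2002).
YEARS = list(range(1993, 2003))

series = {
    "document indexing": [28, 39, 34, 26, 46, 64, 48, 28, 50, 39],      # Table 3, total row
    "retrieval language": [15, 37, 25, 16, 41, 29, 25, 35, 37, 19],     # Table 4, total row
    "bibliographic database": [3, 5, 10, 6, 17, 49, 41, 18, 32, 27],    # Table 2, database row
}


def peak_year(counts):
    """Return the year with the highest article count (first one on ties)."""
    return YEARS[counts.index(max(counts))]


peaks = {name: peak_year(counts) for name, counts in series.items()}

for name, year in peaks.items():
    print(f"{name}: peak in {year}")
```

Under these assumptions each series peaks in 1997 or 1998, which agrees with the peak period stated in the text.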

2.1 The influence of computer technology on research into traditional index theory

The number of articles on index theory was highest in 1994 and then gradually declined (see Table 1). This contrasts sharply with the increase, from 1997 onward, in articles on database construction, document indexing technology and retrieval language. The contrast reflects academic reflection on the wide application of computer technology to document circulation, with 1994-1996 as the period in which this research was conceived, written and published. A gradual decline in traditional theory is in keeping with the course of academic research, but a fall to zero is abnormal, and the tendency to stress one line of research while neglecting the others is not desirable. Traditional indexing theory is the theoretical basis of all new forms of indexes and indexing, and research on it should not be neglected.

2.2 The development curve of bibliographic database research (see figure 1) clearly shows the application process of computers and their networks in literature dissemination industries such as libraries.

Articles on databases first appeared in the late 1980s and early 1990s and mainly introduced what databases could do. As computers spread rapidly through the field of document circulation, the number of articles began to grow, and they turned to the compilation, retrieval and production techniques of databases. After 1995 the number of research articles rose sharply, marking the large-scale application of computers to document circulation, with libraries in the lead. Over the following three or four years the spread of computers and network technology reached its peak. By the end of the 1990s it had largely stabilized, and the number of research articles fell accordingly.

Figure 1. Schematic diagram of research on document indexing, retrieval language and bibliographic databases.

2.3 The research of document indexing and retrieval language are interdependent, and they are the ways and means to reveal documents.

Retrieval language is the language in which people communicate with document carriers (print, electronic and so on). Through this language, or the symbols that represent it, a document's subject concepts and other features of retrieval significance are expressed as the basis for document storage and retrieval. Without a retrieval language, indexing is impossible; without indexing, a retrieval language is useless. Together they form an important means of revealing documents. Now that computer technology is applied to document processing, research on both is especially necessary. As Figure 1 shows, research on them also peaked in the period when computer technology came into wide use, which shows the importance academic circles attach to document indexing and the compilation of retrieval languages, and reflects the scholarly style and enterprising spirit of the researchers. The application of the computer is the most important event in the history of index theory. An index not only changes its form as the document carrier changes; its compilation methods and operating procedures are also reformed and renewed. More importantly, the very concept of index compilation must change. This is a major revolution in the history of indexing.
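The interdependence of retrieval language and indexing can be illustrated with a small Python sketch. It is an assumption-laden example rather than a description of any real system: the tiny `THESAURUS`, the sample documents and the function names are all hypothetical. The controlled vocabulary plays the part of the retrieval language, and the index is built and searched only through it.

```python
from collections import defaultdict

# Hypothetical controlled vocabulary: free-text terms mapped to preferred descriptors.
THESAURUS = {
    "catalogue": "bibliography",
    "catalog": "bibliography",
    "bibliography": "bibliography",
    "index": "index",
    "concordance": "index",
    "database": "database",
}


def normalize(term):
    """Translate a free-text term into its descriptor, or None if it is not in the vocabulary."""
    return THESAURUS.get(term.strip().lower())


def build_index(documents):
    """Indexing step: map each descriptor to the set of document ids it describes."""
    index = defaultdict(set)
    for doc_id, terms in documents.items():
        for term in terms:
            descriptor = normalize(term)
            if descriptor is not None:
                index[descriptor].add(doc_id)
    return index


def search(index, query):
    """Retrieval step: the query passes through the same vocabulary as the documents."""
    descriptor = normalize(query)
    if descriptor is None:
        return []
    return sorted(index.get(descriptor, set()))


# Hypothetical sample documents with their assigned free-text terms.
documents = {
    "doc1": ["Catalogue", "database"],
    "doc2": ["concordance"],
    "doc3": ["catalog", "index"],
}
index = build_index(documents)
print(search(index, "catalog"))
print(search(index, "Index"))
```

A query term outside the vocabulary finds nothing, and descriptors that were never assigned to a document are never found, which is the sense in which neither the language nor the index is useful without the other.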

3. Research on index theory has not advanced rapidly, but its field of vision is broad.

Index research is a junior among the disciplines and has relatively few achievements to its name. Most of its content introduces the functions of indexes or discusses compilation methods. As scientific research deepens and people's demand for documents grows ever wider, research on index theory is becoming more diverse and more specialized. The emergence of new document carriers in particular has broadened its horizons, and people have explored ways of revealing documents from many angles. Over the past ten years the overall pace of research has not been great, but its horizon is much broader than before, which is in itself progress.

3.1 The general trend is a decline in traditional theoretical research, which reflects the call for index research suited to the new situation.

Research on index theory was still very stable in the 1990s. Between 1993 and 1999 the number of research articles in Table 1 fluctuates little and remains fairly high, which shows that a group of Chinese scholars cared about the development of indexing. The China Index Society contributed greatly to this good atmosphere: it organized academic research, kept in contact with academic circles at home and abroad, and did a great deal of work. In the mid-1990s the Society edited a series of papers on index research in five volumes: Yesterday and Tomorrow of Index, Index Technology and Index Standard, On Index and Index Method, Newspaper Index and News Database, and Automation of Index Compilation. Experts were invited to write on indexing principles and automatic index compilation, document indexing and automatic indexing technology, retrieval language compilation and computer language recognition, bibliographic database technology and other topics. The series was unprecedented in many years of indexing research. It addressed deep problems, cited a wealth of material and offered new academic views, and it has guided the development of indexing theory in China. The number of research articles has nevertheless dropped sharply in recent years, and this deserves the attention of academic circles. We should ask whether earlier research suits the new situation and its needs, and how to change traditional concepts as soon as possible and establish a new system of technical research that lays the foundation for indexing in a network environment.

3.2 The horizon of index research has gradually widened, in keeping with the general trend of scientific and technological development.

During the ten years, 203 articles were published in newspapers and periodicals (this figure is based on the publication date of the original documents). They cover general indexing theory, automatic indexing, foreign-language indexing, studies of individual index types, the history of indexing, notable indexing scholars and institutions, indexing monographs, indexes to retrieval tools, and indexes to ancient books and modern works (see Table 5). General theoretical research accounts for only 31.5%, and specialized research for more than two thirds. Of the 64 theoretical articles, 27 deal with functions, 26 with compilation principles, 8 with the development of indexing, and 3 with comparison of Chinese and foreign indexing theory. Studying indexing from many angles shows the vitality of index research and allows it to keep up with advances in science and technology.
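The proportions quoted above follow directly from the counts given in the text. The short Python sketch below works through the arithmetic; the variable names are illustrative only and the figures are taken as stated (203 articles in all, of which 64 are general theory).

```python
# Counts as stated in the text; names are illustrative.
TOTAL_ARTICLES = 203

general_theory = {
    "functions": 27,
    "compilation principles": 26,
    "development of indexing": 8,
    "Chinese-foreign comparison": 3,
}

general_total = sum(general_theory.values())   # should match the 64 theoretical articles
general_share = general_total / TOTAL_ARTICLES
specialized_share = 1 - general_share

print(f"general theory: {general_total} articles, {general_share:.1%}")
print(f"specialized research: {specialized_share:.1%}")
```

The four sub-counts add up to the 64 general-theory articles, and the remaining share is a little over two thirds, as the text says.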

Table 5. Classification of index theory research, 1993-2002 (by publication date of the original documents)

Category                              Articles
General theory                              64
Automatic indexing                          12
Foreign-language indexing                   13
Index types                                 62
History of indexing                          6
Indexing scholars and institutions           7
Indexing monographs                         22
Indexes to retrieval tools                  10
Indexes to ancient books                     3
Indexes to modern works                      4
Total                                      203

3.3 There is a great deal of research on index types, which shows that indexing remains a practical science.

As long as society needs documents, it needs indexes. At any time, the index is the most convenient way to use documents. There are 62 articles on index types of all kinds, almost one third of the total, and ten kinds of index are discussed (see Table 6). Although the average number of articles per type is small, it reflects the attention of the academic community. These indexes differ in function, but people need them all.

Table 6. Research on index types (by publication date of the original documents) [table image not available]

3.4 Once computer technology stabilizes, research on automation declines, while research on index types and on indexes to academic works increases.

When a new document carrier emerges and the way documents are used changes, research in the field inevitably appears. Once the technology is widespread and stable, introductory and explanatory articles decrease, discussion gradually deepens, and the total number of articles falls. The declining number of articles on databases is an example. Research on automation in bibliography compilation, document indexing, document arrangement and retrieval language is gradually increasing, which shows that people are moving away from general theory toward applied theory and specialized compilation methods. This is also how computer technology passes from application to theoretical research. Articles on the application of computers in specific fields will therefore gradually increase and deepen.

3.5 Research on the history of indexing and on the scholars and institutions that contributed to index research and compilation is gradually decreasing, while research on index works and on the compilation of retrieval tools is increasing. This reflects growing academic attention to applied index theory and to the retrieval function of indexes.

Although this field has been almost blank for the past three years, this is a temporary phenomenon caused by the period of adjustment as document carriers change. Once computer technology is running stably, its fast compilation, accurate term selection and standardized arrangement will save a great deal of compilation cost and labor and will inevitably bring a new upsurge in indexing, including indexes to academic research of all kinds at home and abroad. The resulting indexes may not be printed, but their function is the same.

As long as documents retain their function, indexes will retain theirs. In terms of practical usefulness, the wide application of computers to documents has opened bright and boundless prospects for indexing. Indexes to monumental works, which could not be compiled before, may now be within reach. The use of documents has entered a new era.

4. Escape the confusion over "theory", establish a broad concept of the index and attend to its practical application.

The index is an important part of the "complete document", which should contain both the original text and its retrieval tools so that it is convenient to use. An index is a retrieval tool attached to the document. Now that the form of documents has changed greatly, index research should be reformed to meet people's demand for documents in the new situation. The current state of index research deserves attention in several respects.

4.1 We should break the shackles of traditional ideas and establish innovative thinking.

We should study in depth how people use documents today, how computers and networks affect document dissemination, and what people need from documents in a network environment, so that index compilation can meet the needs of future users. Research on index theory should adapt not only to changes in document forms but also to people's understanding, cognition, psychological adaptation and habits in using documents. We cannot cling to traditional theory that is divorced from practice and has therefore lost its power to guide practice. Establishing innovative thinking does not mean expecting a complete set of new theories in a short time. It means adopting a new way of thinking, daring to innovate and forge ahead, taking social needs as the purpose of research, giving full play to the role of indexes in revealing documents, and providing a fast lane for the use of documents.

4.2 Break through the barrier of "safe" research and strengthen rational inquiry.

At present the first aim of much research is simply to get published. In theoretical discussion people therefore prefer well-worn and consistent statements, fearing that new ideas will be judged inaccurate and new formulations rejected by editors, so they stay vague and play safe. There is also a tendency to cast every article as a "theory" piece, with a theoretical flavor and talk of characteristics, laws and functions. This is not desirable. Theoretical research exists to solve practical problems. It need not be confined to one form, and its content can be deep or shallow; any article that discusses one problem clearly is a good article. Indexing is a practical science, so we should attend to research on compilation techniques. Yet in ten years there were only 44 articles on bibliography and index compilation techniques (excluding library catalogue organization; see Table 7), which is 14% of the 314 articles on index theory and bibliography compilation reported in the National Newspaper Index (see Tables 1 and 2). Valuing "theory" over technique is a sign of a lack of rationality. It shows that the field is used to routine work and is not good at pioneering and innovating.

Table 7. Research on bibliography and index compilation techniques (by year of report in the National Newspaper Index)

Year                    1993  1994  1995  1996  1997  1998  1999  2000  2001  2002  Total
Total                      4     2     5     5     3     6     2     6     2     9     44
Traditional                3     2     3     4     2     3     0     4     1     2     24
Automated                  1     0     2     1     1     3     2     2     1     7     20

4.3 A considerable number of researchers know too little about advanced technology and have only a vague idea of the future of indexing.

In Table 1 the ratio of traditional studies to studies of automation is 216:10, which shows that academic circles are unfamiliar with new technology. Many articles do not discuss the effect of computer applications on document processing at all, and instead go over problems that many others have already discussed. Those who know little about the application of new technology will naturally lack a clear view of future development. In the early 1990s computers were little used in China but widely used in advanced Western countries. This should have prompted a wave of introductory articles in theoretical circles, yet such articles were few. It shows that China's indexing community has responded too weakly to the application of new technology and that academic research lags behind. This must change, or our index research and compilation will fall out of date and hold back the country's scientific and cultural development.

4.4 We should understand deeply the challenges posed by ever-changing new technology. We need to keep updating, exploring and striving in order to keep pace with the times and serve society's need for documents in full.

In line with society's demand for documents in the new situation, we should develop practical index products that serve society. Index research and compilation should meet the needs of the times, of books, of people and of society. If they do, how could society fail to welcome them?

4.5 Theoretical research should keep pace with the times, be combined with practice and start from the actual needs of society.

Principles need to be discussed, but it is more important to study specialized compilation theory. An index that is out of date or divorced from actual needs is worthless. Only when a good research atmosphere has formed and a cycle of demand → research → new demand → new research is in place can targeted research have vitality. If the compilation method is scientific, the resulting indexes will bring great social benefit.

4.6 Establish a broad concept of the index, widen the field of research, and serve national science, technology, culture and economic construction.

First, we should not confine our attention to printed documents. Although electronic documents are not yet common, they will eventually become the mainstream carrier. We should therefore strengthen research on database retrieval methods so that electronic documents serve people more effectively. Second, we should not attend only to the indexing of social science documents but widen our view to documents circulating throughout society, in economic construction, industrial production, commercial services and so on. All documents that can circulate in society once they are put in order, or whose circulation benefits from being put in order, such as trade catalogues and commodity catalogues, should come within our field of vision. Third, we should attend to research that crosses into neighboring disciplines with similar purposes, related techniques and similar functions, such as document classification, cataloging, indexing, proofreading, textual research and database production. Although their purposes differ, many of their techniques are similar and can be borrowed. Together they can accomplish the revealing and ordering of documents.

4.7 China Index Society should strengthen the guidance of academic research.

In addition to organizing academic activities, the Society should guide the direction of research on index theory, report on the world's advanced compilation techniques and trends in index research, publicize achievements, translate influential academic works, strengthen academic exchanges, and keep all sectors of society informed of new developments in indexing. The Society should also draw in technical personnel from industry and commerce, so that index research becomes more practical, links directly with industrial and agricultural production, and better serves national economic construction.

In short, the index research of the past ten years gives cause for both joy and worry. The joy is that the research results are quite rich. The worry is the decline of traditional research in the past few years. The reason for the decline is that society has turned its attention to the wide application of new technologies, and people need time to become familiar with them. In this process people re-examine, evaluate, learn and integrate the new technologies. The recent reduction in traditional research marks a period in which old and new technologies are being integrated and adjusted. A period of vigorous development of new theory will follow. Let us make our theoretical preparations and meet the coming upsurge in indexing.