How to extract keywords in Chinese language-use questions

Here is the article you asked for, written by a teacher in Jiangsu. Please use it for reference.

In recent years, passage compression has been a key point in the college entrance examination. Questions of this kind comprehensively test the abilities of understanding, analysis, screening, generalization and language expression; they also discriminate well between candidates and are closely tied to social life.

Passage compression can be tested in many ways. The traditional question types are: define a concept, summarize the content of a paragraph, write a news item or headline, write a lead, and write a telegram. The newer types are: write a short message, write an ending for an essay, name a new product, add a concluding sentence to a paragraph, write a news comment, summarize research conclusions, and extract keywords. Today the focus is on extracting keywords. Let's look at a college entrance examination question first:

Extract the main information from the following paragraph and write down four keywords in the box.

According to reports, within the vast collection of ancient books in the National Library of China, the Dunhuang manuscripts alone comprise 16,000 scrolls, more than 5,000 meters in total length, awaiting repair, while the National Library has only 10 professionals engaged in the restoration of ancient books; libraries and museums all over the country hold 30 million ancient books, which are also seriously damaged and urgently need rescue repair, but the total number of ancient-book restorers in China is less than 100. With so few people, completing such a huge restoration project would take nearly a thousand years, even working day and night.

Analysis: This was the first time the concept of "keywords" appeared in the Chinese paper of the college entrance examination, which was refreshing. So what are keywords? The term is common on the internet, where it refers to the words people type into a search box. Keywords are mostly website names, webpage names, news events, names of people, technical terms, software names and so on. For today's discussion, we could search for "college entrance examination", "passage compression" and "keywords", so these three terms can be called keywords. Extracting keywords ultimately means being good at extracting the "core information"; it is a type of passage-compression question that tests students' ability to extract key information.

Method 1: the three-step method

(1) Identify the object of the statement, the major event, or the central point of view. In this passage the main objects of the statement (the major concepts or events) are "ancient books" ("ancient books in the collections") and "talent". These are the subject words and must be included.

(2) Identify the predicate verbs or summary words that correspond to the main concepts. For example, "repair" and "lack" are what is stated about those objects and cannot be ignored.

(3) After the selection, link the chosen words loosely together; if they roughly express the main content of the paragraph, the choice can be finalized. For example, the words for this question can be linked as: (collection) ancient books (urgent) repair, (but in this respect) talent (very) lack. This is a bit like extracting the trunk of a sentence, which can be done through grammatical analysis. The basic procedure is: compress the content, extract the trunk, select and compare, then integrate the expression (generally as a subject-predicate structure such as "who, or what, does what").
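The linking check in step (3) can be illustrated with a small Python sketch. It assumes the linked sentence is written as in the example above, with the connecting filler words in parentheses and each keyword separated from the next by such a filler. The function name `keywords_from_linked_sentence` is hypothetical; this only illustrates the check and is not part of the exam method itself.

```python
import re


def keywords_from_linked_sentence(linked: str) -> list[str]:
    """Recover the keywords from a sentence whose filler words are in parentheses."""
    # Split at each "(...)" group; the text between the groups is the keywords.
    pieces = re.split(r"\([^)]*\)", linked)
    # Trim whitespace and stray punctuation, then drop empty pieces.
    cleaned = [piece.strip(" ,.;") for piece in pieces]
    return [piece for piece in cleaned if piece]


# The linked sentence from step (3), copied as written in the text.
linked = "(collection) ancient books (urgent) repair, (but in this respect) talent (very) lack"
keywords = keywords_from_linked_sentence(linked)
print(keywords)  # ['ancient books', 'repair', 'talent', 'lack']
```

Removing the fillers leaves exactly four words, two subject words and two predicate words, which is what the question asks for.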

Method 2: summarize first, then refine

Before extracting keywords from a paragraph, it should not be difficult to summarize its content. Summarizing an article, refining its viewpoint and stating the general idea of a paragraph are all things practiced in Chinese class. The material given consists of two sentences. The first sentence has two levels: first, the National Library is short of professionals in ancient-book restoration; second, libraries and museums all over the country face the same shortage of restoration talent, where "less than 100" shows how scarce professionals in this field are. The second sentence does the arithmetic, which again shows the lack of talent for repairing ancient books.

The central idea of this passage can be summarized as "the ancient books in the collections are in urgent need of repair, but talent in this field is very scarce." From this summary we can refine further, keep the main information, and arrive at four keywords: "ancient books, restoration, talents, lack (deficiency)".