Detailed description of the embodiments
Fig. 1 is a structural diagram of the knowledge system construction system provided in an embodiment of the present invention. As shown in Fig. 1, the knowledge system construction system provided by this embodiment includes: a keyword table management module, a domain rule setting module, a thesaurus management module, a domain ontology management module, and a user permission management module.
The keyword table management module manages the keywords within the system: adding, editing, deleting, importing, and exporting keywords, discovering new words, and publishing keywords as descriptors. Specifically, it obtains the keywords in a target domain and edits, deletes, and publishes those keywords.
The domain rule setting module manages the content specifications of all vocabularies in the domain and the ontology settings. Specifically, it determines the thesaurus setting rules and the ontology rules.
The thesaurus management module manages thesauri as tables and the descriptors within them. Specifically, it determines descriptors according to the keywords; builds a thesaurus according to the descriptors and the thesaurus setting rules; adds, edits, deletes, queries, visually displays, exports, associates resources with, and publishes the descriptors; and creates, edits, and deletes the thesaurus.
The domain ontology management module manages domain ontologies, knowledge element models, and knowledge elements. Specifically, it obtains the knowledge elements of the target domain; builds an ontology according to the models of the knowledge elements and the knowledge connections between different knowledge elements; imports, exports, edits, and deletes the ontology; and adds, edits, deletes, queries, visually displays, and associates resources with the knowledge elements.
The user permission management module manages users and their system permissions. Specifically, it obtains user information and verifies the user's operating permissions according to that information.
In this embodiment, the knowledge system construction system provides a series of services during document processing, such as automatically extracting keywords and, based on the existing system, filling in descriptors, the relationships between knowledge elements, and attributes. This achieves rapid construction of the knowledge system, raises the utilization of document resources, and improves the knowledge system's capacity for dynamic learning and updating as well as its degree of intelligent automation.
Fig. 2 is a functional diagram of the keyword table management module provided in an embodiment of the present invention. As shown in Fig. 2, on the basis of the above embodiment, the keyword table management module is specifically used for at least one of the following: taking descriptors newly added to the thesaurus as keywords; obtaining keywords in the target domain entered by the user; taking the words in a preset vocabulary as keywords; and extracting words from text information uploaded by the user and taking those words as keywords.
Keywords are one source of descriptors, and management of the keyword table is the foundation of the knowledge management structure. This module provides functions for adding, editing, deleting, importing, and exporting keywords, for new word discovery, and for publishing keywords as descriptors.
As shown in Fig. 2, when building the keyword table the system obtains keywords in four ways: descriptor write-back, manual keyword addition, keyword import, and new word discovery.
Descriptor write-back: after a new descriptor is added to the thesaurus, the system automatically writes it back into the keyword table as a new keyword.
Manual keyword addition: this function lets users enter keywords by hand, so that the keyword table can be supplemented flexibly.
Keyword import: this function lets users build a knowledge system from an existing industry vocabulary. The existing vocabulary can be imported directly into the system to quickly build the base vocabulary of the knowledge system.
New word discovery: this function extracts new words from text information uploaded by the user and, after the user confirms them, adds them to the keyword table. It allows the keyword table to be built from an existing corpus, which raises the utilization of corpus documents while advancing construction of the keyword table.
Keyword editing/deletion: users can manually edit or delete keywords in an existing keyword table, giving precise control over the table.
Keyword publishing: keywords that have been confirmed as correct can be promoted and published as descriptors in the thesaurus of a given domain.
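The four acquisition modes and the publishing step can be sketched as a small in-memory keyword table. This is only an illustrative sketch: the class name `KeywordTable`, the `Source` enumeration, and all method names are hypothetical and do not come from the embodiment, which does not specify a data structure or storage layer.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import Dict, Iterable, Set


class Source(Enum):
    """The four ways a keyword can enter the table (names are illustrative)."""
    WRITE_BACK = "descriptor write-back"
    MANUAL = "manual addition"
    IMPORT = "keyword import"
    NEW_WORD = "new word discovery"


@dataclass
class KeywordTable:
    # keyword -> the source that first contributed it
    keywords: Dict[str, Source] = field(default_factory=dict)
    # keywords that have been published as descriptors
    published: Set[str] = field(default_factory=set)

    def add(self, word: str, source: Source) -> bool:
        """Add a keyword; return False if it is blank or already present."""
        word = word.strip()
        if not word or word in self.keywords:
            return False
        self.keywords[word] = source
        return True

    def import_vocabulary(self, words: Iterable[str]) -> int:
        """Import an existing vocabulary; return how many words were new."""
        return sum(self.add(w, Source.IMPORT) for w in words)

    def delete(self, word: str) -> None:
        """Remove a keyword (and its published flag) if it exists."""
        self.keywords.pop(word, None)
        self.published.discard(word)

    def publish(self, word: str) -> str:
        """Mark a confirmed keyword as published and return it as a descriptor."""
        if word not in self.keywords:
            raise KeyError(f"unknown keyword: {word}")
        self.published.add(word)
        return word


table = KeywordTable()
table.add("digital publishing", Source.MANUAL)
imported = table.import_vocabulary(["ontology", "thesaurus", "ontology"])
table.add("knowledge graph", Source.NEW_WORD)  # assumed to be user-confirmed already
descriptor = table.publish("ontology")
```

Recording the source alongside each keyword is a design assumption; it makes it possible to tell imported vocabulary apart from words found by new word discovery when reviewing the table later.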
Fig. 3 is a functional diagram of the domain rule setting module provided in an embodiment of the present invention. As shown in Fig. 3, on the basis of the above embodiment, the thesaurus setting rules include: feature word definition rules for descriptors, attribute definition rules for descriptors, and relationship definition rules between descriptors.
The domain rule setting module provides domain creation and editing, thesaurus rule setting, and ontology rule setting. It formulates the content specifications for all vocabularies and ontologies in a domain; the thesauri and ontologies in the domain inherit the rules of that domain.
Thesaurus rule setting for a domain covers management of the feature word definitions, attribute definitions, and inter-term relationship definitions for the descriptors in that domain. Feature word management maintains the feature words and feature categories that descriptors in the domain can select; when browsing and filtering descriptors, a descriptor can be located quickly through its feature category and feature words. Attribute management maintains the attribute categories that must be filled in when editing a descriptor in the domain, such as a spelling attribute or a comment attribute. Inter-term relationship management determines which relationships may hold between descriptors in the domain. These relationships fall along two dimensions: use/used-for, genus/species, and see-also; or hypernym, hyponym, synonym, and related term.
The domain ontology rules include the definition of abstract knowledge element models and the definition of knowledge element relationships. An abstract knowledge element model definition comprises the assignment of model attributes, the definition of model instance attributes, and the definition of the model instance style. The model attributes include the model name and the parent model it inherits from. The model instance attribute definition specifies the attribute names that a model instance (that is, a knowledge element) has, together with their input forms and validation methods. The model style definition specifies the color style of the model's instances in the knowledge element graph. The knowledge element relationship definition specifies the associations that may exist between instances of knowledge element models. A relationship definition includes: the relationship name, relationship type, relationship color, relationship description, and the mapping between the relationship and the knowledge element model attributes.
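As a rough illustration of an abstract knowledge element model with a parent model, instance attribute definitions, and a display color, consider the sketch below. All class, field, and attribute names (`AttributeDef`, `KnowledgeElementModel`, `title`, `page_count`) are hypothetical, and resolving inheritance by merging parent attributes into the child is an assumption rather than something the embodiment specifies.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional


@dataclass
class AttributeDef:
    name: str
    input_form: str                  # e.g. "text" or "number" (illustrative values)
    validate: Callable[[str], bool]  # validation method applied to the raw input


@dataclass
class KnowledgeElementModel:
    name: str
    parent: Optional["KnowledgeElementModel"] = None
    attributes: Dict[str, AttributeDef] = field(default_factory=dict)
    color: str = "#888888"           # color of instances in the knowledge element graph

    def all_attributes(self) -> Dict[str, AttributeDef]:
        """Attributes of this model plus those inherited from its ancestors."""
        inherited = self.parent.all_attributes() if self.parent else {}
        return {**inherited, **self.attributes}

    def check_instance(self, values: Dict[str, str]) -> List[str]:
        """Return names of attributes that are missing or fail validation."""
        return [
            name
            for name, attr in self.all_attributes().items()
            if name not in values or not attr.validate(values[name])
        ]


publication = KnowledgeElementModel(
    "Publication",
    attributes={"title": AttributeDef("title", "text", lambda v: bool(v.strip()))},
)
book = KnowledgeElementModel(
    "Book",
    parent=publication,
    attributes={"page_count": AttributeDef("page_count", "number", str.isdigit)},
    color="#3366cc",
)
errors = book.check_instance({"title": "Ontology Basics", "page_count": "3x0"})
```

Here `errors` lists `page_count` only, because the inherited `title` attribute is present and valid while the page count is not a number.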
Fig. 4 is a functional diagram of the thesaurus management module provided in an embodiment of the present invention. As shown in Fig. 4, on the basis of the above embodiment, a descriptor is a word in a given domain that has certain characteristics, namely attributes, a category, associated descriptors, and associated resources. The present invention supports publishing enterprises that need to organize relevant knowledge, build thesauri, and provide knowledge services to readers.
Thesaurus construction has two parts: formulating the vocabulary rules, and editing and adding content. The vocabulary rules are formulated in the domain rule setting module; the thesaurus management module provides the functions for editing and adding thesaurus content. The thesaurus management module covers management of thesauri as tables and management of the descriptors within them.
Table management of thesauri covers creating, editing, and deleting domain thesauri. When creating or editing a thesaurus, the user can select the domain it belongs to and its vocabulary name.
Management of the descriptors in a table covers adding, editing, deleting, querying, visually displaying, exporting, associating resources with, and publishing descriptors.
Descriptors in a vocabulary come from four sources:
1) following the traditional descriptor construction method, domain experts add terms by hand, carrying out the manual screening of terms and the compilation of the vocabulary;
2) published keywords, from which descriptors in the vocabulary are built;
3) existing descriptors, from which the thesaurus is built by external import;
4) existing knowledge elements in the knowledge system, which are written back to form descriptors in the vocabulary.
These four ways of acquiring descriptors allow the vocabulary to be built flexibly and quickly, and avoid the slow processing, incomplete coverage, and wasted document resources that result from linear processing in a single mode. In all four, services that combine manual work with automatic extraction and parsing are used to edit descriptors, fill in related descriptors, and attach related resources, which speeds up vocabulary construction.
The editable content of a descriptor includes: the descriptor name, descriptor attributes, descriptor features, category, and associated terms. Descriptor attributes include the pinyin of the descriptor, the head-word definition, free terms, and so on, and can be configured in the domain rule setting module. Associated terms can be specified after quickly locating and filtering candidates by fuzzy query or by the descriptor's category, features, and the like; inter-term relationships are configured and managed in the domain rule setting module.
The services that combine manual work with automatic extraction and parsing include:
Related term filling: related terms for a descriptor can be supplemented intelligently according to the descriptor's word-formation characteristics, features, and attributes, and the filling is completed after manual confirmation and review.
Associated resource attachment: associated resources can be attached automatically according to the descriptor name, free terms, and so on, and are then confirmed by manual review.
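The embodiment does not state how related terms are suggested. Purely as an illustration, the sketch below scores candidates by shared word tokens (a stand-in for word-formation characteristics) and shared feature words, and leaves the final decision to a reviewer. The function name `suggest_related`, the scoring rule, and the sample vocabulary are all assumptions.

```python
from typing import Dict, List, Set, Tuple


def suggest_related(
    target: str, features: Set[str], vocabulary: Dict[str, Set[str]]
) -> List[Tuple[str, int]]:
    """Rank vocabulary terms by tokens and feature words shared with the target.

    `vocabulary` maps each term to its set of feature words. Terms with no
    overlap are not suggested. Results still require manual confirmation.
    """
    target_tokens = set(target.lower().split())
    suggestions = []
    for term, term_features in vocabulary.items():
        if term == target:
            continue
        shared_tokens = target_tokens & set(term.lower().split())
        shared_features = features & term_features
        score = len(shared_tokens) + len(shared_features)
        if score:
            suggestions.append((term, score))
    # Highest score first; ties broken alphabetically for stable output.
    return sorted(suggestions, key=lambda pair: (-pair[1], pair[0]))


vocabulary = {
    "digital publishing": {"industry"},
    "digital library": {"institution"},
    "print publishing": {"industry"},
    "ontology": {"method"},
}
candidates = suggest_related("digital publishing", {"industry"}, vocabulary)

# Manual review step: only terms the reviewer approves are filled in.
approved_by_reviewer = {"print publishing"}
related_terms = [term for term, _ in candidates if term in approved_by_reviewer]
```

A production system would likely replace the token-overlap score with a trained model or a domain-specific segmenter, especially for Chinese text where whitespace tokenization does not apply.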
The thesaurus obtains descriptors and builds vocabularies in the ways described above, and descriptors are edited, descriptor attributes filled in, and associated resources attached through a combination of manual work and intelligent services, realizing construction of the knowledge system's thesaurus. For the descriptors in a vocabulary, the present invention provides visual display of descriptors and related terms, so that a descriptor's attributes, its associated resources, and the graph of descriptors that have some inter-term relationship with it can be viewed intuitively.
Fig. 5 is a functional diagram of the domain ontology management module provided in an embodiment of the present invention. As shown in Fig. 5, on the basis of the above embodiment, the domain ontology management module is specifically used for at least one of the following: receiving newly added knowledge elements entered by the user; obtaining knowledge elements of the target domain from published descriptors; determining knowledge elements of the target domain according to abstract knowledge element models, knowledge elements, and a corpus; and building an ontology from an offline ontology by OWL ontology import, thereby obtaining ontology knowledge elements. After obtaining the ontology knowledge elements, the domain ontology management module is further used to set knowledge element associations and attach resources related to the knowledge elements.
A domain ontology is a network formed by numerous knowledge elements and the knowledge connections between them. Building a domain ontology further strengthens formalization capability and high-level knowledge reasoning capability, and more relationships between concepts can be obtained through complex logical reasoning.
The domain ontology management module provides management of domain ontologies and management of the knowledge elements in a domain.
Management of a domain ontology covers importing, exporting, and editing the ontology.
Ontology import: this function makes it easy for users who already have a foundation to build an ontology quickly from existing OWL files. OWL is an ontology description language that can describe a complete domain ontology; an offline file of a domain ontology described in OWL can be imported into the system. After import, the system parses the OWL file, obtains the ontology's knowledge connections and knowledge element information, and stores them in the database.
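The parsing step can be illustrated with a minimal sketch that reads an OWL file in RDF/XML serialization using only the Python standard library. It extracts named `owl:Class` elements as knowledge elements and `rdfs:subClassOf` links as knowledge connections. The function name `parse_owl_classes` and the sample ontology under `http://example.org/pub#` are hypothetical; a real importer would need to handle other OWL constructs and serializations, and would typically rely on a dedicated RDF/OWL library instead.

```python
import xml.etree.ElementTree as ET
from typing import List, Tuple

RDF = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
RDFS = "http://www.w3.org/2000/01/rdf-schema#"
OWL = "http://www.w3.org/2002/07/owl#"

# A tiny illustrative ontology; the example.org IRIs are made up.
SAMPLE_OWL = """<?xml version="1.0"?>
<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
         xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#"
         xmlns:owl="http://www.w3.org/2002/07/owl#">
  <owl:Class rdf:about="http://example.org/pub#Publication"/>
  <owl:Class rdf:about="http://example.org/pub#Book">
    <rdfs:subClassOf rdf:resource="http://example.org/pub#Publication"/>
  </owl:Class>
</rdf:RDF>
"""


def parse_owl_classes(owl_xml: str) -> Tuple[List[str], List[Tuple[str, str, str]]]:
    """Return (class IRIs, (child, "subClassOf", parent) links) from RDF/XML."""
    root = ET.fromstring(owl_xml)
    elements: List[str] = []
    links: List[Tuple[str, str, str]] = []
    # Only top-level, named owl:Class elements are considered in this sketch.
    for cls in root.findall(f"{{{OWL}}}Class"):
        iri = cls.get(f"{{{RDF}}}about")
        if iri is None:
            continue  # skip anonymous classes
        elements.append(iri)
        for sub in cls.findall(f"{{{RDFS}}}subClassOf"):
            parent = sub.get(f"{{{RDF}}}resource")
            if parent:
                links.append((iri, "subClassOf", parent))
    return elements, links


elements, links = parse_owl_classes(SAMPLE_OWL)
```

The two returned lists correspond to what the paragraph above calls knowledge element information and knowledge connections; writing them to a database is left out of the sketch.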
Ontology export: the system can export a standard knowledge system as an OWL file, for editing with offline ontology construction tools.
Management of the knowledge elements in a domain covers adding, editing, deleting, querying, visually displaying, and associating resources with knowledge elements. Knowledge element attributes can be configured in the domain rule setting module.
Knowledge elements in an ontology come from four sources:
1) domain experts add knowledge elements by hand;
2) published descriptors;
3) knowledge elements extracted automatically after extensive training on abstract knowledge element models, knowledge elements, and a corpus;
4) OWL ontology import, by which an ontology can be built quickly from an existing offline ontology to obtain ontology knowledge elements.
After ontology knowledge elements have been obtained in the above ways, the setting of knowledge element associations and the attachment of related resources are completed as further steps.
Knowledge element associations are set on the basis of the knowledge element model relationships defined in the domain rule setting module: candidates are located and filtered by fuzzy query, pinyin index, knowledge element source, workflow stage, and the like, and the association is then determined by manual selection.
Resources related to a knowledge element can be attached in two ways: (1) by combining manual work with intelligent recommendation, in which related resources are recommended intelligently or retrieved according to information such as the title, keywords, and author of the resource to be attached, and are attached after confirmation; (2) by intelligently recommending and attaching resources according to the knowledge element's name, attributes, and so on.
Once knowledge elements have been obtained and edited, their associations set, and their related resources attached, the visualization function makes it possible to view intuitively the graph of the ontology formed by the knowledge elements, as well as the details of a knowledge element: its attributes, its related resources, and the small graph formed by its related knowledge elements.
Fig. 6 is a task flow diagram provided in an embodiment of the present invention; Fig. 7 is a task flow diagram provided by another embodiment of the present invention. As shown in Fig. 6, on the basis of the above embodiment, the user permission management module manages users, user roles, and users' browsing and operating permissions.
The present invention divides user roles into three classes: industry experts, publishing houses, and system administrators. Industry experts are responsible for editing and managing the knowledge system, including editing and examining the content of thesauri, domains, and ontology knowledge. Publishing house users are responsible for reviewing and confirming the operations carried out by experts. System administrators are responsible for users and role permissions and for domain rule settings.
User operating permissions include a user's permissions to create, edit, import, and export keyword tables, thesauri, and domain ontologies, and to edit and fill in vocabulary and ontology content.
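A role-based permission check along these lines could look like the following sketch. The three roles come from the description above, but the operation names and the exact assignment of operations to roles are assumptions made for illustration; the embodiment does not enumerate them.

```python
from enum import Enum
from typing import Dict, Set


class Role(Enum):
    INDUSTRY_EXPERT = "industry expert"
    PUBLISHER = "publishing house"
    SYSTEM_ADMIN = "system administrator"


# Hypothetical operation names; the mapping below is an illustrative assumption.
PERMISSIONS: Dict[Role, Set[str]] = {
    Role.INDUSTRY_EXPERT: {"edit_keyword_table", "edit_thesaurus", "edit_ontology"},
    Role.PUBLISHER: {"review_task"},
    Role.SYSTEM_ADMIN: {"manage_users", "set_domain_rules"},
}


def is_allowed(role: Role, operation: str) -> bool:
    """Return True if the given role may perform the operation."""
    return operation in PERMISSIONS.get(role, set())


expert_can_edit = is_allowed(Role.INDUSTRY_EXPERT, "edit_thesaurus")
expert_can_set_rules = is_allowed(Role.INDUSTRY_EXPERT, "set_domain_rules")
```

Keeping the mapping in a single table makes it straightforward for a system administrator to adjust role permissions without touching the checking logic.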
As shown in Fig. 6, tasks in the present invention are of two types, processing tasks and review tasks, and the task objects are thesauri and domain ontologies. Descriptors and the knowledge elements in an ontology pass through the following states: to be indexed, being indexed, pending review, under review, and stored. State transitions are driven by tasks.
The quantifiable tasks in this system fall into four classes: descriptor processing, descriptor review, knowledge element processing, and knowledge element review. Descriptor processing fills in attributes, selects features, and establishes inter-term relationships for descriptors that are to be indexed. Descriptor review reconfirms the processing results for terms in the pending-review state. Knowledge element processing fills in attributes, uploads attachments, and establishes relationships for knowledge elements that are to be indexed. Knowledge element review reconfirms the processing results for knowledge elements in the pending-review state.
Every task, of whatever kind, comprises three steps: creation and assignment of the task, processing and submission of the task, and review and confirmation of the task result. The task flow is shown in Fig. 7.
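The task-driven state changes can be sketched as a small state machine. The five states are those named in the description; the event names and the specific mapping of events to transitions are assumptions, and the sketch deliberately omits a rejection path because the description does not define one.

```python
from enum import Enum
from typing import Dict, Tuple


class State(Enum):
    TO_BE_INDEXED = "to be indexed"
    INDEXING = "being indexed"
    PENDING_REVIEW = "pending review"
    UNDER_REVIEW = "under review"
    STORED = "stored"


class Event(Enum):
    # Hypothetical task events that drive the transitions.
    ASSIGN_PROCESSING = "processing task assigned"
    SUBMIT_PROCESSING = "processing task submitted"
    ASSIGN_REVIEW = "review task assigned"
    CONFIRM_REVIEW = "review result confirmed"


TRANSITIONS: Dict[Tuple[State, Event], State] = {
    (State.TO_BE_INDEXED, Event.ASSIGN_PROCESSING): State.INDEXING,
    (State.INDEXING, Event.SUBMIT_PROCESSING): State.PENDING_REVIEW,
    (State.PENDING_REVIEW, Event.ASSIGN_REVIEW): State.UNDER_REVIEW,
    (State.UNDER_REVIEW, Event.CONFIRM_REVIEW): State.STORED,
}


def advance(state: State, event: Event) -> State:
    """Apply a task event to a descriptor or knowledge element state."""
    try:
        return TRANSITIONS[(state, event)]
    except KeyError:
        raise ValueError(f"{event.name} is not valid in state {state.name}") from None


state = State.TO_BE_INDEXED
for event in (
    Event.ASSIGN_PROCESSING,
    Event.SUBMIT_PROCESSING,
    Event.ASSIGN_REVIEW,
    Event.CONFIRM_REVIEW,
):
    state = advance(state, event)
```

After the four events are applied in order, the item reaches the stored state; any event applied out of order raises `ValueError`, which keeps an item from skipping review.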
This embodiment uses a method and system for building a knowledge system for the digital publishing domain. Taking document resources and vocabularies as the basis, and combining intelligent techniques with manual review as the means, it builds the knowledge system of the digital publishing domain quickly and flexibly. By rapidly importing existing resources and pairing intelligent extraction techniques with a manual review workflow, the knowledge system can be built quickly, the utilization of document resources is raised, and the waste of personnel and resources caused by a linear workflow is avoided.
Fig. 8 is a flowchart of the knowledge system construction method provided in an embodiment of the present invention. As shown in Fig. 8, the specific steps of the method are as follows:
Step S101: obtain the keywords in a target domain, and determine descriptors according to the keywords.
Step S102: build a thesaurus according to the descriptors and the thesaurus setting rules.
Step S103: operate on the descriptors according to a first operation instruction. Operations on the descriptors include: adding, editing, deleting, querying, visually displaying, exporting, associating resources, and publishing.
Step S104: operate on the thesaurus according to a second operation instruction. Operations on the thesaurus include: creating, editing, and deleting.
Step S105: obtain the knowledge elements of the target domain.
Step S106: build an ontology according to the models of the knowledge elements and the knowledge connections between different knowledge elements.
Step S107: operate on the ontology according to a third operation instruction. Operations on the ontology include: importing, exporting, editing, and deleting.
Step S108: operate on the knowledge elements according to a fourth operation instruction. Operations on the knowledge elements include: adding, editing, deleting, querying, visually displaying, and associating resources.
Step S109: obtain user information, and verify the user's operating permissions according to the user information.
The principle of the method described in steps S101 to S109 is the same as that of the knowledge system construction system described in the above embodiment, and is not repeated here.
In this embodiment, the knowledge system construction system provides a series of services during document processing, such as automatically extracting keywords and, based on the existing system, filling in descriptors, the relationships between knowledge elements, and attributes. This achieves rapid construction of the knowledge system, raises the utilization of document resources, and improves the knowledge system's capacity for dynamic learning and updating as well as its degree of intelligent automation.
On the basis of the above embodiment, obtaining the keywords in the target domain includes at least one of the following: taking descriptors newly added to the thesaurus as keywords; obtaining keywords in the target domain entered by the user; taking the words in a preset vocabulary as keywords; and extracting words from text information uploaded by the user and taking those words as keywords.
The thesaurus setting rules include: feature word definition rules for descriptors, attribute definition rules for descriptors, and relationship definition rules between descriptors.
Obtaining the knowledge elements of the target domain includes at least one of the following: receiving newly added knowledge elements entered by the user; obtaining knowledge elements of the target domain from published descriptors; determining knowledge elements of the target domain according to abstract knowledge element models, knowledge elements, and a corpus; and building an ontology from an offline ontology by OWL ontology import, thereby obtaining ontology knowledge elements.
In addition, after the ontology knowledge elements are obtained, the method further includes: setting knowledge element associations and attaching resources related to the knowledge elements.
The knowledge system construction method provided in the embodiment of the present invention can be implemented by the knowledge system construction system shown in Fig. 1; its specific functions are not repeated here.
This embodiment uses a method and system for building a knowledge system for the digital publishing domain. Taking document resources and vocabularies as the basis, and combining intelligent techniques with manual review as the means, it builds the knowledge system of the digital publishing domain quickly and flexibly. By rapidly importing existing resources and pairing intelligent extraction techniques with a manual review workflow, the knowledge system can be built quickly, the utilization of document resources is raised, and the waste of personnel and resources caused by a linear workflow is avoided.
In summary, in the embodiments of the present invention the knowledge system construction system provides a series of services during document processing, such as automatically extracting keywords and, based on the existing system, filling in descriptors, the relationships between knowledge elements, and attributes. This achieves rapid construction of the knowledge system, raises the utilization of document resources, and improves the knowledge system's capacity for dynamic learning and updating as well as its degree of intelligent automation. Using the method and system for building a knowledge system for the digital publishing domain, with document resources and vocabularies as the basis and the combination of intelligent techniques and manual review as the means, the knowledge system of the digital publishing domain can be built quickly and flexibly. By rapidly importing existing resources and pairing intelligent extraction techniques with a manual review workflow, the knowledge system can be built quickly, the utilization of document resources is raised, and the waste of personnel and resources caused by a linear workflow is avoided.
In the several embodiments provided by the present invention, it should be understood that the disclosed devices and methods may be implemented in other ways. For example, the device embodiments described above are merely illustrative. The division into units is only a division by logical function, and other divisions are possible in actual implementation; for example, multiple units or components may be combined or integrated into another system, or some features may be omitted or not performed. In addition, the mutual coupling, direct coupling, or communication connection shown or discussed may be an indirect coupling or communication connection through some interfaces, devices, or units, and may be electrical, mechanical, or of another form.
Units described as separate components may or may not be physically separate, and components shown as units may or may not be physical units; that is, they may be located in one place or distributed across multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
In addition, the functional units in the embodiments of the present invention may be integrated into one processing unit, each unit may exist physically on its own, or two or more units may be integrated into one unit. The integrated unit may be implemented in hardware, or in hardware combined with software functional units.
An integrated unit implemented in the form of a software functional unit may be stored in a computer-readable storage medium. The software functional unit is stored in a storage medium and includes instructions that cause a computer device (which may be a personal computer, a server, a network device, or the like) or a processor to perform some of the steps of the methods described in the embodiments of the present invention. The storage medium includes various media that can store program code, such as a USB flash drive, a removable hard disk, read-only memory (ROM), random access memory (RAM), a magnetic disk, or an optical disc.
Those skilled in the art will clearly understand that, for convenience and brevity of description, the division into the functional modules above is used only as an example. In practical applications, the functions may be assigned to different functional modules as needed; that is, the internal structure of the device may be divided into different functional modules to perform all or part of the functions described above. For the specific working process of the device described above, refer to the corresponding process in the foregoing method embodiments; it is not repeated here.
Finally, it should be noted that the above embodiments are intended only to illustrate the technical solutions of the present invention, not to limit them. Although the present invention has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that the technical solutions recorded in the foregoing embodiments may still be modified, or some or all of their technical features may be replaced by equivalents, and that such modifications or replacements do not cause the essence of the corresponding technical solutions to depart from the scope of the technical solutions of the embodiments of the present invention.