CN109086449A - A method for literature study based on XML fragmentation technology - Google Patents

A method for literature study based on XML fragmentation technology

Info

Publication number
CN109086449A
Authority
CN
China
Prior art keywords
fragmentation
document
xml
cnki
paragraph
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201810954078.5A
Other languages
Chinese (zh)
Inventor
宋菲菲
冯自强
相生昌
***
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
TONGFANG KNOWLEDGE NETWORK (BEIJING) TECHNOLOGY Co Ltd
TONGFANG KNOWLEDGE NETWORK DIGITAL PUBLICATION TECHNOLOGY Co Ltd
Original Assignee
TONGFANG KNOWLEDGE NETWORK (BEIJING) TECHNOLOGY Co Ltd
TONGFANG KNOWLEDGE NETWORK DIGITAL PUBLICATION TECHNOLOGY Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by TONGFANG KNOWLEDGE NETWORK (BEIJING) TECHNOLOGY Co Ltd and TONGFANG KNOWLEDGE NETWORK DIGITAL PUBLICATION TECHNOLOGY Co Ltd
Priority to CN201810954078.5A
Publication of CN109086449A
Legal status: Pending

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/10 Text processing
    • G06F40/12 Use of codes for handling textual entities
    • G06F40/14 Tree-structured documents

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Document Processing Apparatus (AREA)

Abstract

The invention discloses a method for literature study based on XML fragmentation technology, the method comprising: establishing literature topics and sub-topics through CNKI; adding materials from the CNKI general database to the platform and opening the corresponding document for study; intensively reading and revising document paragraphs; fragmenting the literature content by means of XML, and judging the XML-fragmented literature content as needed; if the reader agrees with the content, storing it in a storage container as an excerpt or note; if the reader does not agree, making responsive critical notes and carrying out literature creation. The invention reduces the reader's literature-reading workload: the reader studies in a targeted way according to the catalog of fragmented granular paragraphs, which greatly reduces study time and improves learning efficiency. Required pictures and charts can be viewed as needed, and the granular paragraphs obtained by fragmentation can be collected directly, providing material for subsequent writing and thereby improving research efficiency.

Description

A method for literature study based on XML fragmentation technology
Technical field
The present invention relates to a novel learning method, and more particularly to a method for literature study based directly on XML fragmentation technology.
Background art
Many methods and platforms currently exist for reading and studying academic literature, but all of them remain limited to reading and using the traditional PDF format; they can only offer the reader a tool for viewing documents, similar to a reader application.
XML is a markup language with strong extensibility that is used for data storage and data exchange. XML-based fragmentation of literature content was first introduced within the industry in the academic publishing field, where a single round of data processing serves multiple output needs.
The patent with application number CN201611225928.5 proposes an auxiliary method and device for consulting literature. The auxiliary method includes: obtaining an anchor map of a document and the position information of each anchor in the anchor map; generating a new anchor for document content selected by the user; calculating the anchor position of the generated new anchor in the document; adding the new anchor to the anchor map according to its anchor position; and editing each anchor in the anchor map in response to user operations.
The disadvantage of the above scheme is that the anchor map is selected or built manually, so the reader's learning efficiency is relatively low. In addition, the scheme does not address how learning proceeds after the anchor map has been adjusted.
Summary of the invention
To solve the above technical problems, the object of the present invention is to provide a method for literature study based directly on XML fragmentation technology.
The object of the present invention is achieved by the following technical solution:
A method for literature study based directly on XML fragmentation technology, comprising: establishing literature topics and sub-topics through CNKI;
adding materials from the CNKI general database to the CNKI research-learning platform, and opening the corresponding document for study;
intensively reading and revising document paragraphs;
fragmenting the literature content by means of XML;
judging the XML-fragmented literature content as needed: if the reader agrees with the content, storing it in a storage container as an excerpt or note; if the reader does not agree, making responsive critical notes and carrying out literature creation.
Compared with the prior art, one or more embodiments of the present invention can have the following advantages:
The reader's literature-reading workload is reduced; the reader studies in a targeted way according to the catalog of fragmented granular paragraphs, which greatly reduces study time and improves learning efficiency;
Required pictures and charts can be viewed as needed, and the granular paragraphs obtained by fragmentation can be collected directly, providing material for subsequent writing and thereby improving research efficiency.
Brief description of the drawings
Fig. 1 is a flowchart of the method for literature study based directly on XML fragmentation technology;
Fig. 2 is a structural diagram of the document catalog framework;
Fig. 3 illustrates the topics and sub-topics established with the CNKI learning software;
Fig. 4 illustrates adding materials from the CNKI general database to the platform and studying a document;
Fig. 5a and 5b illustrate intensive reading and revision of document paragraphs;
Fig. 6 illustrates storing fragmented content in the storage container as excerpts or notes according to the reader's needs.
Detailed description of the embodiments
To make the objects, technical solutions, and advantages of the present invention clearer, the invention is described in further detail below with reference to embodiments and the accompanying drawings.
As shown in Fig. 1, the method for literature study based on XML fragmentation technology comprises: establishing literature topics and sub-topics through CNKI;
adding materials from the CNKI general database to the platform, and opening the corresponding document for study;
intensively reading and revising document paragraphs;
fragmenting the literature content by means of XML;
judging the XML-fragmented literature content as needed: if the reader agrees with the content, storing it in a storage container as an excerpt or note; if the reader does not agree, making responsive critical notes and carrying out literature creation.
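The judging step above can be sketched in Python as follows. This is only an illustrative sketch: the patent does not specify any data structures, so `StudyRecord`, `judge_fragment`, and the rule that an agreed fragment with a comment becomes a note are all assumptions introduced here.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class StudyRecord:
    """Hypothetical holder for what the reader keeps from one document."""
    excerpts: List[str] = field(default_factory=list)        # agreed text kept verbatim
    notes: List[str] = field(default_factory=list)           # agreed text with a reader comment
    critical_notes: List[str] = field(default_factory=list)  # disagreements, material for later writing


def judge_fragment(record, fragment_id, text, agrees, comment=""):
    """Route one XML fragment according to the reader's judgment."""
    if agrees:
        if comment:
            # Assumption: an agreed fragment with a comment is stored as a note.
            record.notes.append(f"{fragment_id}: {comment}")
            return "note"
        record.excerpts.append(text)
        return "excerpt"
    # Not agreed: keep a critical note that can feed literature creation.
    record.critical_notes.append(f"{fragment_id}: {comment}")
    return "critical_note"
```

For example, `judge_fragment(record, "p1", "XML is extensible.", agrees=True)` stores the paragraph as an excerpt, while passing `agrees=False` with a comment records a critical note instead.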
The above method based on XML fragmentation technology is a learning method that runs from reading the literature, including extensive reading, intensive reading, and critical reading, through to creation.
After the document content is fragmented by means of XML, a document with fine-grained fragmented content organized by chapter, paragraph, picture, and chart is obtained.
The document includes a catalog framework structure, which includes an index map, metadata, front matter, back matter, and chapters (as shown in Fig. 2).
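As a rough illustration of fine-grained fragmentation, the sketch below walks a small XML document and emits one fragment per paragraph, figure, or table within each chapter. The element names (`document`, `metadata`, `front`, `body`, `chapter`, `para`, `figure`, `table`, `back`) are a hypothetical schema chosen for this example; the patent does not disclose the actual CNKI XML format.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample following the catalog framework: metadata,
# front matter, chapters, and back matter.
SAMPLE_XML = """<document>
  <metadata><title>Sample paper</title></metadata>
  <front><para id="p0">Abstract text.</para></front>
  <body>
    <chapter id="c1" title="Introduction">
      <para id="p1">First paragraph.</para>
      <figure id="f1" caption="System overview"/>
    </chapter>
    <chapter id="c2" title="Method">
      <para id="p2">Second paragraph.</para>
      <table id="t1" caption="Results"/>
    </chapter>
  </body>
  <back><para id="p9">Acknowledgements.</para></back>
</document>"""

FRAGMENT_TAGS = {"para", "figure", "table"}


def fragment_document(xml_text):
    """Return one dict per paragraph, figure, or table inside each chapter."""
    root = ET.fromstring(xml_text)
    fragments = []
    for chapter in root.iter("chapter"):
        for child in chapter:
            if child.tag in FRAGMENT_TAGS:
                fragments.append({
                    "id": child.get("id"),
                    "kind": child.tag,
                    "chapter": chapter.get("title"),
                    # Paragraphs carry text; figures and tables carry a caption.
                    "content": (child.text or child.get("caption") or "").strip(),
                })
    return fragments
```

Calling `fragment_document(SAMPLE_XML)` yields four fragments in document order, each tagged with the chapter it belongs to, so a reader could collect individual paragraphs or figures rather than the whole document.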
Fig. 3 illustrates the topics and sub-topics established with the CNKI learning software:
Study topics are created according to the research direction, and all files can be added, deleted, modified, and queried;
Files are managed by topic;
Materials are added by topic category, and second-level sub-topics can be established under a topic category;
Materials can be added in three ways: adding bibliographic records, adding local full texts, and retrieving literature from the CNKI general database and adding it to the platform for study;
Documents under temporary study can also be moved into a topic;
Clicking a document name starts research-learning on that document.
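The topic and sub-topic management described above might be modelled as a small tree. The sketch below is an assumption-laden illustration: the class `Topic`, its method names, and the three source labels are invented here to mirror the three ways of adding materials, and do not reflect the actual CNKI software.

```python
from dataclasses import dataclass, field
from typing import Dict, List

# Hypothetical labels for the three ways of adding materials.
ALLOWED_SOURCES = {"bibliographic_record", "local_full_text", "cnki_general_database"}


@dataclass
class Topic:
    name: str
    documents: Dict[str, str] = field(default_factory=dict)   # title -> source label
    subtopics: Dict[str, "Topic"] = field(default_factory=dict)

    def add_subtopic(self, name: str) -> "Topic":
        """Create a second-level sub-topic (or return the existing one)."""
        if name not in self.subtopics:
            self.subtopics[name] = Topic(name)
        return self.subtopics[name]

    def add_document(self, title: str, source: str) -> None:
        if source not in ALLOWED_SOURCES:
            raise ValueError(f"unknown source: {source}")
        self.documents[title] = source

    def remove_document(self, title: str) -> None:
        self.documents.pop(title, None)

    def find(self, keyword: str) -> List[str]:
        """Search titles in this topic first, then in its sub-topics."""
        hits = [title for title in self.documents if keyword in title]
        for sub in self.subtopics.values():
            hits.extend(sub.find(keyword))
        return hits
```

A topic holds its own documents plus any sub-topics, so queries can cover the whole tree while additions and deletions stay local to one topic.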
Fig. 4 illustrates adding materials from the CNKI general database to the platform and studying a document. For a document that has undergone XML fragmentation, the fragmented outline of the article is extracted automatically; the fragmented outline includes chapters, paragraphs, figures, tables, and so on. The text structure view and the full text are presented at the same time. There are three display modes: original text, original text plus notes, and all notes. The background color for reading can be selected as needed, for example an eye-protection color. The text structure pane can be hidden, and the notes pane can be hidden.
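Automatic outline extraction of this kind can be approximated as below. The XML element names and the 30-character preview length are assumptions made for the example; the patent does not describe how the outline is actually built.

```python
import xml.etree.ElementTree as ET

# Hypothetical fragmented document.
SAMPLE = """<document><body>
<chapter title="Introduction"><para>XML is a markup language.</para><figure caption="Workflow"/></chapter>
<chapter title="Method"><table caption="Results"/></chapter>
</body></document>"""


def extract_outline(xml_text):
    """Return (depth, kind, label) entries for chapters, paragraphs, figures, and tables."""
    root = ET.fromstring(xml_text)
    outline = []
    for chapter in root.iter("chapter"):
        outline.append((0, "chapter", chapter.get("title", "")))
        for child in chapter:
            if child.tag == "para":
                # Use the opening characters of the paragraph as its label.
                label = (child.text or "").strip()[:30]
            elif child.tag in ("figure", "table"):
                label = child.get("caption", "")
            else:
                continue
            outline.append((1, child.tag, label))
    return outline
```

The resulting list can be rendered as an indented structure pane next to the full text, with each entry linking back to its fragment.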
Fig. 5a and 5b illustrate intensive reading and revision of document paragraphs. The chapters, paragraphs, figures, tables, and so on of an article can be edited and revised; editing and revision include excerpting, note-taking, copying, and so on. The system automatically saves the results of editing and revision, and the saved intensive-reading content can later be viewed and further edited, revised, and used.
Fig. 6 illustrates storing fragmented content in the storage container as excerpts or notes according to the reader's needs. In the storage container, materials can be added, deleted, modified, and queried, as well as further edited, revised, and used.
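A storage container supporting add, delete, modify, and query operations could look like the following in-memory sketch. The names `Material` and `StorageContainer`, the integer identifiers, and the keyword-based query are assumptions for illustration only; a real system would presumably persist the data.

```python
import itertools
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class Material:
    material_id: int
    kind: str         # "excerpt" or "note"
    fragment_id: str  # which XML fragment the material came from
    text: str


class StorageContainer:
    """Hypothetical in-memory container for excerpts and notes."""

    def __init__(self) -> None:
        self._items: Dict[int, Material] = {}
        self._next_id = itertools.count(1)

    def add(self, kind: str, fragment_id: str, text: str) -> Material:
        if kind not in ("excerpt", "note"):
            raise ValueError(f"unknown kind: {kind}")
        material = Material(next(self._next_id), kind, fragment_id, text)
        self._items[material.material_id] = material
        return material

    def delete(self, material_id: int) -> bool:
        return self._items.pop(material_id, None) is not None

    def update(self, material_id: int, text: str) -> None:
        self._items[material_id].text = text

    def query(self, keyword: str = "", kind: Optional[str] = None) -> List[Material]:
        return [m for m in self._items.values()
                if keyword in m.text and (kind is None or m.kind == kind)]
```

Keeping the originating `fragment_id` on each material lets a later writing step trace an excerpt or note back to the paragraph, figure, or table it came from.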
The present embodiment provides the following:
Fragmented reading based on enhanced publishing
Based on XML-fragmented content, the traditional static, fixed-layout reading mode is changed. According to content publishing templates and content frameworks, an adaptive, interactive, reflowable reading mode is provided for multi-terminal carriers, giving readers a brand-new reading experience.
Compared with traditional publishing, the enhanced content includes: explanatory links for knowledge elements such as concepts, terms, and definitions (including in-text links and external links); zoomable browsing and downloading of high-resolution figures, tables, photographs, and the like; browsing and acquisition of experimental process data and result analysis data; and attached files, such as process discussions, more detailed methods, video files, and audio files.
Note-based, knowledge-network-style receptive learning
The whole process of reader study and note-taking is carried out on the XML fragmented reading page. Notes are managed and displayed by paragraph on the basis of the original text, and navigation, positioning, and linking of notes are supported. Reorganization and adaptation of the original outline catalog and content is supported, so that the knowledge framework is sorted out and content is compiled during study, directly forming a document of personal study results.
Online editing and creation based on XML
A personal online authoring tool is provided for readers and users, who can use it at any time during study to write up study results, including note compilations, investigation reports, thesis proposal reports, literature reviews, systematic reviews, academic reports, and so on. During creation, notes, excerpts, the user's own writing, CNKI literature, and so on can be cited at any time.
Collaborative research-learning based on academic social networking
Experts and scholars from all trades and professions worldwide are brought together to build a collaborative, shared learning circle that helps readers carry out problem discussion and academic exchange with other scholars and researchers in real time.
Personal knowledge management and precise knowledge services supporting lifelong learning
A mobile personal digital library that supports lifelong learning and persistent storage is provided for users, making it convenient for them to manage study materials and research results, to access and use them anytime and anywhere, and to build and manage a personal knowledge structure. Intelligent push is performed based on user behavior profiles, so that newly published academic literature is pushed to the readers who need it most as soon as possible.
The above embodiment, based on XML-fragmented content, changes the traditional static, fixed-layout reading mode and, according to content publishing templates and content frameworks, provides an adaptive, interactive, reflowable reading mode for multi-terminal carriers, giving readers a brand-new reading experience.
By fragmenting literature content with XML technology, a document with fine-grained fragmented content organized by chapter, by paragraph, and even by picture and chart can be obtained directly. The reader therefore does not need to refine an anchor map manually and can study directly from the XML-fragmented content.
Although the embodiments disclosed herein are as described above, the content is only an embodiment adopted to facilitate understanding of the present invention and is not intended to limit the invention. Any person skilled in the art to which this invention pertains may make any modification or change in the form and details of implementation without departing from the spirit and scope disclosed by the present invention, but the scope of patent protection of the present invention shall still be subject to the scope defined by the appended claims.

Claims (4)

1. A method for literature study based on XML fragmentation technology, characterized in that the method comprises:
establishing literature topics and sub-topics through CNKI;
adding materials from the CNKI general database to the CNKI research-learning platform, and opening the corresponding document for study;
intensively reading and revising document paragraphs;
fragmenting the literature content by means of XML;
judging the XML-fragmented literature content as needed: if the reader agrees with the content, storing it in a storage container as an excerpt or note; if the reader does not agree, making responsive critical notes and carrying out literature creation.
2. The method for literature study based on XML fragmentation technology according to claim 1, characterized in that the literature reading includes extensive reading, intensive reading, and critical reading.
3. The method for literature study based on XML fragmentation technology according to claim 1, characterized in that after the document content is fragmented by means of XML, a document with fine-grained fragmented content organized by chapter, paragraph, picture, and chart is obtained.
4. The method for literature study based on XML fragmentation technology according to claim 1, characterized in that the document includes a catalog framework structure, which includes an index map, metadata, front matter, back matter, and chapters.
CN201810954078.5A 2018-08-21 2018-08-21 A method for literature study based on XML fragmentation technology Pending CN109086449A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810954078.5A CN109086449A (en) 2018-08-21 2018-08-21 A method for literature study based on XML fragmentation technology

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810954078.5A CN109086449A (en) 2018-08-21 2018-08-21 A method for literature study based on XML fragmentation technology

Publications (1)

Publication Number Publication Date
CN109086449A true CN109086449A (en) 2018-12-25

Family

ID=64794062

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810954078.5A Pending CN109086449A (en) 2018-08-21 2018-08-21 A method for literature study based on XML fragmentation technology

Country Status (1)

Country Link
CN (1) CN109086449A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112016289A (en) * 2020-08-28 2020-12-01 《中国学术期刊(光盘版)》电子杂志社有限公司 Thesis writing method based on big data technology

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130185366A1 (en) * 2012-01-05 2013-07-18 Joby Joy System and method for managing user generated content
CN104317785A (en) * 2014-10-13 2015-01-28 安徽华贞信息科技有限公司 Internet paragraph level topic identifying system
CN105550940A (en) * 2015-11-25 2016-05-04 中国南方电网有限责任公司电网技术研究中心 Power grid equipment standard index data mining and extraction method
CN105787741A (en) * 2016-02-17 2016-07-20 林慕新 Electronic contract signing system based on mobile phone client, and application method thereof
CN106254685A (en) * 2016-09-23 2016-12-21 努比亚技术有限公司 A kind of content-label method and mobile terminal
CN106802884A (en) * 2017-02-17 2017-06-06 同方知网(北京)技术有限公司 A kind of method of format document text fragmentation
CN107220814A (en) * 2017-06-01 2017-09-29 同方知网数字出版技术股份有限公司 The full media that a kind of the problem of technology based on fragmentation is oriented to publish collaboration discussion system method
CN108153717A (en) * 2017-12-29 2018-06-12 北京仁和汇智信息技术有限公司 A kind of structuring processing method and processing device of papers in sci-tech word document

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20181225