JP2023040248A - テキスト情報抽取方法、装置、電子機器、記憶媒体及びコンピュータプログラム - Google Patents

テキスト情報抽取方法、装置、電子機器、記憶媒体及びコンピュータプログラム Download PDF

Info

Publication number
JP2023040248A
JP2023040248A JP2023003753A JP2023003753A JP2023040248A JP 2023040248 A JP2023040248 A JP 2023040248A JP 2023003753 A JP2023003753 A JP 2023003753A JP 2023003753 A JP2023003753 A JP 2023003753A JP 2023040248 A JP2023040248 A JP 2023040248A
Authority
JP
Japan
Prior art keywords
target
entity
relationship
slot
text
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP2023003753A
Other languages
English (en)
Japanese (ja)
Inventor
ジャンドン ソン,
Jiandong Sun
ヤビン シー,
Yabing Shi
イェ ジャン,
Ye Jiang
チュングァン ツァイ,
Chengquan Zhang
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd filed Critical Beijing Baidu Netcom Science and Technology Co Ltd
Publication of JP2023040248A publication Critical patent/JP2023040248A/ja
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/31Indexing; Data structures therefor; Storage structures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/31Indexing; Data structures therefor; Storage structures
    • G06F16/316Indexing structures
    • G06F16/322Trees
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/23Clustering techniques
    • G06F18/232Non-hierarchical techniques
    • G06F18/2323Non-hierarchical techniques based on graph theory, e.g. minimum spanning trees [MST] or graph cuts
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2413Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on distances to training or reference patterns
    • G06F18/24147Distances to closest patterns, e.g. nearest neighbour classification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • Software Systems (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Evolutionary Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Biophysics (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Discrete Mathematics (AREA)
  • General Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Biomedical Technology (AREA)
  • Health & Medical Sciences (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Machine Translation (AREA)
JP2023003753A 2022-06-24 2023-01-13 テキスト情報抽取方法、装置、電子機器、記憶媒体及びコンピュータプログラム Pending JP2023040248A (ja)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202210732269.3A CN115080742B (zh) 2022-06-24 2022-06-24 文本信息抽取方法、装置、设备、存储介质以及程序产品
CN202210732269.3 2022-06-24

Publications (1)

Publication Number Publication Date
JP2023040248A true JP2023040248A (ja) 2023-03-22

Family

ID=83256480

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2023003753A Pending JP2023040248A (ja) 2022-06-24 2023-01-13 テキスト情報抽取方法、装置、電子機器、記憶媒体及びコンピュータプログラム

Country Status (3)

Country Link
JP (1) JP2023040248A (zh)
KR (1) KR20230009345A (zh)
CN (1) CN115080742B (zh)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116383655A (zh) * 2023-04-07 2023-07-04 北京百度网讯科技有限公司 样本生成方法、模型训练方法、文本处理方法及装置
CN116522935A (zh) * 2023-03-29 2023-08-01 北京德风新征程科技股份有限公司 文本数据处理方法、处理装置和电子设备

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116628230A (zh) * 2023-07-25 2023-08-22 航天宏图信息技术股份有限公司 属性关联关系的表达方法、装置、电子设备及存储介质
CN117174234B (zh) * 2023-11-03 2024-01-05 南京都昌信息科技有限公司 医疗文本数据分析方法及***

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105701253B (zh) * 2016-03-04 2019-03-26 南京大学 中文自然语言问句语义化的知识库自动问答方法
CN111611399A (zh) * 2020-04-15 2020-09-01 广发证券股份有限公司 一种基于自然语言处理的资讯事件图谱化***及方法
CN114595686B (zh) * 2022-03-11 2023-02-03 北京百度网讯科技有限公司 知识抽取方法、知识抽取模型的训练方法及装置

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116522935A (zh) * 2023-03-29 2023-08-01 北京德风新征程科技股份有限公司 文本数据处理方法、处理装置和电子设备
CN116522935B (zh) * 2023-03-29 2024-03-29 北京德风新征程科技股份有限公司 文本数据处理方法、处理装置和电子设备
CN116383655A (zh) * 2023-04-07 2023-07-04 北京百度网讯科技有限公司 样本生成方法、模型训练方法、文本处理方法及装置
CN116383655B (zh) * 2023-04-07 2024-01-05 北京百度网讯科技有限公司 样本生成方法、模型训练方法、文本处理方法及装置

Also Published As

Publication number Publication date
CN115080742A (zh) 2022-09-20
KR20230009345A (ko) 2023-01-17
CN115080742B (zh) 2023-09-05

Similar Documents

Publication Publication Date Title
Lin et al. A natural‐language‐based approach to intelligent data retrieval and representation for cloud BIM
JP2023040248A (ja) テキスト情報抽取方法、装置、電子機器、記憶媒体及びコンピュータプログラム
US20230004721A1 (en) Method for training semantic representation model, device and storage medium
JP7301922B2 (ja) 意味検索方法、装置、電子機器、記憶媒体およびコンピュータプログラム
US10191946B2 (en) Answering natural language table queries through semantic table representation
US11003701B2 (en) Dynamic faceted search on a document corpus
CN111738001A (zh) 同义词识别模型的训练方法、同义词确定方法及设备
Wang et al. NLP-based query-answering system for information extraction from building information models
US20240220772A1 (en) Method of evaluating data, training method, electronic device, and storage medium
CN113282762A (zh) 知识图谱构建方法、装置、电子设备和存储介质
CN114861889A (zh) 深度学习模型的训练方法、目标对象检测方法和装置
JP7369228B2 (ja) ユーザ興味画像の生成方法、装置、電子機器及び記憶媒体
US20220129623A1 (en) Performance characteristics of cartridge artifacts over text pattern constructs
Zhou et al. Deep personalized medical recommendations based on the integration of rating features and review sentiment analysis
US20220358906A1 (en) Semi-structured content aware bi-directional transformer
CN114036921A (zh) 一种政策信息匹配方法和装置
CN108038109A (zh) 从非结构化文本中提取特征词的方法及***、计算机程序
US8971644B1 (en) System and method for determining an annotation for an image
JP7390442B2 (ja) 文書処理モデルのトレーニング方法、装置、機器、記憶媒体及びプログラム
Zhuo Consumer demand behavior mining and product recommendation based on online product review mining and fuzzy sets
US11663251B2 (en) Question answering approach to semantic parsing of mathematical formulas
JP2023002475A (ja) コンピュータシステム、コンピュータプログラムおよびコンピュータで実装される方法(因果関係知識の識別および抽出)
Corredera Arbide et al. Affective computing for smart operations: a survey and comparative analysis of the available tools, libraries and web services
CN114254642A (zh) 实体信息处理方法、装置、电子设备和介质
Rajesh et al. Significance of natural language processing in data analysis using business intelligence

Legal Events

Date Code Title Description
A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20230113

A977 Report on retrieval

Free format text: JAPANESE INTERMEDIATE CODE: A971007

Effective date: 20240126

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20240206

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20240418