JP2010511958A - Gesture/speech integrated recognition system and method - Google Patents

Gesture/speech integrated recognition system and method

Info

Publication number
JP2010511958A
Authority
JP
Japan
Prior art keywords
gesture
integrated
voice
feature information
speech
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP2009540141A
Other languages
English (en)
Japanese (ja)
Inventor
ヨン ジユ ジョン
ムン ソン ハン
ジェ ソン イ
ジュン ソク パク
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Electronics and Telecommunications Research Institute ETRI
Original Assignee
Electronics and Telecommunications Research Institute ETRI
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Electronics and Telecommunications Research Institute ETRI
Priority claimed from PCT/KR2007/006189 (WO2008069519A1)
Publication of JP2010511958A
Current legal status: Pending

Classifications

    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00: Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16: Sound input; Sound output

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • User Interface Of Digital Computer (AREA)
  • Image Analysis (AREA)
JP2009540141A 2006-12-04 2007-12-03 Gesture/speech integrated recognition system and method Pending JP2010511958A (ja)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
KR20060121836 2006-12-04
KR1020070086575A KR100948600B1 (ko) 2006-12-04 2007-08-28 Gesture/speech fusion recognition system and method
PCT/KR2007/006189 WO2008069519A1 (en) 2006-12-04 2007-12-03 Gesture/speech integrated recognition system and method

Publications (1)

Publication Number Publication Date
JP2010511958A (ja) 2010-04-15

Family

ID=39806143

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2009540141A Pending JP2010511958A (ja) 2006-12-04 2007-12-03 Gesture/speech integrated recognition system and method

Country Status (2)

Country Link
JP (1) JP2010511958A (ko)
KR (1) KR100948600B1 (ko)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2011081541A (ja) * 2009-10-06 2011-04-21 Canon Inc Input device and control method thereof
WO2018061743A1 (ja) * 2016-09-28 2018-04-05 コニカミノルタ株式会社 Wearable terminal
CN108248413A (zh) * 2016-12-28 2018-07-06 广州市移电科技有限公司 Street lamp provided with a charging pile
JP2018163400A (ja) * 2017-03-24 2018-10-18 日本電信電話株式会社 Model learning device, uttered word estimation device, model learning method, uttered word estimation method, and program
US11521038B2 (en) 2018-07-19 2022-12-06 Samsung Electronics Co., Ltd. Electronic apparatus and control method thereof
KR20230129964A (ko) * 2016-11-03 2023-09-11 삼성전자주식회사 Electronic device and control method thereof

Families Citing this family (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR101329100B1 (ko) * 2008-12-08 2013-11-14 한국전자통신연구원 Context awareness apparatus and context awareness method using the same
US8600166B2 (en) * 2009-11-06 2013-12-03 Sony Corporation Real time hand tracking, pose classification and interface control
US20130033644A1 (en) * 2011-08-05 2013-02-07 Samsung Electronics Co., Ltd. Electronic apparatus and method for controlling thereof
EP2555536A1 (en) 2011-08-05 2013-02-06 Samsung Electronics Co., Ltd. Method for controlling electronic apparatus based on voice recognition and motion recognition, and electronic apparatus applying the same
KR101971697B1 (ko) * 2012-02-24 2019-04-23 삼성전자주식회사 Method and apparatus for user authentication using composite biometric information in a user device
KR102254484B1 (ko) * 2014-05-08 2021-05-21 현대모비스 주식회사 Gesture hybrid recognition apparatus and method
KR102265143B1 (ko) 2014-05-16 2021-06-15 삼성전자주식회사 Input processing apparatus and method
KR101650769B1 (ko) 2015-05-28 2016-08-25 미디어젠(주) In-vehicle speech recognition system using gesture recognition
US10986287B2 (en) 2019-02-19 2021-04-20 Samsung Electronics Co., Ltd. Capturing a photo using a signature motion of a mobile device
CN110287363A (zh) * 2019-05-22 2019-09-27 深圳壹账通智能科技有限公司 Deep-learning-based resource push method, apparatus, device, and storage medium
KR102322817B1 (ko) * 2020-09-10 2021-11-08 한국항공대학교산학협력단 CNN-based HMI system using Doppler radar and voice sensor, sensor data processing apparatus of the HMI system, and operating method thereof
KR102539047B1 (ko) * 2021-06-04 2023-06-02 주식회사 피앤씨솔루션 Method and apparatus for improving recognition performance of hand gestures and voice commands for the input interface of an augmented reality glasses device

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
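JPH1173297A (ja) * 1997-08-29 1999-03-16 Hitachi Ltd Recognition method using the temporal relationship of multimodal expressions by speech and gesture
JPH11288342A (ja) * 1998-02-09 1999-10-19 Toshiba Corp Interface device for multimodal input/output device and method thereof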

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH05108302A (ja) * 1991-10-14 1993-04-30 Nippon Telegr & Teleph Corp <Ntt> Information input method using speech and pointing actions

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2011081541A (ja) * 2009-10-06 2011-04-21 Canon Inc Input device and control method thereof
WO2018061743A1 (ja) * 2016-09-28 2018-04-05 コニカミノルタ株式会社 Wearable terminal
KR20230129964A (ko) * 2016-11-03 2023-09-11 삼성전자주식회사 Electronic device and control method thereof
US11908465B2 (en) 2016-11-03 2024-02-20 Samsung Electronics Co., Ltd. Electronic device and controlling method thereof
KR102643027B1 (ko) 2016-11-03 2024-03-05 삼성전자주식회사 Electronic device and control method thereof
CN108248413A (zh) * 2016-12-28 2018-07-06 广州市移电科技有限公司 Street lamp provided with a charging pile
JP2018163400A (ja) * 2017-03-24 2018-10-18 日本電信電話株式会社 Model learning device, uttered word estimation device, model learning method, uttered word estimation method, and program
US11521038B2 (en) 2018-07-19 2022-12-06 Samsung Electronics Co., Ltd. Electronic apparatus and control method thereof

Also Published As

Publication number Publication date
KR20080050994A (ko) 2008-06-10
KR100948600B1 (ko) 2010-03-24

Similar Documents

Publication Publication Date Title
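JP2010511958A (ja) Gesture/speech integrated recognition system and method
WO2008069519A1 (en) Gesture/speech integrated recognition system and method
WO2021036644A1 (zh) Artificial-intelligence-based speech-driven animation method and apparatus
CN107799126B (zh) Voice endpoint detection method and apparatus based on supervised machine learning
WO2021082941A1 (zh) Video person recognition method and apparatus, storage medium, and electronic device
CN105843381B (zh) Data processing method for realizing multimodal interaction and multimodal interaction system
US8793134B2 (en) System and method for integrating gesture and sound for controlling device
KR101604593B1 (ko) Method for modifying a representation based on a user command
US20150325240A1 (en) Method and system for speech input
WO2018113650A1 (zh) Virtual reality language interaction system and method
CN110310623A (zh) Sample generation method, model training method, apparatus, medium, and electronic device
Madhuri et al. Vision-based sign language translation device
JP2012014394A (ja) User instruction acquisition device, user instruction acquisition program, and television receiver
KR20100062207A (ko) Method and apparatus for providing animation effects during a video call
CN110309254A (zh) Intelligent robot and human-computer interaction method
CN113129867B (zh) Training method for speech recognition model, speech recognition method, apparatus, and device
CN109241924A (zh) Internet-based multi-platform information interaction system
CN111326152A (zh) Voice control method and apparatus
CN106502382A (zh) Active interaction method and system for intelligent robot
Su et al. Liplearner: Customizable silent speech interactions on mobile devices
CN115206306A (zh) Voice interaction method, apparatus, device, and system
Song et al. A review of audio-visual fusion with machine learning
CN107452381B (zh) Multimedia speech recognition apparatus and method
CN111462732B (zh) Speech recognition method and apparatus
CN108388399B (zh) State management method and system for virtual idol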

Legal Events

Date Code Title Description
2011-12-13 A131 Notification of reasons for refusal (Japanese intermediate code: A131)
2012-03-13 A521 Request for written amendment filed (Japanese intermediate code: A523)
2012-08-31 A02 Decision of refusal (Japanese intermediate code: A02)