AR072863A1 - Method and discriminator for classifying different segments of a signal - Google Patents

Method and discriminator for classifying different segments of a signal

Info

Publication number
AR072863A1
Authority
AR
Argentina
Prior art keywords
signal
term
short
long
type
Prior art date
Application number
ARP090102544A
Other languages
English (en)
Original Assignee
Fraunhofer Ges Forschung
Priority date
2008-07-11
Filing date
2009-07-07
Publication date
2010-09-29
Application filed by Fraunhofer Ges Forschung

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78 Detection of presence or absence of voice signals
    • G10L25/81 Detection of presence or absence of voice signals for discriminating voice from music

    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16 Vocoder architecture
    • G10L19/18 Vocoders using multiple modes
    • G10L19/20 Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding

    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis

    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16 Vocoder architecture
    • G10L19/18 Vocoders using multiple modes
    • G10L19/22 Mode decision, i.e. based on audio signal content versus external parameters

    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00

    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78 Detection of presence or absence of voice signals
    • G10L2025/783 Detection of presence or absence of voice signals based on threshold decision

    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Image Analysis (AREA)

Abstract

To classify different segments of a signal that comprises segments of at least a first type and a second type, for example speech and music segments, the signal is short-term classified (150) on the basis of at least one short-term feature extracted from the signal, and a short-term classification result (152) is delivered. The signal is also long-term classified (154) on the basis of at least one short-term feature and at least one long-term feature extracted from the signal, and a long-term classification result (156) is delivered. The short-term classification result (152) and the long-term classification result (156) are combined (158) to provide an output signal (160) that indicates whether a segment of the signal is of the first type or of the second type.
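The two-branch structure described in the abstract can be sketched roughly as follows. This is only an illustration under stated assumptions: the abstract does not say which classifier or which combination rule is used, so the logistic scoring, the weight vectors, the `long_weight` parameter, and all function names here are hypothetical placeholders rather than the patented method.

```python
import math
from typing import Sequence


def _sigmoid(x: float) -> float:
    """Map a real-valued score to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def short_term_classify(short_feats: Sequence[float],
                        weights: Sequence[float]) -> float:
    """Block 150: produce the short-term result (152).

    Uses short-term features only. The logistic model is an assumption;
    the result is read as the probability of the first type (e.g. speech).
    """
    return _sigmoid(sum(w * f for w, f in zip(weights, short_feats)))


def long_term_classify(short_feats: Sequence[float],
                       long_feats: Sequence[float],
                       weights: Sequence[float]) -> float:
    """Block 154: produce the long-term result (156).

    Uses both short-term and long-term features, as the abstract states.
    The logistic model is again an assumption.
    """
    feats = list(short_feats) + list(long_feats)
    return _sigmoid(sum(w * f for w, f in zip(weights, feats)))


def combine(short_result: float, long_result: float,
            long_weight: float = 0.5) -> str:
    """Block 158: merge both results into the output signal (160).

    A weighted average followed by a 0.5 threshold is a hypothetical rule.
    """
    p = (1.0 - long_weight) * short_result + long_weight * long_result
    return "first" if p >= 0.5 else "second"
```

In this sketch, `combine(short_term_classify(s, ws), long_term_classify(s, l, wl))` yields the per-segment label; raising `long_weight` makes the decision follow the long-term branch more closely.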
ARP090102544A 2008-07-11 2009-07-07 Method and discriminator for classifying different segments of a signal AR072863A1 (es)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US7987508P 2008-07-11 2008-07-11

Publications (1)

Publication Number Publication Date
AR072863A1 (es) 2010-09-29

Family

ID=40851974

Family Applications (1)

Application Number Title Priority Date Filing Date
ARP090102544A AR072863A1 (es) 2008-07-11 2009-07-07 Method and discriminator for classifying different segments of a signal

Country Status (20)

Country Link
US (1) US8571858B2 (es)
EP (1) EP2301011B1 (es)
JP (1) JP5325292B2 (es)
KR (2) KR101281661B1 (es)
CN (1) CN102089803B (es)
AR (1) AR072863A1 (es)
AU (1) AU2009267507B2 (es)
BR (1) BRPI0910793B8 (es)
CA (1) CA2730196C (es)
CO (1) CO6341505A2 (es)
ES (1) ES2684297T3 (es)
HK (1) HK1158804A1 (es)
MX (1) MX2011000364A (es)
MY (1) MY153562A (es)
PL (1) PL2301011T3 (es)
PT (1) PT2301011T (es)
RU (1) RU2507609C2 (es)
TW (1) TWI441166B (es)
WO (1) WO2010003521A1 (es)
ZA (1) ZA201100088B (es)

Families Citing this family (46)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CA2871498C (en) * 2008-07-11 2017-10-17 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Audio encoder and decoder for encoding and decoding audio samples
CN101847412B (zh) * 2009-03-27 2012-02-15 华为技术有限公司 Method and device for classifying audio signals
KR101666521B1 (ko) * 2010-01-08 2016-10-14 삼성전자 주식회사 Method and apparatus for detecting the pitch period of an input signal
RU2562384C2 (ru) 2010-10-06 2015-09-10 Фраунхофер-Гезелльшафт Цур Фердерунг Дер Ангевандтен Форшунг Е.Ф. Method and device for processing an audio signal and for providing a higher temporal granularity for a combined unified speech and audio codec (USAC)
US8521541B2 (en) * 2010-11-02 2013-08-27 Google Inc. Adaptive audio transcoding
CN103000172A (zh) * 2011-09-09 2013-03-27 中兴通讯股份有限公司 Signal classification method and device
US20130090926A1 (en) * 2011-09-16 2013-04-11 Qualcomm Incorporated Mobile device context information using speech detection
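WO2013061584A1 (ja) * 2011-10-28 2013-05-02 パナソニック株式会社 Sound signal hybrid decoder, sound signal hybrid encoder, sound signal decoding method, and sound signal encoding method
CN103139930B (zh) 2011-11-22 2015-07-08 华为技术有限公司 Connection establishment method and user equipment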
US9111531B2 (en) * 2012-01-13 2015-08-18 Qualcomm Incorporated Multiple coding mode signal classification
EP2702776B1 (en) * 2012-02-17 2015-09-23 Huawei Technologies Co., Ltd. Parametric encoder for encoding a multi-channel audio signal
US20130317821A1 (en) * 2012-05-24 2013-11-28 Qualcomm Incorporated Sparse signal detection with mismatched models
ES2604652T3 (es) * 2012-08-31 2017-03-08 Telefonaktiebolaget Lm Ericsson (Publ) Method and device for voice activity detection
US9589570B2 (en) 2012-09-18 2017-03-07 Huawei Technologies Co., Ltd. Audio classification based on perceptual quality for low or medium bit rates
RU2656681C1 (ru) * 2012-11-13 2018-06-06 Самсунг Электроникс Ко., Лтд. Method and device for determining an encoding mode, method and device for encoding audio signals, and method and device for decoding audio signals
US9100255B2 (en) * 2013-02-19 2015-08-04 Futurewei Technologies, Inc. Frame structure for filter bank multi-carrier (FBMC) waveforms
BR112015019543B1 (pt) 2013-02-20 2022-01-11 Fraunhofer-Gesellschaft Zur Forderung Der Angewandten Forschung E.V. Apparatus for encoding an audio signal, decoder for decoding an audio signal, method for encoding and method for decoding an audio signal
CN104347067B (zh) * 2013-08-06 2017-04-12 华为技术有限公司 Audio signal classification method and device
US9666202B2 (en) * 2013-09-10 2017-05-30 Huawei Technologies Co., Ltd. Adaptive bandwidth extension and apparatus for the same
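KR101498113B1 (ko) * 2013-10-23 2015-03-04 광주과학기술원 Apparatus and method for extending the bandwidth of a sound signal
KR102552293B1 (ko) * 2014-02-24 2023-07-06 삼성전자주식회사 Signal classification method and apparatus, and audio encoding method and apparatus using the same
CN107452391B (zh) 2014-04-29 2020-08-25 华为技术有限公司 Audio coding method and related device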
WO2015174912A1 (en) * 2014-05-15 2015-11-19 Telefonaktiebolaget L M Ericsson (Publ) Audio signal classification and coding
CN107424622B (zh) 2014-06-24 2020-12-25 华为技术有限公司 Audio encoding method and apparatus
US9886963B2 (en) 2015-04-05 2018-02-06 Qualcomm Incorporated Encoder selection
CN113035212A (zh) * 2015-05-20 2021-06-25 瑞典爱立信有限公司 Encoding of multi-channel audio signals
US10706873B2 (en) * 2015-09-18 2020-07-07 Sri International Real-time speaker state analytics platform
US20190139567A1 (en) * 2016-05-12 2019-05-09 Nuance Communications, Inc. Voice Activity Detection Feature Based on Modulation-Phase Differences
US10699538B2 (en) * 2016-07-27 2020-06-30 Neosensory, Inc. Method and system for determining and providing sensory experiences
US10198076B2 (en) 2016-09-06 2019-02-05 Neosensory, Inc. Method and system for providing adjunct sensory information to a user
CN107895580B (zh) * 2016-09-30 2021-06-01 华为技术有限公司 Audio signal reconstruction method and device
US10744058B2 (en) * 2017-04-20 2020-08-18 Neosensory, Inc. Method and system for providing information to a user
US10325588B2 (en) * 2017-09-28 2019-06-18 International Business Machines Corporation Acoustic feature extractor selected according to status flag of frame of acoustic signal
CN113168839B (zh) * 2018-12-13 2024-01-23 杜比实验室特许公司 Dual-ended media intelligence
RU2761940C1 (ru) * 2018-12-18 2021-12-14 Общество С Ограниченной Ответственностью "Яндекс" Methods and electronic devices for identifying a user utterance from a digital audio signal
CN110288983B (zh) * 2019-06-26 2021-10-01 上海电机学院 Voice processing method based on machine learning
WO2021062276A1 (en) 2019-09-25 2021-04-01 Neosensory, Inc. System and method for haptic stimulation
US11467668B2 (en) 2019-10-21 2022-10-11 Neosensory, Inc. System and method for representing virtual object information with haptic stimulation
US11079854B2 (en) 2020-01-07 2021-08-03 Neosensory, Inc. Method and system for haptic stimulation
CN115428068A (zh) * 2020-04-16 2022-12-02 沃伊斯亚吉公司 Method and device for speech/music classification and core encoder selection in a sound codec
US11497675B2 (en) 2020-10-23 2022-11-15 Neosensory, Inc. Method and system for multimodal stimulation
WO2022147615A1 (en) * 2021-01-08 2022-07-14 Voiceage Corporation Method and device for unified time-domain / frequency domain coding of a sound signal
US11862147B2 (en) 2021-08-13 2024-01-02 Neosensory, Inc. Method and system for enhancing the intelligibility of information for a user
US20230147185A1 (en) * 2021-11-08 2023-05-11 Lemon Inc. Controllable music generation
US11995240B2 (en) 2021-11-16 2024-05-28 Neosensory, Inc. Method and system for conveying digital texture information to a user
CN116070174A (zh) * 2023-03-23 2023-05-05 长沙融创智胜电子科技有限公司 Multi-category target recognition method and ***

Family Cites Families (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
IT1232084B (it) * 1989-05-03 1992-01-23 Cselt Centro Studi Lab Telecom Coding system for wideband audio signals
JPH0490600A (ja) * 1990-08-03 1992-03-24 Sony Corp Speech recognition device
JPH04342298A (ja) * 1991-05-20 1992-11-27 Nippon Telegr & Teleph Corp <Ntt> Instantaneous pitch analysis method and voiced/unvoiced determination method
RU2049456C1 (ru) * 1993-06-22 1995-12-10 Вячеслав Алексеевич Сапрыкин Method for transmitting speech signals
US6134518A (en) * 1997-03-04 2000-10-17 International Business Machines Corporation Digital audio signal coding using a CELP coder and a transform coder
JP3700890B2 (ja) * 1997-07-09 2005-09-28 ソニー株式会社 Signal identification device and signal identification method
RU2132593C1 (ru) * 1998-05-13 1999-06-27 Академия управления МВД России Multichannel device for transmitting speech signals
SE0004187D0 (sv) 2000-11-15 2000-11-15 Coding Technologies Sweden Ab Enhancing the performance of coding systems that use high frequency reconstruction methods
US7469206B2 (en) 2001-11-29 2008-12-23 Coding Technologies Ab Methods for improving high frequency reconstruction
US6785645B2 (en) * 2001-11-29 2004-08-31 Microsoft Corporation Real-time speech and music classifier
AUPS270902A0 (en) * 2002-05-31 2002-06-20 Canon Kabushiki Kaisha Robust detection and classification of objects in audio using limited training data
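JP4348970B2 (ja) * 2003-03-06 2009-10-21 ソニー株式会社 Information detection device and method, and program
JP2004354589A (ja) * 2003-05-28 2004-12-16 Nippon Telegr & Teleph Corp <Ntt> Acoustic signal discrimination method, acoustic signal discrimination device, and acoustic signal discrimination program
JP4725803B2 (ja) * 2004-06-01 2011-07-13 日本電気株式会社 Information providing system and method, and information providing program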
US7130795B2 (en) * 2004-07-16 2006-10-31 Mindspeed Technologies, Inc. Music detection with low-complexity pitch correlation algorithm
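JP4587916B2 (ja) * 2005-09-08 2010-11-24 シャープ株式会社 Audio signal discrimination device, sound quality adjustment device, content display device, program, and recording medium
JP2010503881A (ja) 2006-09-13 2010-02-04 テレフオンアクチーボラゲット エル エム エリクソン(パブル) Method and apparatus for speech/audio transmitters and receivers
CN1920947B (zh) * 2006-09-15 2011-05-11 清华大学 Speech/music detector for low bit rate audio coding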
WO2008045846A1 (en) * 2006-10-10 2008-04-17 Qualcomm Incorporated Method and apparatus for encoding and decoding audio signals
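KR101016224B1 (ko) * 2006-12-12 2011-02-25 프라운호퍼-게젤샤프트 추르 푀르데룽 데어 안제반텐 포르슝 에 파우 Encoder, decoder, and methods for encoding and decoding data segments representing a time-domain data stream
KR100964402B1 (ko) * 2006-12-14 2010-06-17 삼성전자주식회사 Method and apparatus for determining the encoding mode of an audio signal, and method and apparatus for encoding/decoding an audio signal using the same
KR100883656B1 (ko) * 2006-12-28 2009-02-18 삼성전자주식회사 Method and apparatus for classifying an audio signal, and method and apparatus for encoding/decoding an audio signal using the same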
WO2010001393A1 (en) * 2008-06-30 2010-01-07 Waves Audio Ltd. Apparatus and method for classification and segmentation of audio content, based on the audio signal

Also Published As

Publication number Publication date
KR101380297B1 (ko) 2014-04-02
CO6341505A2 (es) 2011-11-21
HK1158804A1 (en) 2012-07-20
RU2011104001A (ru) 2012-08-20
AU2009267507A1 (en) 2010-01-14
TWI441166B (zh) 2014-06-11
ES2684297T3 (es) 2018-10-02
WO2010003521A1 (en) 2010-01-14
US20110202337A1 (en) 2011-08-18
KR20110039254A (ko) 2011-04-15
JP5325292B2 (ja) 2013-10-23
EP2301011B1 (en) 2018-07-25
PL2301011T3 (pl) 2019-03-29
MY153562A (en) 2015-02-27
BRPI0910793A2 (pt) 2016-08-02
CN102089803B (zh) 2013-02-27
BRPI0910793B8 (pt) 2021-08-24
CA2730196A1 (en) 2010-01-14
AU2009267507B2 (en) 2012-08-02
BRPI0910793B1 (pt) 2020-11-24
MX2011000364A (es) 2011-02-25
CA2730196C (en) 2014-10-21
ZA201100088B (en) 2011-08-31
KR101281661B1 (ko) 2013-07-03
RU2507609C2 (ru) 2014-02-20
EP2301011A1 (en) 2011-03-30
PT2301011T (pt) 2018-10-26
JP2011527445A (ja) 2011-10-27
TW201009813A (en) 2010-03-01
KR20130036358A (ko) 2013-04-11
CN102089803A (zh) 2011-06-08
US8571858B2 (en) 2013-10-29

Similar Documents

Publication Publication Date Title
AR072863A1 (es) Method and discriminator for classifying different segments of a signal
IL200487A0 (en) Method and apparatus for detecting computer fraud
AR078575A1 (es) Method for detecting speech segments
ECSP067023A (es) Authenticity indicator
AR072552A1 (es) An apparatus and a method for calculating a number of spectral envelopes
CO6430492A2 (es) Application of machine learning methods for mining association rules in animal and plant data sets containing molecular genetic markers, followed by classification or prediction using features created
ES2570359T3 (es) Ultraconserved regions encoding ncRNAs
AR063074A1 (es) A wind turbine, a method for damping edgewise oscillations in one or more blades of a wind turbine, and use thereof
CO2019012771A2 (es) Diagnostic assays for detecting, quantifying and/or tracking microbes and other analytes
EA201600085A1 (ru) Kit for detecting soybean event syht0h2
CL2009001692A1 (es) Method for monitoring a moving conveyor belt in a conveyor belt system in order to assess damage to the moving belt and detect the position of the damage on the belt, said conveyor belt having a plurality of integrated reinforcing cords and identification tags.
BRPI0922952B8 (pt) Methods of detecting and identifying a microorganism in solid or semi-solid media
CL2008001365A1 (es) Composition comprising at least one high molecular weight ethylene-based interpolymer and at least one low molecular weight ethylene-based interpolymer; and article comprising said composition
BR0116791A (pt) Method and arrangement for determining a noise signal of a noise source
BR112018076026A8 (pt) Trip completion determination for on-demand transport
CR11504A (es) Improved detection of MAGE-A expression
CL2013002153A1 (es) A method and system for identifying a valuable document.
AR073119A1 (es) Methods for counting corn silks or other elongated filaments and use of the count to characterize the filaments or their origin
ATE493290T1 (de) Breath alcohol tester
BR112015007273A2 (pt) Improved signaling of layer identifiers for operation points of a video coder
CL2019000550A1 (es) Slider for machine track assembly.
AR093172A1 (es) Chemically tagged polymers for simplified quantification and related methods
DE502006009015D1 (de) Dungen
PA8847601A1 (es) Method and system for classifying audiovisual information
PE20120189A1 (es) Method and system for real-time identification of an audiovisual advertisement in a data stream

Legal Events

Date Code Title Description
FG Grant, registration