GB2620817A8 - Method and apparatus for on-device personalised analysis using a machine learning model

Method and apparatus for on-device personalised analysis using a machine learning model

Info

Publication number
GB2620817A8
Authority
GB
United Kingdom
Prior art keywords
analysis
model
trained
data item
personalised
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
GB2306985.9A
Other versions
GB202306985D0 (en)
GB2620817A (en)
Inventor
Li Da
Bohdal Ondrej
Hu Xu
Hospedales Timothy
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Samsung Electronics Co Ltd
Original Assignee
Samsung Electronics Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Samsung Electronics Co Ltd
Priority to PCT/KR2023/006858 (WO2023224430A1)
Publication of GB202306985D0
Publication of GB2620817A
Publication of GB2620817A8
Legal status: Pending

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/40 Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
    • G06F16/43 Querying
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods
    • G06N3/096 Transfer learning
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50 Information retrieval; Database structures therefor; File system structures therefor of still image data
    • G06F16/53 Querying
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60 Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/63 Querying
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70 Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/73 Querying
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00 Machine learning
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G06N3/0464 Convolutional networks [CNN, ConvNet]
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods
    • G06N3/0895 Weakly supervised learning, e.g. semi-supervised or self-supervised learning
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods
    • G06N3/0985 Hyperparameter optimisation; Meta-learning; Learning-to-learn
    • B PERFORMING OPERATIONS; TRANSPORTING
    • B25 HAND TOOLS; PORTABLE POWER-DRIVEN TOOLS; MANIPULATORS
    • B25J MANIPULATORS; CHAMBERS PROVIDED WITH MANIPULATION DEVICES
    • B25J9/00 Programme-controlled manipulators
    • B25J9/16 Programme controls
    • B25J9/1612 Programme controls characterised by the hand, wrist, grip control
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G06N3/044 Recurrent networks, e.g. Hopfield networks
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G06N3/045 Combinations of networks
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G06N3/0475 Generative networks
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 Indexing scheme for image analysis or image enhancement
    • G06T2207/20 Special algorithmic details
    • G06T2207/20084 Artificial neural networks [ANN]
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/06 Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/065 Adaptation
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/16 Speech classification or search using artificial neural networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Mathematical Physics (AREA)
  • Computational Linguistics (AREA)
  • Computing Systems (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • Molecular Biology (AREA)
  • Biophysics (AREA)
  • Biomedical Technology (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Databases & Information Systems (AREA)
  • Multimedia (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Medical Informatics (AREA)
  • User Interface Of Digital Computer (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present application relates to a computer-implemented method for performing personalised visual or audio analysis on an electronic device using a trained machine learning (ML) model. The method comprises receiving a query data item for analysis by the trained ML model; comparing the received query data item with a plurality of support data items stored on the electronic device to determine a similarity between the received query data item and each of the support data items; and performing personalised analysis on the received query data item, using the trained ML model, the support data items and the determined similarities. The method may make use of a feature extractor or a cross-attention module of the trained ML model. Potential visual analysis applications of the method include image classification, object recognition, semantic segmentation, grasp prediction, navigation, and image enhancement. Potential audio analysis applications include speech recognition, audio enhancement, noise suppression, and language translation. The method may be of particular use in mobile computing devices, and the described embodiments include use of the method to control autonomous robots or smartphones.
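
The comparison step in the abstract can be illustrated with a minimal Python sketch. It is not the claimed implementation: cosine similarity, the softmax weighting, the additive fusion, the `temperature` value and all function names are assumptions made for illustration, and the feature vectors stand in for the output of a feature extractor that is not shown here.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length feature vectors (assumed metric)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def softmax(scores, temperature=1.0):
    """Turn similarity scores into weights that sum to 1."""
    scaled = [s / temperature for s in scores]
    peak = max(scaled)  # subtract the maximum for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def personalise(query, supports, temperature=0.1):
    """Hypothetical helper: compare a query feature vector with each support
    feature vector, then fuse the query with a similarity-weighted summary
    of the supports (an attention-style weighting, assumed for illustration)."""
    if not supports:
        raise ValueError("at least one support data item is required")
    similarities = [cosine_similarity(query, s) for s in supports]
    weights = softmax(similarities, temperature)
    context = [
        sum(w * s[i] for w, s in zip(weights, supports))
        for i in range(len(query))
    ]
    fused = [q + c for q, c in zip(query, context)]  # additive fusion (assumption)
    return similarities, weights, fused


if __name__ == "__main__":
    query_features = [1.0, 0.0]
    support_features = [[1.0, 0.0], [0.0, 1.0]]
    sims, weights, fused = personalise(query_features, support_features)
    print(sims, weights, fused)
```

In this sketch the fused vector would then be passed to the remaining layers of the trained model. A lower `temperature` concentrates the weight on the most similar support items, while a higher one averages over all of them.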
GB2306985.9A Pending GB2620817A (en) 2022-05-19 2023-05-11 Method and apparatus for on-device personalised analysis using a machine learning model

Priority Applications (1)

Application Number Priority Date Filing Date Title
PCT/KR2023/006858 WO2023224430A1 (en) 2022-05-19 2023-05-19 Method and apparatus for on-device personalised analysis using a machine learning model

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
GBGB2207373.8A GB202207373D0 (en) 2022-05-19 2022-05-19 Method and apparatus for on-device user personalisation

Publications (3)

Publication Number Publication Date
GB202306985D0 (en) 2023-06-28
GB2620817A (en) 2024-01-24
GB2620817A8 (en) 2024-02-21

Family

ID=82220449

Family Applications (2)

Application Number Title Priority Date Filing Date
GBGB2207373.8A Ceased GB202207373D0 (en) 2022-05-19 2022-05-19 Method and apparatus for on-device user personalisation
GB2306985.9A Pending GB2620817A (en) 2022-05-19 2023-05-11 Method and apparatus for on-device personalised analysis using a machine learning model

Family Applications Before (1)

Application Number Title Priority Date Filing Date
GBGB2207373.8A Ceased GB202207373D0 (en) 2022-05-19 2022-05-19 Method and apparatus for on-device user personalisation

Country Status (2)

Country Link
GB (2) GB202207373D0 (en)
WO (1) WO2023224430A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117930028B (en) * 2024-03-21 2024-05-17 成都赛力斯科技有限公司 Method, system, equipment and medium for predicting thermal failure of new energy vehicle battery

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5490223A (en) * 1993-06-22 1996-02-06 Kabushiki Kaisha Toshiba Pattern recognition apparatus
CN104036474B (en) * 2014-06-12 2017-12-19 厦门美图之家科技有限公司 A kind of Automatic adjustment method of brightness of image and contrast
WO2016142285A1 (en) * 2015-03-06 2016-09-15 Thomson Licensing Method and apparatus for image search using sparsifying analysis operators
EP3480766A1 (en) * 2015-04-23 2019-05-08 Rovi Guides, Inc. Systems and methods for improving accuracy in media asset recommendation models
KR101842612B1 (en) * 2016-10-12 2018-03-27 고려대학교 산학협력단 Method and apparatus for recognizing target sound using deep learning
JP7293988B2 (en) * 2019-08-27 2023-06-20 富士通株式会社 Learning program, determination processing program, learning device, determination processing device, learning method, and determination processing method
CN111462059B (en) * 2020-03-24 2023-09-29 湖南大学 Parallel processing method and device for intelligent target detection of fetal ultrasonic image

Also Published As

Publication number Publication date
GB202207373D0 (en) 2022-07-06
GB202306985D0 (en) 2023-06-28
WO2023224430A1 (en) 2023-11-23
GB2620817A (en) 2024-01-24

Similar Documents

Publication Publication Date Title
Choi et al. Convolutional attention networks for multimodal emotion recognition from speech and text data
Forgues et al. Bootstrapping dialog systems with word embeddings
CN107491435B (en) Method and device for automatically identifying user emotion based on computer
GB2620817A8 (en) Method and apparatus for on-device personalised analysis using a machine learning model
CN107943786B (en) Chinese named entity recognition method and system
US10909972B2 (en) Spoken language understanding using dynamic vocabulary
Zhang et al. Co-training succeeds in computational paralinguistics
Vinnarasu et al. Speech to text conversion and summarization for effective understanding and documentation
Zayene et al. 3D convolutional recurrent global neural network for speech emotion recognition
CN114722822A (en) Named entity recognition method, device, equipment and computer readable storage medium
Ando et al. Speech emotion recognition based on listener-dependent emotion perception models
US20200043477A1 (en) Sensor-Processing Systems Including Neuromorphic Processing Modules and Methods Thereof
Granger et al. Comparing hybrid NN-HMM and RNN for temporal modeling in gesture recognition
Qasim et al. Arabic speech recognition using deep learning methods: Literature review
Hayat et al. On the use of interpretable CNN for personality trait recognition from audio
Heracleous et al. Integrating language and emotion features for multilingual speech emotion recognition
Ku et al. Deep convolutional neural network with bottleneck structure using raw seismic waveform for earthquake classification
Yue et al. BiLSTM Chinese Text Sentiment Analysis Based on Pre‐attention
Talukdar et al. Training Dynamic based data filtering may not work for NLP datasets
Level et al. Introduction of semantic model to help speech recognition
Phyu et al. Articles classification in Myanmar language
Liu et al. Keyword retrieving in continuous speech using connectionist temporal classification
Vemulapalli et al. Audio-video based character recognition for handwritten mathematical content in classroom videos
Chaudhari et al. Artificial Intelligence System for Emotion Recognition and Text Analytics
Jithendra et al. Cognitive Model for Object Detection based on Speech-to-Text Conversion