GB2620817A8 - Method and apparatus for on-device personalised analysis using a machine learning model - Google Patents
Method and apparatus for on-device personalised analysis using a machine learning model Download PDFInfo
- Publication number
- GB2620817A8 GB2620817A8 GB2306985.9A GB202306985A GB2620817A8 GB 2620817 A8 GB2620817 A8 GB 2620817A8 GB 202306985 A GB202306985 A GB 202306985A GB 2620817 A8 GB2620817 A8 GB 2620817A8
- Authority
- GB
- United Kingdom
- Prior art keywords
- analysis
- model
- trained
- data item
- personalised
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title abstract 7
- 238000010801 machine learning Methods 0.000 title abstract 2
- 230000000007 visual effect Effects 0.000 abstract 2
- 230000011218 segmentation Effects 0.000 abstract 1
- 230000001629 suppression Effects 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/40—Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
- G06F16/43—Querying
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/096—Transfer learning
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/50—Information retrieval; Database structures therefor; File system structures therefor of still image data
- G06F16/53—Querying
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/60—Information retrieval; Database structures therefor; File system structures therefor of audio data
- G06F16/63—Querying
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/70—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F16/73—Querying
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N20/00—Machine learning
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/0464—Convolutional networks [CNN, ConvNet]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/0895—Weakly supervised learning, e.g. semi-supervised or self-supervised learning
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/0985—Hyperparameter optimisation; Meta-learning; Learning-to-learn
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B25—HAND TOOLS; PORTABLE POWER-DRIVEN TOOLS; MANIPULATORS
- B25J—MANIPULATORS; CHAMBERS PROVIDED WITH MANIPULATION DEVICES
- B25J9/00—Programme-controlled manipulators
- B25J9/16—Programme controls
- B25J9/1612—Programme controls characterised by the hand, wrist, grip control
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/044—Recurrent networks, e.g. Hopfield networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/0475—Generative networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20084—Artificial neural networks [ANN]
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/16—Speech classification or search using artificial neural networks
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Software Systems (AREA)
- Mathematical Physics (AREA)
- Computational Linguistics (AREA)
- Computing Systems (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Computation (AREA)
- Molecular Biology (AREA)
- Biophysics (AREA)
- Biomedical Technology (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Databases & Information Systems (AREA)
- Multimedia (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Medical Informatics (AREA)
- User Interface Of Digital Computer (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The present application relates to a computer-implemented method for performing personalised visual or audio analysis on an electronic device using a trained machine learning, ML, model. The method comprises receiving a query data item for analysis by the trained ML model; comparing the received query data item with a plurality of support data items stored on the electronic device to determine a similarity between each of the received query data item and the support data items; and performing personalised analysis on the received query data item, using the trained ML model, the support data items and the determined similarities. The method may make use of a feature extractor or a cross-attention module of the trained ML model. Potential visual analysis applications of the method include its use in image classification, object recognition, semantic segmentation, grasp prediction, navigation, and image enhancement. Potential audio analysis applications include speech recognition, audio enhancement, noise suppression, and language translation. The method may be of particular use in mobile computing devices, and described embodiments include the use of the method to control autonomous robots or smartphones.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/KR2023/006858 WO2023224430A1 (en) | 2022-05-19 | 2023-05-19 | Method and apparatus for on-device personalised analysis using a machine learning model |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
GBGB2207373.8A GB202207373D0 (en) | 2022-05-19 | 2022-05-19 | Method and apparatus for on-device user personalisation |
Publications (3)
Publication Number | Publication Date |
---|---|
GB202306985D0 GB202306985D0 (en) | 2023-06-28 |
GB2620817A GB2620817A (en) | 2024-01-24 |
GB2620817A8 true GB2620817A8 (en) | 2024-02-21 |
Family
ID=82220449
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
GBGB2207373.8A Ceased GB202207373D0 (en) | 2022-05-19 | 2022-05-19 | Method and apparatus for on-device user personalisation |
GB2306985.9A Pending GB2620817A (en) | 2022-05-19 | 2023-05-11 | Method and apparatus for on-device personalised analysis using a machine learning model |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
GBGB2207373.8A Ceased GB202207373D0 (en) | 2022-05-19 | 2022-05-19 | Method and apparatus for on-device user personalisation |
Country Status (2)
Country | Link |
---|---|
GB (2) | GB202207373D0 (en) |
WO (1) | WO2023224430A1 (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN117930028B (en) * | 2024-03-21 | 2024-05-17 | 成都赛力斯科技有限公司 | Method, system, equipment and medium for predicting thermal failure of new energy vehicle battery |
Family Cites Families (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5490223A (en) * | 1993-06-22 | 1996-02-06 | Kabushiki Kaisha Toshiba | Pattern recognition apparatus |
CN104036474B (en) * | 2014-06-12 | 2017-12-19 | 厦门美图之家科技有限公司 | A kind of Automatic adjustment method of brightness of image and contrast |
WO2016142285A1 (en) * | 2015-03-06 | 2016-09-15 | Thomson Licensing | Method and apparatus for image search using sparsifying analysis operators |
EP3480766A1 (en) * | 2015-04-23 | 2019-05-08 | Rovi Guides, Inc. | Systems and methods for improving accuracy in media asset recommendation models |
KR101842612B1 (en) * | 2016-10-12 | 2018-03-27 | 고려대학교 산학협력단 | Method and apparatus for recognizing target sound using deep learning |
JP7293988B2 (en) * | 2019-08-27 | 2023-06-20 | 富士通株式会社 | Learning program, determination processing program, learning device, determination processing device, learning method, and determination processing method |
CN111462059B (en) * | 2020-03-24 | 2023-09-29 | 湖南大学 | Parallel processing method and device for intelligent target detection of fetal ultrasonic image |
-
2022
- 2022-05-19 GB GBGB2207373.8A patent/GB202207373D0/en not_active Ceased
-
2023
- 2023-05-11 GB GB2306985.9A patent/GB2620817A/en active Pending
- 2023-05-19 WO PCT/KR2023/006858 patent/WO2023224430A1/en unknown
Also Published As
Publication number | Publication date |
---|---|
GB202207373D0 (en) | 2022-07-06 |
GB202306985D0 (en) | 2023-06-28 |
WO2023224430A1 (en) | 2023-11-23 |
GB2620817A (en) | 2024-01-24 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Choi et al. | Convolutional attention networks for multimodal emotion recognition from speech and text data | |
Forgues et al. | Bootstrapping dialog systems with word embeddings | |
CN107491435B (en) | Method and device for automatically identifying user emotion based on computer | |
GB2620817A8 (en) | Method and apparatus for on-device personalised analysis using a machine learning model | |
CN107943786B (en) | Chinese named entity recognition method and system | |
US10909972B2 (en) | Spoken language understanding using dynamic vocabulary | |
Zhang et al. | Co-training succeeds in computational paralinguistics | |
Vinnarasu et al. | Speech to text conversion and summarization for effective understanding and documentation | |
Zayene et al. | 3D convolutional recurrent global neural network for speech emotion recognition | |
CN114722822A (en) | Named entity recognition method, device, equipment and computer readable storage medium | |
Ando et al. | Speech emotion recognition based on listener-dependent emotion perception models | |
US20200043477A1 (en) | Sensor-Processing Systems Including Neuromorphic Processing Modules and Methods Thereof | |
Granger et al. | Comparing hybrid NN-HMM and RNN for temporal modeling in gesture recognition | |
Qasim et al. | Arabic speech recognition using deep learning methods: Literature review | |
Hayat et al. | On the use of interpretable CNN for personality trait recognition from audio | |
Heracleous et al. | Integrating language and emotion features for multilingual speech emotion recognition | |
Ku et al. | Deep convolutional neural network with bottleneck structure using raw seismic waveform for earthquake classification | |
Yue et al. | BiLSTM Chinese Text Sentiment Analysis Based on Pre‐attention | |
Talukdar et al. | Training Dynamic based data filtering may not work for NLP datasets | |
Level et al. | Introduction of semantic model to help speech recognition | |
Phyu et al. | Articles classification in Myanmar language | |
Liu et al. | Keyword retrieving in continuous speech using connectionist temporal classification | |
Vemulapalli et al. | Audio-video based character recognition for handwritten mathematical content in classroom videos | |
Chaudhari et al. | Artificial Intelligence System for Emotion Recognition and Text Analytics | |
Jithendra et al. | Cognitive Model for Object Detection based on Speech-to-Text Conversion |