BR112023005490A2 - Processamento de imagens usando redes neurais com base em autoatenção - Google Patents

Processamento de imagens usando redes neurais com base em autoatenção

Info

Publication number
BR112023005490A2
BR112023005490A2 BR112023005490A BR112023005490A BR112023005490A2 BR 112023005490 A2 BR112023005490 A2 BR 112023005490A2 BR 112023005490 A BR112023005490 A BR 112023005490A BR 112023005490 A BR112023005490 A BR 112023005490A BR 112023005490 A2 BR112023005490 A2 BR 112023005490A2
Authority
BR
Brazil
Prior art keywords
image
images
attention
self
processing
Prior art date
Application number
BR112023005490A
Other languages
English (en)
Inventor
Matthew Tinmouth Houlsby Neil
Gelly Sylvain
D Uszkoreit Jakob
Zhai Xiaohua
Heigold Georg
Klaus Beyer Lucas
Kolesnikov Alexander
Johannes Lorenz Minderer Matthias
Weissenborn Dirk
Deghani Mostafa
Dosovitskiy Alexey
Unterthiner Thomas
Original Assignee
Google Llc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Google Llc filed Critical Google Llc
Publication of BR112023005490A2 publication Critical patent/BR112023005490A2/pt

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/97Determining parameters from multiple pictures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/084Backpropagation, e.g. using gradient descent
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/764Arrangements for image or video recognition or understanding using pattern recognition or machine learning using classification, e.g. of video objects
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/82Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20081Training; Learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20084Artificial neural networks [ANN]

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Data Mining & Analysis (AREA)
  • Health & Medical Sciences (AREA)
  • Software Systems (AREA)
  • Computing Systems (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Mathematical Physics (AREA)
  • Molecular Biology (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Multimedia (AREA)
  • Databases & Information Systems (AREA)
  • Medical Informatics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Evolutionary Biology (AREA)
  • Image Analysis (AREA)
  • Image Processing (AREA)

Abstract

PROCESSAMENTO DE IMAGENS USANDO REDES NEURAIS COM BASE EM AUTOATENÇÃO. Métodos, sistemas e aparelhos, incluindo programas de computador codificados em meio de armazenamento de computador, para processamento de imagens usando redes neurais com base em autoatenção. Um dos métodos inclui obter uma ou mais imagens compreendendo uma pluralidade de pixels; determinar, para cada imagem das uma ou mais imagens, uma pluralidade de remendos de imagem da imagem, em que cada remendo de imagem compreende um subconjunto diferente dos pixels da imagem; processar, para cada imagem das uma ou mais imagens, a correspondente pluralidade de remendos de imagem para gerar uma sequência de entrada compreendendo um respectivo elemento de entrada em cada uma de uma pluralidade de posições de entrada, em que uma pluralidade dos elementos de entrada corresponde aos respectivos remendos de imagem diferentes; e processar as sequências de entrada usando uma rede neural para gerar uma saída de rede que representa as uma ou mais imagens, em que a rede neural compreende uma ou mais camadas de rede neural de autoatenção.
BR112023005490A 2020-10-02 2021-10-04 Processamento de imagens usando redes neurais com base em autoatenção BR112023005490A2 (pt)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US202063087135P 2020-10-02 2020-10-02
PCT/US2021/053424 WO2022072940A1 (en) 2020-10-02 2021-10-04 Processing images using self-attention based neural networks

Publications (1)

Publication Number Publication Date
BR112023005490A2 true BR112023005490A2 (pt) 2023-04-25

Family

ID=78414760

Family Applications (1)

Application Number Title Priority Date Filing Date
BR112023005490A BR112023005490A2 (pt) 2020-10-02 2021-10-04 Processamento de imagens usando redes neurais com base em autoatenção

Country Status (11)

Country Link
US (2) US20220108478A1 (pt)
EP (1) EP4196917A1 (pt)
JP (1) JP7536893B2 (pt)
KR (1) KR20230004710A (pt)
CN (1) CN115605878A (pt)
AU (2) AU2021354030B2 (pt)
BR (1) BR112023005490A2 (pt)
CA (1) CA3193958A1 (pt)
MX (1) MX2023003531A (pt)
TW (1) TW202215303A (pt)
WO (1) WO2022072940A1 (pt)

Families Citing this family (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112287978B (zh) * 2020-10-07 2022-04-15 武汉大学 一种基于自注意力上下文网络的高光谱遥感图像分类方法
US11983920B2 (en) * 2021-12-20 2024-05-14 International Business Machines Corporation Unified framework for multigrid neural network architecture
WO2023229094A1 (ko) * 2022-05-27 2023-11-30 주식회사 엔씨소프트 행동 예측 방법 및 장치
CN114972897A (zh) * 2022-06-06 2022-08-30 京东科技控股股份有限公司 图像特征处理方法、装置、产品、介质及设备
CN114862881A (zh) * 2022-07-11 2022-08-05 四川大学 一种基于pet-ct的跨模态注意力肿瘤分割方法、***
KR102663467B1 (ko) * 2022-11-09 2024-05-09 국민대학교산학협력단 포인트 클라우드의 고해상화 장치 및 방법
CN115457042B (zh) * 2022-11-14 2023-03-24 四川路桥华东建设有限责任公司 一种基于蒸馏学习的螺纹套丝表面缺陷检测的方法及***
WO2024155850A1 (en) * 2023-01-18 2024-07-25 Vayu Robotics, Inc. Systems and methods for performing autonomous navigation

Family Cites Families (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6400996B1 (en) * 1999-02-01 2002-06-04 Steven M. Hoffberg Adaptive pattern recognition based control system and method
US6850252B1 (en) * 1999-10-05 2005-02-01 Steven M. Hoffberg Intelligent electronic appliance system and method
US7006881B1 (en) * 1991-12-23 2006-02-28 Steven Hoffberg Media recording device with remote graphic user interface
US6058190A (en) * 1997-05-27 2000-05-02 Pitney Bowes Inc. Method and system for automatic recognition of digital indicia images deliberately distorted to be non readable
US7904187B2 (en) * 1999-02-01 2011-03-08 Hoffberg Steven M Internet appliance system and method
JP4098021B2 (ja) 2002-07-30 2008-06-11 富士フイルム株式会社 シーン識別方法および装置ならびにプログラム
US7966072B2 (en) * 2005-02-18 2011-06-21 Palo Alto Investors Methods and compositions for treating obesity-hypoventilation syndrome
JP5258694B2 (ja) * 2009-07-27 2013-08-07 富士フイルム株式会社 医用画像処理装置および方法並びにプログラム
ITRM20130022A1 (it) * 2013-01-11 2014-07-12 Natural Intelligent Technologies S R L Procedimento e apparato di riconoscimento di scrittura a mano
US9536293B2 (en) * 2014-07-30 2017-01-03 Adobe Systems Incorporated Image assessment using deep convolutional neural networks
US9659384B2 (en) * 2014-10-03 2017-05-23 EyeEm Mobile GmbH. Systems, methods, and computer program products for searching and sorting images by aesthetic quality
US10803143B2 (en) * 2015-07-30 2020-10-13 Siemens Healthcare Gmbh Virtual biopsy techniques for analyzing diseases
EP3267368B1 (en) * 2016-07-06 2020-06-03 Accenture Global Solutions Limited Machine learning image processing
KR102559202B1 (ko) * 2018-03-27 2023-07-25 삼성전자주식회사 3d 렌더링 방법 및 장치
US10853725B2 (en) 2018-05-18 2020-12-01 Deepmind Technologies Limited Neural networks with relational memory
WO2020018585A1 (en) * 2018-07-16 2020-01-23 Accel Robotics Corporation Autonomous store tracking system
EP3932318A4 (en) 2019-02-28 2022-04-20 FUJIFILM Corporation LEARNING METHOD, LEARNING SYSTEM, LEARNED MODEL, PROGRAM AND DEVICE FOR GENERATION OF SUPER RESOLUTION IMAGES
US10825221B1 (en) * 2019-04-23 2020-11-03 Adobe Inc. Music driven human dancing video synthesis
WO2021176566A1 (ja) 2020-03-03 2021-09-10 日本電気株式会社 特徴変換装置、画像認識システム、特徴変換方法および非一時的なコンピュータ可読媒体

Also Published As

Publication number Publication date
JP2023533907A (ja) 2023-08-07
US11983903B2 (en) 2024-05-14
WO2022072940A1 (en) 2022-04-07
KR20230004710A (ko) 2023-01-06
AU2021354030A1 (en) 2022-11-24
MX2023003531A (es) 2023-04-19
CA3193958A1 (en) 2022-04-07
US20220108478A1 (en) 2022-04-07
AU2021354030B2 (en) 2023-11-30
AU2024201361A1 (en) 2024-03-21
US20240062426A1 (en) 2024-02-22
EP4196917A1 (en) 2023-06-21
CN115605878A (zh) 2023-01-13
TW202215303A (zh) 2022-04-16
JP7536893B2 (ja) 2024-08-20

Similar Documents

Publication Publication Date Title
BR112023005490A2 (pt) Processamento de imagens usando redes neurais com base em autoatenção
BR112019004798A8 (pt) Método implantado por computador e mídia de armazenamento
PH12019502186A1 (en) Method and apparatus for processing transaction requests
WO2020086123A8 (en) Data processing method and apparatus
BR112021020026A2 (pt) Método para processamento vídeo, aparelho para processar de dados de vídeo, meio de armazenamento e meio de gravação legíveis por computador não transitórios
EP3816886A4 (en) METHOD, APPARATUS, MANAGEMENT SYSTEM APPLIED TO A CUSTOMER GOODS SYSTEM, AND COMPUTER STORAGE SERVER AND MEDIUM
BR112018077198A2 (pt) sistemas e métodos para identificar conteúdos correspondentes
BR112018013602A2 (pt) métodos e sistemas de processamento de imagem
CY1124647T1 (el) Ταξινομηση βιολογικου ιστου σε ηλεκτρονικο υπολογιστη
MY190598A (en) Blockchain data processing method and apparatus
BR112015023345A2 (pt) criação in situ de alvos planos de recurso natural
WO2018185560A3 (en) METHODS, DEVICES AND SYSTEMS FOR ANATOMIC SURFACE EVALUATION
CY1124626T1 (el) Υπολογιστικο συστημα και μεθοδος υψηλων επιδοσεων
EP2953065A3 (en) Generating representations of input sequences using neural networks
BR112019013609A8 (pt) Método e aparelho de processamento de informação
JP2017054514A5 (pt)
BR112016014223A2 (pt) Sistemas, métodos e aparelho para recuperação de imagem
JP2017529628A5 (pt)
GB2545070A (en) Generating molecular encoding information for data storage
PL437715A1 (pl) Generowanie obrazu syntetycznego z chmury punktów 3D
BR112012014627A2 (pt) Método para geração computadorizada de uma imagem aprimorada com base em uma pluralidade de imagens, sistema capaz de gerar uma imagem aprimorada com base em uma pluralidade de imagens, método para detecção computadorizada de objetos em imagens de sar registradas e programa de computador
BR112019001747A2 (pt) método e aparelho de transcepção de dados, mídia de armazenamento legível por computador e produto de programa de computador
BR112021026664A2 (pt) Corte de vídeo automatizado usando importância relativa de objetos identificados
BR112015000367A2 (pt) método e dispositivo de comutação de imagem
EP3668015A3 (en) A multi-processor neural network processing apparatus