US20180197037A1 - Image upsampling system, training method thereof and image upsampling method - Google Patents

Image upsampling system, training method thereof and image upsampling method Download PDF

Info

Publication number
US20180197037A1
US20180197037A1 US15/518,404 US201615518404A US2018197037A1 US 20180197037 A1 US20180197037 A1 US 20180197037A1 US 201615518404 A US201615518404 A US 201615518404A US 2018197037 A1 US2018197037 A1 US 2018197037A1
Authority
US
United States
Prior art keywords
muxer
image
layer
convolutional network
upsampling
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
US15/518,404
Other versions
US10019642B1 (en
Inventor
Pablo Navarrete Michelini
Lijie Zhang
Jianmin He
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
BOE Technology Group Co Ltd
Original Assignee
BOE Technology Group Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by BOE Technology Group Co Ltd filed Critical BOE Technology Group Co Ltd
Assigned to BOE TECHNOLOGY GROUP CO., LTD. reassignment BOE TECHNOLOGY GROUP CO., LTD. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: HE, JIANMIN, ZHANG, LIJIE, Navarrete Michelini, Pablo
Application granted granted Critical
Publication of US10019642B1 publication Critical patent/US10019642B1/en
Publication of US20180197037A1 publication Critical patent/US20180197037A1/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • G06K9/4628
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T3/00Geometric image transformations in the plane of the image
    • G06T3/40Scaling of whole images or parts thereof, e.g. expanding or contracting
    • G06T3/4053Scaling of whole images or parts thereof, e.g. expanding or contracting based on super-resolution, i.e. the output image resolution being higher than the sensor resolution
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/25Fusion techniques
    • G06F18/251Fusion techniques of input or preprocessed data
    • G06K9/6289
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T3/00Geometric image transformations in the plane of the image
    • G06T3/40Scaling of whole images or parts thereof, e.g. expanding or contracting

Definitions

  • the present disclosure relates to an image signal processing technology, and more particularly, to an image upsampling system, a training method thereof, and an image upsampling method.
  • the resolution of an image can be raised by means of the standard methods such as a bicubic upsampling process and a linear upsampling process (for enlarging image resolution).
  • a 2 ⁇ upsampling mode is shown, four different filters: filter F 1 , filter F 2 , filter F 3 and filter F 4 , are applied to all the pixels (plus adjacent pixels) of an input image, each filter generates a quarter of the pixels of the output image, and the process may be understood as applying four filters (convolution) to the input image, and then interleaving or multiplexing to create a single output image that doubles the width and the height of the image.
  • embodiments of the present disclosure provide an image upsampling system, a training method thereof and an image upsampling method, for implementing high-quality upsampling of image resolution based on a convolutional neural network, to reduce a computation amount of upsampling, and to improve flexibility of adjusting the upsampling multiple.
  • An aspect of the present disclosure provides an image upsampling system, comprising: at least one first convolutional network and at least one muxer layer that are cascaded; an signal input end of the image upsampling system is connected with a signal input end of a first convolutional network in the at least one first convolutional network, and a signal output end of the image upsampling system is connected with a signal output end of a last muxer layer in the at least one muxer layer; a signal input end of every muxer layer in the at least one muxer layer is connected with a signal output end of a first convolutional network located in a stage prior to the muxer layer in the at least one first convolutional network, or connected with a signal output end of another muxer layer located in a stage prior to the muxer layer in the at least one muxer layer; the first convolutional network is configured for converting an image input to its signal input end into a plurality of feature images and outputting the feature images to the signal input end of the
  • a number of muxer layers is two or three.
  • a signal input end of each muxer layer is respectively connected with a signal output end of one corresponding first convolutional network in the at least one first convolutional network.
  • the muxer layers in the case where there are provided a plurality of muxer layers, the muxer layers have a same upsampling multiple.
  • the muxer layer has an upsampling multiple which is a prime number.
  • the muxer layer has an upsampling multiple which is 2.
  • the muxer layer is a self-adaptive interpolation filter.
  • the image upsampling system further comprises: a second convolutional network, whose signal input end is connected with a signal output end of the last muxer layer in the at least one muxer layer, and whose signal output end is connected with the signal output end of the image upsampling system; the second convolutional network is configured for optimizing picture quality of the feature images output by the muxer layer.
  • the first convolutional network and the second convolutional network include at least one convolution layer composed of a plurality of filter units.
  • Another aspect of the present disclosure further provides a display device, comprising any one of the above image upsampling systems.
  • Still another aspect of the present disclosure further provides a training method of any of one the image upsampling systems, comprising: initializing respective parameters in the image upsampling system; by using an original image signal as an output signal of the image upsampling system and using an image signal obtained by down-sampling the original image signal as an input signal of the image upsampling system, adjusting the respective parameters in the image upsampling system to allow the down-sampled image signal subjected to upsampling processing with the adjusted respective parameters to be the same as the original image signal.
  • initializing of the respective parameters in the image upsampling system includes: initializing weights Wij of respective filter units of respective convolution layers of the first convolutional network and the second convolutional network in the image upsampling system according to a formula below:
  • W ij ⁇ 1 / ( m ) ( i , j ) ⁇ ⁇ are ⁇ ⁇ preset ⁇ ⁇ anchor ⁇ ⁇ pixel 0 other ⁇ ⁇ pixel ⁇
  • m represents the number of feature images input to the filter unit; and initializing the biases of respective filter units to 0.
  • initializing of the respective parameters in the image upsampling system includes: initializing the weights Wij of respective filter units of respective convolution layers of the first convolutional network and the second convolutional network in the image upsampling system according to a formula below:
  • m represents the number of feature images input to the filter unit, and uniform ( ⁇ 1,1) represents a random number selected between ( ⁇ 1,1); and initializing the biases of respective filter units to 0.
  • FIG. 1 Further still another aspect of the present disclosure further provides a, method for performing image upsampling with any one of the above image upsampling systems, comprising: converting, by a first convolutional network, an input image input to the first convolutional network into a plurality of feature images and outputting the feature images; synthesizing, by a muxer layer, every n ⁇ n feature images in the feature images input to the muxer layer into a feature image with a resolution amplified by n ⁇ n times as larger as the input feature images, and outputting the same; the number of feature images input to the muxer layer being a multiple of n ⁇ n, and n being an integer greater than 1.
  • a feature images of an image are obtained by a convolutional network, upsampling processing is performed on the image with a muxer layer to synthesize every n ⁇ n feature images in an input signal into a feature image with the resolution amplified by n ⁇ n times; in the upsampling procedure with the muxer layer, information of respective input feature images is recorded in the generated feature image without loss; and thus, every time when the image passes through a muxer layer which can upsample by a multiple of n ⁇ , the image resolution can be increased by n ⁇ n times.
  • each muxer layer can execute an upsampling function of a single multiple, so that the system can flexibly adjust the upsampling multiple(s) according to needs, implement a universal upsampling system with respect to different upsampling multiples.
  • the muxer layer amplifies the resolution of the feature image(s) by n ⁇ n times, an signal amount input to a muxer layer or a first convolutional network of a next cascaded stage can be reduced so as to simplify the computation amount of upsampling.
  • FIG. 1 is a schematic diagram of a 2 ⁇ upsampling in the state of art
  • FIG. 2 a to FIG. 2 e are respectively structural schematic diagrams of an image upsampling system provided by an embodiment of the present disclosure
  • FIG. 3 is a schematic diagram of upsampling by a muxer layer in an image upsampling system provided by an embodiment of the present disclosure
  • FIG. 4 is a structural schematic diagram of a convolutional network in an image upsampling system provided by an embodiment of the present disclosure.
  • FIG. 5 is a schematic flow chart of a training method of an image upsampling system provided by an embodiment of the present disclosure.
  • a convolutional neural network is a type of artificial neural network, and has become a research hotspot in the fields such as speech analysis and image recognition at present.
  • the weight sharing network structure of a convolutional neural network renders the convolutional neural network more similar to a biological neural network, and it can reduce the complexity of a network model and reduces the number of weights. This advantage is more obvious when the input of the network is a multi-dimensional image, and the image can be directly used as the network input, which avoids the complicated process such as feature extraction and data reconstruction in a traditional identification algorithm.
  • the convolutional neural network is a multi-layer sensor specially designed to identify a two-dimensional shape. This type of network structure can be applied to translation, scaling, tilting, or other forms of transformation.
  • an embodiment of the present disclosure provides an image upsampling system, a method for training the image upsampling system, and a method for performing upsampling on an input image according to the trained image upsampling system.
  • the image upsampling system performs upsampling by using the convolutional neural network, and, under the premise of ensuring substantially no loss of image information, can effectively converts a low-resolution image into a high-resolution image.
  • An image upsampling system provided by an embodiment of the present disclosure, as illustrated in FIG. 2 a to FIG. 2 d , comprises: at least one first convolutional network (CN) and at least one muxer layer (ML) cascaded; an signal input end of the image upsampling system is connected with a signal input end of one first convolutional network, and a signal output end of the image upsampling system is connected with a signal output end of one muxer layer.
  • CN first convolutional network
  • ML muxer layer
  • a signal input end of a muxer layer is connected with a signal output end of one first convolutional network or connected with a signal output end of another muxer layer.
  • the first convolutional network is configured for converting an image input to its signal input end into a plurality of feature images and outputting to the signal input end of the muxer layer.
  • the muxer layer is configured for synthesizing every n ⁇ n feature images in the feature images that are input to its signal input end into a feature image whose resolution is n ⁇ n times the resolution of the input feature images, and outputting the synthesized feature image; and the number of feature images input to the muxer layer is a multiple of n ⁇ n, where n is an integer greater than one.
  • image upsampling processing is performed with a muxer layer to synthesize every n ⁇ n feature images input to the muxer layer into one feature image with the resolution amplified by n ⁇ n times, and in the upsampling procedure with the muxer layer, the information of each input feature image is still recorded in the resultant feature image without loss; and thus, every time when the image passes through a muxer layer with an upsampling multiple (factor) of n, the image resolution can be increased by n ⁇ n times.
  • the muxer layers can respectively perform an upsampling function of an individual multiple, so that the system can flexibly adjust the upsampling multiple(s) according to needs and realize a universal image upsampling system with respect to different upsampling multiples. Further, because the number of feature images output by each muxer layer is reduced while the muxer layer amplifies the resolution of the feature image by n ⁇ n times, the signals amount input to the muxer layer or first convolutional network in a next cascaded stage can be reduced so as to alleviate the computation amount of upsampling.
  • the system comprises a plurality of muxer layers with an amplifying multiple of n times, then after the image is upsampled by the system, the image resolution can be improved by (n ⁇ n) ⁇ (n ⁇ n) times.
  • the system comprises two muxer layers with an upsampling multiple of 2 ⁇ (two times), after the image passes through the two muxer layers, the resolution is improved by 4 ⁇ 4 times; and if the system comprises three muxer layers with an upsampling multiple of 2 ⁇ , after the image passes through the three muxer layers, the resolution is improved by 4 ⁇ 4 ⁇ 4 times.
  • one muxer layer may be provided as illustrated in FIG. 2 a
  • two muxer layers may be provided as illustrated in FIG. 2 b and FIG. 2 c
  • three muxer layers may be provided as illustrated in FIG. 2 d.
  • the above-described image upsampling system provided by an embodiment of the present disclosure comprises one first convolutional network and one muxer layer which can upsample feature images by times of 2 ⁇ , 3 ⁇ or 5 ⁇ .
  • the above-described image upsampling system provided by the embodiment of the present disclosure as illustrated in FIG. 2 b and FIG. 2 c , may comprise two muxer layers which can upsample by 2 ⁇ times.
  • the above-described image upsampling system provided by the embodiment of the present disclosure may comprise three muxer layers which can upsample by 2 ⁇ times. In this way, the greater the required multiple, the larger the number of muxer layers correspondingly, and the greater the data computation amount performed by the corresponding system.
  • two or three muxer layers are generally provided, to perform two or three processes of upsampling.
  • the signal input end of the muxer layer is connected with the signal output end of the first convolutional network, and thus firstly, the feature images are obtained by the first convolutional network, then the obtained feature images are input to the signal input end of the corresponding muxer layer, that is, in the image upsampling system, the first convolutional network and the muxer layer are arranged in pair(s).
  • the upsampling multiples of the muxer layers may be the same or different from each other.
  • the upsampling multiple n of the respective muxer layer is generally elected to be the same.
  • the smaller the upsampling multiple of each muxer layer the smaller the computation amount, and the better the upsampling effect.
  • the upsampling multiple n of each muxer layer is set to be 2.
  • the above-described image upsampling system provided by an embodiment of the present disclosure, as illustrated in FIG. 2 e , further comprises: a second convolutional network, whose signal input end is connected with the signal output end of an muxer layer, and whose signal output end is connected with the signal output end of the image upsampling system; the second convolutional network is configured for optimizing the picture quality of the feature image output by a muxer layer.
  • the second convolutional network can be configured for enhancing the picture quality of output picture according to actual needs, to improve quality of the output image.
  • both the first convolutional network and the second convolutional network may include at least one convolution layer composed of a plurality of filter units.
  • the number of convolution layers included in the first convolutional network and the second convolutional network may be set according to needs. And the number of filter units included in each convolution layer may be the same or different.
  • the number of convolution layers in each convolutional network is generally set to be no more than 10.
  • the first convolutional network at the first stage is connected with the signal input end of the image upsampling system and is composed of four convolution layers, each convolution layer includes 128 filter units, and each filter unit is composed of 3 ⁇ 3 filters; the filter at position [1,1] is set as the center pixel.
  • 128 feature images are generated and output to a next convolution layer, until the last convolution layer outputs 128 feature images to the muxer layer of a next (second) stage.
  • the muxer layer of the second stage After the muxer layer of the second stage receives the 128 feature images output by the first convolutional network of the first stage, it synthesizes every four input feature images into one feature image of the pixel resolution 4 times as larger as the input feature images (2 ⁇ upsampling), i.e., after the 128 input feature images pass through the muxer layer of the second stage, 32 feature images are output to the first convolutional network of the next (third) stage.
  • the first convolutional network of the third stage is composed of four convolution layers, each convolution layer includes 32 filter units, and each filter unit is composed of 3 ⁇ 3 filters; the filter at position [1,1] is set as the center pixel.
  • the muxer layer of the fourth stage After the muxer layer of the fourth stage receives the 32 feature images output by the first convolutional network of the third stage, it synthesizes every 4 input feature images into one feature image of pixel resolution of 4 times as larger as the input feature images (2 ⁇ upsampling), i.e., after the 32 input feature images pass through the fourth stage of muxer layer, 8 feature images are output to the second convolutional network of the next (fifth) stage.
  • the second convolutional network of the fifth stage is composed of four convolution layers, the first two convolution layers include 8 filter units, the third convolution layer includes 4 filter units, the fourth convolution layer includes 1 filter unit, and each filter unit is composed of 3 ⁇ 3 filters; the filter at position [1,1] is set as the center pixel.
  • 8 feature images are generated and output to the second convolution layer, after they pass through the second convolution layer, 8 feature images are generated and input to the third convolution layer, after they pass through the third convolution layer, 4 feature images are input to the fourth convolution layer, and finally, the fourth convolution layer outputs one feature image to the output end of the image upsampling system.
  • each muxer layer is substantially equivalent to a self-adaptive interpolation filter, and as illustrated in FIG. 3 , pixel values of every four feature images in the input feature image are combined in a staggered manner, to generate a feature image of a quadrupling pixel.
  • a working principle of the muxer layer is arranging pixel values of respective identical pixel-point positions in 4 input feature image in a matrix in an output feature image, so that in the upsampling procedure, no pixel information in the feature image will be modified (lost or added).
  • either the first convolutional network or the second convolutional network can be a neural network structure which uses images as input and output, each neural network structure includes a plurality of convolution layers, and each convolution layer is composed of a plurality of filters.
  • each neural network structure includes a plurality of convolution layers
  • each convolution layer is composed of a plurality of filters.
  • each box marked with a scalar weight W ij k is equivalent to a filter (for example, a filter having 3 ⁇ 3 or 5 ⁇ 5 core), and a bias b j k represents picture increment added to convolution output; k represents the serial number of the convolution layer, and i and j represent an input serial image number and an output image serial number, respectively.
  • an embodiment of the present disclosure further provides a training method of the above-described image upsampling system, as illustrated in FIG. 5 , comprising the following operations:
  • step S 501 with respect to the initializing respective parameters in the image upsampling system, a traditional initializing mode may be used, and weights Wij of respective filter units of the respective convolution layers of all the convolutional networks are set to be a small random numbers, and all the biases are initialized to be 0.
  • the traditional initializing way will not invoke any problem when applied to small-multiple (such as 2 ⁇ ) upsampling, but may invoke some problems when applied to high-multiple (such as 4 ⁇ ) upsampling in connection with several convolutional networks, and thus, the above-described training method provided by an embodiment of the present disclosure further provides two new ways with respect to the initializing respective parameters in the image upsampling system, specifically as follows:
  • W ij ⁇ 1 / ( m ) ( i , j ) ⁇ ⁇ are ⁇ ⁇ preset ⁇ ⁇ anchor ⁇ ⁇ pixel 0 other ⁇ ⁇ pixel ⁇
  • m represents the number of feature images input to the filter unit.
  • m represents the number of feature images input to the filter unit; and uniform ( ⁇ 1,1) represents a random number selected between ( ⁇ 1,1).
  • a small uniformly distributed noise value is added to the weights W ij of respective filter units, which facilitates the image upsampling system to have an ability of identifying noise after training.
  • an embodiment of the present disclosure further provides a method for performing image upsampling with the above-described image upsampling system; because the principle for the method to solve the problem is similar to that of an image upsampling system as described above, for implementation of the method, the implementation of the system may be referred to, and will not be repeated here.
  • An embodiment of the present disclosure provides a method for performing image upsampling with an image upsampling system, comprising: converting, by a first convolutional network, an input image input to the first convolutional network into a plurality of feature images having a specific feature and outputting the same; and synthesizing, by a muxer layer, every n ⁇ n feature images in the feature images input to the muxer layer into a feature image with the resolution amplified by n ⁇ n times as larger as the resolution of the input feature images, and outputting the same, wherein the number of feature images input to the muxer layer is a multiple of n ⁇ n, and n is an integer greater than 1.
  • each muxer layer after receiving the feature images, will perform upsampling processing on the feature images, and output the result to a next muxer layer, the next muxer layer performs upsampling processing on the received feature image(s), until the last muxer layer outputs a final up-sampled image.
  • the image upsampling system as described in the embodiment of the present disclosure may be implemented by a set of central processing units (CPU), and may also be implemented by a set of graphics processing unit (GPU), or may also be implemented by a field programmable gate array (FPGA).
  • CPU central processing units
  • GPU graphics processing unit
  • FPGA field programmable gate array
  • an embodiment of the present disclosure further provides a display device, comprising the above-described image upsampling system provided by the embodiment of the present disclosure;
  • the display device can be: a mobile phone, a tablet computer, a television, a monitor, a laptop computer, a digital photo frame, a wearable device, a navigator and any products or parts with a display function.
  • the embodiment of the above-described image upsampling system may be referred to, and will not be repeated here.
  • the feature images of an image are obtained by using the convolutional network, upsampling processing is performed on the images with the muxer layer to synthesize every n ⁇ n feature images in the input signal into a feature image with the resolution amplified by n ⁇ n times, in the upsampling procedure with the muxer layer, information of respective feature images in the input signal is recorded in the generated feature image(s) without loss; and thus, every time when the image passes through a muxer layer with an upsampling multiple of n, the image resolution can be increased by n ⁇ n times.
  • each muxer layer can execute an upsampling function of a single multiple so that the system may flexibly adjust the upsampling multiple according to needs, and implement a universal upsampling system for different upsampling multiples.
  • the signal amount input to the muxer layer or first convolutional network of a next cascaded stage can be reduced so as to simplify the computation amount of upsampling.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Evolutionary Computation (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Biophysics (AREA)
  • Biomedical Technology (AREA)
  • Health & Medical Sciences (AREA)
  • Software Systems (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Image Processing (AREA)

Abstract

An image upsampling system, a training method thereof and an image upsampling method are provided, the feature images of an image are obtained by using the convolutional network, upsampling processing is performed on the images with the muxer layer to synthesize every n×n feature images in the input signal into a feature image with the resolution amplified by n×n times, in the upsampling procedure with the muxer layer, information of respective feature images in the input signal is recorded in the generated feature image(s) without loss; and thus, every time when the image passes through a muxer layer with an upsampling multiple of n, the image resolution can be increased by n×n times.

Description

    TECHNICAL FIELD
  • The present disclosure relates to an image signal processing technology, and more particularly, to an image upsampling system, a training method thereof, and an image upsampling method.
  • BACKGROUND
  • At present, in an image signal processing procedure, generally, the resolution of an image can be raised by means of the standard methods such as a bicubic upsampling process and a linear upsampling process (for enlarging image resolution). As illustrated in FIG. 1, a 2× upsampling mode is shown, four different filters: filter F1, filter F2, filter F3 and filter F4, are applied to all the pixels (plus adjacent pixels) of an input image, each filter generates a quarter of the pixels of the output image, and the process may be understood as applying four filters (convolution) to the input image, and then interleaving or multiplexing to create a single output image that doubles the width and the height of the image.
  • However, data computation amount of the current image upsampling system is relatively large, and the upsampling multiple (times) cannot be flexibly adjusted.
  • SUMMARY
  • In view of the above, embodiments of the present disclosure provide an image upsampling system, a training method thereof and an image upsampling method, for implementing high-quality upsampling of image resolution based on a convolutional neural network, to reduce a computation amount of upsampling, and to improve flexibility of adjusting the upsampling multiple.
  • An aspect of the present disclosure provides an image upsampling system, comprising: at least one first convolutional network and at least one muxer layer that are cascaded; an signal input end of the image upsampling system is connected with a signal input end of a first convolutional network in the at least one first convolutional network, and a signal output end of the image upsampling system is connected with a signal output end of a last muxer layer in the at least one muxer layer; a signal input end of every muxer layer in the at least one muxer layer is connected with a signal output end of a first convolutional network located in a stage prior to the muxer layer in the at least one first convolutional network, or connected with a signal output end of another muxer layer located in a stage prior to the muxer layer in the at least one muxer layer; the first convolutional network is configured for converting an image input to its signal input end into a plurality of feature images and outputting the feature images to the signal input end of the muxer layer connected therewith; the muxer layer is configured for synthesizing every n×n feature images in the feature images input to its signal input end into a feature image whose resolution is n×n times that of the input feature image and outputting the same; and a number of feature images input to the muxer layer is a multiple of n×n, n being an integer greater than one.
  • According to an embodiment of the present disclosure, a number of muxer layers is two or three.
  • According to an embodiment of the present disclosure, a signal input end of each muxer layer is respectively connected with a signal output end of one corresponding first convolutional network in the at least one first convolutional network.
  • According to an embodiment of the present disclosure, in the case where there are provided a plurality of muxer layers, the muxer layers have a same upsampling multiple.
  • According to an embodiment of the present disclosure, the muxer layer has an upsampling multiple which is a prime number.
  • According to an embodiment of the present disclosure, the muxer layer has an upsampling multiple which is 2.
  • According to an embodiment of the present disclosure, the muxer layer is a self-adaptive interpolation filter.
  • According to an embodiment of the present disclosure, the image upsampling system further comprises: a second convolutional network, whose signal input end is connected with a signal output end of the last muxer layer in the at least one muxer layer, and whose signal output end is connected with the signal output end of the image upsampling system; the second convolutional network is configured for optimizing picture quality of the feature images output by the muxer layer.
  • According to an embodiment of the present disclosure, the first convolutional network and the second convolutional network include at least one convolution layer composed of a plurality of filter units.
  • Another aspect of the present disclosure further provides a display device, comprising any one of the above image upsampling systems.
  • Still another aspect of the present disclosure further provides a training method of any of one the image upsampling systems, comprising: initializing respective parameters in the image upsampling system; by using an original image signal as an output signal of the image upsampling system and using an image signal obtained by down-sampling the original image signal as an input signal of the image upsampling system, adjusting the respective parameters in the image upsampling system to allow the down-sampled image signal subjected to upsampling processing with the adjusted respective parameters to be the same as the original image signal.
  • According to an embodiment of the present disclosure, in the above training method, initializing of the respective parameters in the image upsampling system includes: initializing weights Wij of respective filter units of respective convolution layers of the first convolutional network and the second convolutional network in the image upsampling system according to a formula below:
  • W ij = { 1 / ( m ) ( i , j ) are preset anchor pixel 0 other pixel }
  • where m represents the number of feature images input to the filter unit; and initializing the biases of respective filter units to 0.
  • According to an embodiment of the present disclosure, in the above training method, initializing of the respective parameters in the image upsampling system includes: initializing the weights Wij of respective filter units of respective convolution layers of the first convolutional network and the second convolutional network in the image upsampling system according to a formula below:
  • W ij = W ij + uniform ( - 1 , 1 ) m W ij = { 1 / ( m ) ( i , j ) are preset anchor pixel 0 other pixel }
  • where m represents the number of feature images input to the filter unit, and uniform (−1,1) represents a random number selected between (−1,1); and initializing the biases of respective filter units to 0.
  • Further still another aspect of the present disclosure further provides a, method for performing image upsampling with any one of the above image upsampling systems, comprising: converting, by a first convolutional network, an input image input to the first convolutional network into a plurality of feature images and outputting the feature images; synthesizing, by a muxer layer, every n×n feature images in the feature images input to the muxer layer into a feature image with a resolution amplified by n×n times as larger as the input feature images, and outputting the same; the number of feature images input to the muxer layer being a multiple of n×n, and n being an integer greater than 1.
  • According to the embodiments of the present disclosure, in the image upsampling system, the training method thereof and the image upsampling method, a feature images of an image are obtained by a convolutional network, upsampling processing is performed on the image with a muxer layer to synthesize every n×n feature images in an input signal into a feature image with the resolution amplified by n×n times; in the upsampling procedure with the muxer layer, information of respective input feature images is recorded in the generated feature image without loss; and thus, every time when the image passes through a muxer layer which can upsample by a multiple of n×, the image resolution can be increased by n×n times. In addition, in the image upsampling system, more than one muxer layers can be provided for successive processes of upsampling, each muxer layer can execute an upsampling function of a single multiple, so that the system can flexibly adjust the upsampling multiple(s) according to needs, implement a universal upsampling system with respect to different upsampling multiples. Further, because the number of feature images output by each muxer layer is reduced while the muxer layer amplifies the resolution of the feature image(s) by n×n times, an signal amount input to a muxer layer or a first convolutional network of a next cascaded stage can be reduced so as to simplify the computation amount of upsampling.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • FIG. 1 is a schematic diagram of a 2× upsampling in the state of art;
  • FIG. 2a to FIG. 2e are respectively structural schematic diagrams of an image upsampling system provided by an embodiment of the present disclosure;
  • FIG. 3 is a schematic diagram of upsampling by a muxer layer in an image upsampling system provided by an embodiment of the present disclosure;
  • FIG. 4 is a structural schematic diagram of a convolutional network in an image upsampling system provided by an embodiment of the present disclosure; and
  • FIG. 5 is a schematic flow chart of a training method of an image upsampling system provided by an embodiment of the present disclosure.
  • DETAILED DESCRIPTION
  • A convolutional neural network is a type of artificial neural network, and has become a research hotspot in the fields such as speech analysis and image recognition at present. The weight sharing network structure of a convolutional neural network renders the convolutional neural network more similar to a biological neural network, and it can reduce the complexity of a network model and reduces the number of weights. This advantage is more obvious when the input of the network is a multi-dimensional image, and the image can be directly used as the network input, which avoids the complicated process such as feature extraction and data reconstruction in a traditional identification algorithm. The convolutional neural network is a multi-layer sensor specially designed to identify a two-dimensional shape. This type of network structure can be applied to translation, scaling, tilting, or other forms of transformation.
  • Based on a convolutional neural network, an embodiment of the present disclosure provides an image upsampling system, a method for training the image upsampling system, and a method for performing upsampling on an input image according to the trained image upsampling system. The image upsampling system performs upsampling by using the convolutional neural network, and, under the premise of ensuring substantially no loss of image information, can effectively converts a low-resolution image into a high-resolution image.
  • Hereinafter, specific implementing modes of the image upsampling system, the training method thereof and the image upsampling method provided by the embodiments of the present disclosure will be described in detail in connection with the drawings.
  • An image upsampling system provided by an embodiment of the present disclosure, as illustrated in FIG. 2a to FIG. 2d , comprises: at least one first convolutional network (CN) and at least one muxer layer (ML) cascaded; an signal input end of the image upsampling system is connected with a signal input end of one first convolutional network, and a signal output end of the image upsampling system is connected with a signal output end of one muxer layer.
  • A signal input end of a muxer layer is connected with a signal output end of one first convolutional network or connected with a signal output end of another muxer layer.
  • The first convolutional network is configured for converting an image input to its signal input end into a plurality of feature images and outputting to the signal input end of the muxer layer. The muxer layer is configured for synthesizing every n×n feature images in the feature images that are input to its signal input end into a feature image whose resolution is n×n times the resolution of the input feature images, and outputting the synthesized feature image; and the number of feature images input to the muxer layer is a multiple of n×n, where n is an integer greater than one.
  • In the above-described image upsampling system provided by the embodiment of the present disclosure, image upsampling processing is performed with a muxer layer to synthesize every n×n feature images input to the muxer layer into one feature image with the resolution amplified by n×n times, and in the upsampling procedure with the muxer layer, the information of each input feature image is still recorded in the resultant feature image without loss; and thus, every time when the image passes through a muxer layer with an upsampling multiple (factor) of n, the image resolution can be increased by n×n times. In addition, in the image upsampling system, more than one muxer layers may be provided for successively upsampling, the muxer layers can respectively perform an upsampling function of an individual multiple, so that the system can flexibly adjust the upsampling multiple(s) according to needs and realize a universal image upsampling system with respect to different upsampling multiples. Further, because the number of feature images output by each muxer layer is reduced while the muxer layer amplifies the resolution of the feature image by n×n times, the signals amount input to the muxer layer or first convolutional network in a next cascaded stage can be reduced so as to alleviate the computation amount of upsampling.
  • It should be noted that, if the system comprises a plurality of muxer layers with an amplifying multiple of n times, then after the image is upsampled by the system, the image resolution can be improved by (n×n)×(n×n) times.
  • For example, if the system comprises two muxer layers with an upsampling multiple of 2× (two times), after the image passes through the two muxer layers, the resolution is improved by 4×4 times; and if the system comprises three muxer layers with an upsampling multiple of 2×, after the image passes through the three muxer layers, the resolution is improved by 4×4×4 times.
  • When specifically implemented, according to the required upsampling multiple, there may be provided a variety of specific implementing modes of the above-described image upsampling system provided by the embodiment of the present disclosure. For example, according to the required upsampling multiple, in the above-described image upsampling system provided by the embodiment of the present disclosure, one muxer layer may be provided as illustrated in FIG. 2a , or two muxer layers may be provided as illustrated in FIG. 2b and FIG. 2c , or three muxer layers may be provided as illustrated in FIG. 2 d.
  • Specifically, generally, when an upsampling process by an upsampling multiple such as 2×, 3× or 5× is needed, the above-described image upsampling system provided by an embodiment of the present disclosure, as illustrated in FIG. 2a , comprises one first convolutional network and one muxer layer which can upsample feature images by times of 2×, 3× or 5×. When an upsampling multiple of 4× is needed, the above-described image upsampling system provided by the embodiment of the present disclosure, as illustrated in FIG. 2b and FIG. 2c , may comprise two muxer layers which can upsample by 2× times. When an upsampling multiple of 8× is needed, the above-described image upsampling system provided by the embodiment of the present disclosure, as illustrated in FIG. 2d , may comprise three muxer layers which can upsample by 2× times. In this way, the greater the required multiple, the larger the number of muxer layers correspondingly, and the greater the data computation amount performed by the corresponding system. Thus, preferably, in the above-described image upsampling system provided by the embodiment of the present disclosure, two or three muxer layers are generally provided, to perform two or three processes of upsampling.
  • Further, when a plurality of muxer layers are provided in the above-described image upsampling system provided by the embodiment of the present disclosure, in order that when each muxer layer performs upsampling, high-quality feature images can be synthesized into a high-quality feature image whose resolution is n×n times the resolution of the input feature images, as illustrated in FIG. 2c and FIG. 2d , generally, the signal input end of the muxer layer is connected with the signal output end of the first convolutional network, and thus firstly, the feature images are obtained by the first convolutional network, then the obtained feature images are input to the signal input end of the corresponding muxer layer, that is, in the image upsampling system, the first convolutional network and the muxer layer are arranged in pair(s).
  • Further, in the case where a plurality of muxer layers are provided in the above-described image upsampling system provided by an embodiment of the present disclosure, the upsampling multiples of the muxer layers may be the same or different from each other. Usually, when there are provided a plurality of muxer layers, the upsampling multiple n of the respective muxer layer is generally elected to be the same. Moreover, the smaller the upsampling multiple of each muxer layer, the smaller the computation amount, and the better the upsampling effect. Thus, when the required upsampling multiple is larger, generally upsampling is performed many times, and the upsampling multiple n of each muxer layer is generally elected to be a prime number such as 2, 3, 5 or 7. Preferably, the upsampling multiple n of each muxer layer is set to be 2.
  • Further, the above-described image upsampling system provided by an embodiment of the present disclosure, as illustrated in FIG. 2e , further comprises: a second convolutional network, whose signal input end is connected with the signal output end of an muxer layer, and whose signal output end is connected with the signal output end of the image upsampling system; the second convolutional network is configured for optimizing the picture quality of the feature image output by a muxer layer. Before the muxer layer of the last stage outputs a finally upsampled feature image, the second convolutional network can be configured for enhancing the picture quality of output picture according to actual needs, to improve quality of the output image.
  • Specifically, in the above-described image upsampling system provided by an embodiment of the present disclosure, both the first convolutional network and the second convolutional network may include at least one convolution layer composed of a plurality of filter units. The number of convolution layers included in the first convolutional network and the second convolutional network may be set according to needs. And the number of filter units included in each convolution layer may be the same or different. Generally, in order to facilitate the system to optimize its parameter(s), the number of convolution layers in each convolutional network is generally set to be no more than 10.
  • Hereinafter, the above-described image upsampling system provided by an embodiment of the present disclosure will be described below with reference to the structure shown in FIG. 2e , and with a case where two 2× muxer layers are configured for performing 4× upsampling as an example.
  • Specifically, the first convolutional network at the first stage is connected with the signal input end of the image upsampling system and is composed of four convolution layers, each convolution layer includes 128 filter units, and each filter unit is composed of 3×3 filters; the filter at position [1,1] is set as the center pixel. After the input image passes through the first convolution layer, 128 feature images are generated and output to a next convolution layer, until the last convolution layer outputs 128 feature images to the muxer layer of a next (second) stage.
  • After the muxer layer of the second stage receives the 128 feature images output by the first convolutional network of the first stage, it synthesizes every four input feature images into one feature image of the pixel resolution 4 times as larger as the input feature images (2× upsampling), i.e., after the 128 input feature images pass through the muxer layer of the second stage, 32 feature images are output to the first convolutional network of the next (third) stage.
  • The first convolutional network of the third stage is composed of four convolution layers, each convolution layer includes 32 filter units, and each filter unit is composed of 3×3 filters; the filter at position [1,1] is set as the center pixel. After the 32 feature images input from the muxer layer of the second stage pass through the first convolution layer, 32 feature images are generated and output to a next convolution layer, until the last convolution layer outputs 32 feature images to the muxer layer of the next (fourth) stage.
  • After the muxer layer of the fourth stage receives the 32 feature images output by the first convolutional network of the third stage, it synthesizes every 4 input feature images into one feature image of pixel resolution of 4 times as larger as the input feature images (2× upsampling), i.e., after the 32 input feature images pass through the fourth stage of muxer layer, 8 feature images are output to the second convolutional network of the next (fifth) stage.
  • The second convolutional network of the fifth stage is composed of four convolution layers, the first two convolution layers include 8 filter units, the third convolution layer includes 4 filter units, the fourth convolution layer includes 1 filter unit, and each filter unit is composed of 3×3 filters; the filter at position [1,1] is set as the center pixel. After the image of the input signal passes through the first convolution layer, 8 feature images are generated and output to the second convolution layer, after they pass through the second convolution layer, 8 feature images are generated and input to the third convolution layer, after they pass through the third convolution layer, 4 feature images are input to the fourth convolution layer, and finally, the fourth convolution layer outputs one feature image to the output end of the image upsampling system.
  • In the above-described procedure, each muxer layer is substantially equivalent to a self-adaptive interpolation filter, and as illustrated in FIG. 3, pixel values of every four feature images in the input feature image are combined in a staggered manner, to generate a feature image of a quadrupling pixel. As illustrated in FIG. 3, a working principle of the muxer layer is arranging pixel values of respective identical pixel-point positions in 4 input feature image in a matrix in an output feature image, so that in the upsampling procedure, no pixel information in the feature image will be modified (lost or added).
  • In the above-described procedure, either the first convolutional network or the second convolutional network can be a neural network structure which uses images as input and output, each neural network structure includes a plurality of convolution layers, and each convolution layer is composed of a plurality of filters. Hereinafter, the working principle will be briefly introduced by means of a neural network structure of two convolution layers in FIG. 4 as an example.
  • On the left side of FIG. 4, there are four input images, which generate three feature images after passing through respective filters of the first convolution layer, generate two feature images after passing through respective filters of the second convolution layer and then are output. Therein, each box marked with a scalar weight Wij k is equivalent to a filter (for example, a filter having 3×3 or 5×5 core), and a bias bj k represents picture increment added to convolution output; k represents the serial number of the convolution layer, and i and j represent an input serial image number and an output image serial number, respectively.
  • In the process of a system operation, the numerical values of the scalar weight Wij k and the bias bj k are relatively fixed, and before the system operation, it is necessary to train the system by using a series of standard input and output images, and depend on an application to adjust to adapt to some optimization criteria. Therefore, before operation of the above-described image upsampling system provided by an embodiment of the present disclosure, a series of training needs to be performed; and based on a same inventive concept, an embodiment of the present disclosure further provides a training method of the above-described image upsampling system, as illustrated in FIG. 5, comprising the following operations:
  • S501: initializing respective parameters in the image upsampling system; because a muxer layer does not introduce any parameter, the respective parameters in the image upsampling system are actually the parameters of all convolutional networks.
  • S502: using an original image signal as an output signal of the image upsampling system, using an image signal obtained by down-sampling the original image signal as an input signal of the image upsampling system, and adjusting respective parameters in the image upsampling system to allow the result from the down-sampled image signal subjected to upsampling processing with the adjusted respective parameters to be the same as the original image signal. Thereafter, the adjusted respective parameters are used as the upsampling parameter of the upsampling system, to up-sample the low-resolution image.
  • In step S501, with respect to the initializing respective parameters in the image upsampling system, a traditional initializing mode may be used, and weights Wij of respective filter units of the respective convolution layers of all the convolutional networks are set to be a small random numbers, and all the biases are initialized to be 0. The traditional initializing way will not invoke any problem when applied to small-multiple (such as 2×) upsampling, but may invoke some problems when applied to high-multiple (such as 4×) upsampling in connection with several convolutional networks, and thus, the above-described training method provided by an embodiment of the present disclosure further provides two new ways with respect to the initializing respective parameters in the image upsampling system, specifically as follows:
  • First way: initializing biases of respective filter units to be zero (0); and initializing weights Wij of respective filter units of respective convolution layers of the first convolutional network and the second convolutional network in the image upsampling system according to a formula below:
  • W ij = { 1 / ( m ) ( i , j ) are preset anchor pixel 0 other pixel }
  • where m represents the number of feature images input to the filter unit.
  • Second way: initializing the biases of respective filter units to be 0; and initializing the weights Wij of respective filter units of respective convolution layers of the first convolutional network and the second convolutional network in the image upsampling system according to a formula below:
  • W ij = W ij + uniform ( - 1 , 1 ) m ; W ij = { 1 / ( m ) ( i , j ) are preset anchor pixel 0 other pixel }
  • where m represents the number of feature images input to the filter unit; and uniform (−1,1) represents a random number selected between (−1,1).
  • In the second initializing way, as compared with the first initializing way, a small uniformly distributed noise value is added to the weights Wij of respective filter units, which facilitates the image upsampling system to have an ability of identifying noise after training.
  • Based on the same inventive concept, an embodiment of the present disclosure further provides a method for performing image upsampling with the above-described image upsampling system; because the principle for the method to solve the problem is similar to that of an image upsampling system as described above, for implementation of the method, the implementation of the system may be referred to, and will not be repeated here.
  • An embodiment of the present disclosure provides a method for performing image upsampling with an image upsampling system, comprising: converting, by a first convolutional network, an input image input to the first convolutional network into a plurality of feature images having a specific feature and outputting the same; and synthesizing, by a muxer layer, every n×n feature images in the feature images input to the muxer layer into a feature image with the resolution amplified by n×n times as larger as the resolution of the input feature images, and outputting the same, wherein the number of feature images input to the muxer layer is a multiple of n×n, and n is an integer greater than 1.
  • Specifically, in the case where there are provided a plurality of muxer layers in the system, each muxer layer, after receiving the feature images, will perform upsampling processing on the feature images, and output the result to a next muxer layer, the next muxer layer performs upsampling processing on the received feature image(s), until the last muxer layer outputs a final up-sampled image.
  • The image upsampling system as described in the embodiment of the present disclosure may be implemented by a set of central processing units (CPU), and may also be implemented by a set of graphics processing unit (GPU), or may also be implemented by a field programmable gate array (FPGA).
  • Based on a same inventive concept, an embodiment of the present disclosure further provides a display device, comprising the above-described image upsampling system provided by the embodiment of the present disclosure; the display device can be: a mobile phone, a tablet computer, a television, a monitor, a laptop computer, a digital photo frame, a wearable device, a navigator and any products or parts with a display function. For implementation of the display device, the embodiment of the above-described image upsampling system may be referred to, and will not be repeated here.
  • In the image upsampling system, the training method thereof and the image upsampling method provided by some embodiments of the present disclosure, the feature images of an image are obtained by using the convolutional network, upsampling processing is performed on the images with the muxer layer to synthesize every n×n feature images in the input signal into a feature image with the resolution amplified by n×n times, in the upsampling procedure with the muxer layer, information of respective feature images in the input signal is recorded in the generated feature image(s) without loss; and thus, every time when the image passes through a muxer layer with an upsampling multiple of n, the image resolution can be increased by n×n times. In addition, in the image upsampling system, more than one muxer layer may be provided for successive processes of upsampling, each muxer layer can execute an upsampling function of a single multiple so that the system may flexibly adjust the upsampling multiple according to needs, and implement a universal upsampling system for different upsampling multiples. Further, because the number of feature images output by each muxer layer is reduced while the muxer layer amplifies the resolution of the feature images by n×n times, the signal amount input to the muxer layer or first convolutional network of a next cascaded stage can be reduced so as to simplify the computation amount of upsampling.
  • It is evident that one person skilled in the art can make various changes or modifications to the present invention without departure from the spirit and scope of the invention. Thus, if such changes and modifications to the present disclosure are within the scope of the claims of the present disclosure and equivalent thereof, the present disclosure also intends to include all such changes and modifications within its scope.
  • This application claims priority of Chinese Patent Application No. 201510595656.7 filed on Sep. 17, 2015, entitled “IMAGE UPSAMPLING SYSTEM, TRAINING METHOD THEREOF AND IMAGE UPSAMPLING METHOD”, the disclosure of which is incorporated herein by reference in its entirety as part of the present application.

Claims (20)

1: An image upsampling system, comprising:
at least one first convolutional network and at least one muxer layer that are cascaded;
wherein an signal input end of the image upsampling system is connected with a signal input end of a first convolutional network in the at least one first convolutional network, and a signal output end of the image upsampling system is connected with a signal output end of a last muxer layer in the at least one muxer layer;
a signal input end of every muxer layer in the at least one muxer layer is connected with a signal output end of a first convolutional network located in a stage prior to the muxer layer in the at least one first convolutional network, or connected with a signal output end of another muxer layer located in a stage prior to the muxer layer in the at least one muxer layer;
the first convolutional network is configured for converting an image input to its signal input end into a plurality of feature images and outputting the feature images to the signal input end of the muxer layer connected therewith;
the muxer layer is configured for synthesizing every n×n feature images in the feature images input to its signal input end into a feature image whose resolution is n×n times that of the input feature image and outputting the same; and a number of feature images input to the muxer layer is a multiple of n×n, n being an integer greater than one.
2: The image upsampling system according to claim 1, wherein a number of muxer layers is two or three.
3: The image upsampling system according to claim 1, wherein, a signal input end of each muxer layer is respectively connected with a signal output end of one corresponding first convolutional network in the at least one first convolutional network.
4: The image upsampling system according to claim 1, wherein, in the case where there are provided a plurality of muxer layers, the muxer layers have a same upsampling multiple.
5: The image upsampling system according to claim 1, wherein, the muxer layer has an upsampling multiple which is a prime number.
6: The image upsampling system according to claim 5, wherein, the muxer layer has an upsampling multiple which is 2.
7: The image upsampling system according to claim 1, wherein, the muxer layer is a self-adaptive interpolation filter.
8: The image upsampling system according to claim 1, further comprising:
a second convolutional network, whose signal input end is connected with a signal output end of the last muxer layer in the at least one muxer layer, and whose signal output end is connected with the signal output end of the image upsampling system;
wherein the second convolutional network is configured for optimizing picture quality of the feature images output by the muxer layer.
9: The image upsampling system according to claim 8, wherein, the first convolutional network and the second convolutional network include at least one convolution layer composed of a plurality of filter units.
10: A display device, comprising the image upsampling system according to claim 1.
11: A training method of the image upsampling system according to claim 1, comprising:
initializing respective parameters in the image upsampling system;
by using an original image signal as an output signal of the image upsampling system and using an image signal obtained by down-sampling the original image signal as an input signal of the image upsampling system, adjusting the respective parameters in the image upsampling system to allow the down-sampled image signal subjected to upsampling processing with the adjusted respective parameters to be the same as the original image signal.
12: The training method according to claim 11, wherein, initializing of the respective parameters in the image upsampling system includes:
initializing weights Wij of respective filter units of respective convolution layers of the first convolutional network and the second convolutional network in the image upsampling system according to a formula below:
W ij = { 1 / ( m ) ( i , j ) are preset anchor pixel 0 other pixel }
where m represents the number of feature images input to the filter unit; and
initializing the biases of respective filter units to 0.
13: The training method according to claim 11, wherein, initializing of respective parameters in the image upsampling system includes:
initializing the weights Wij of respective filter units of respective convolution layers of the first convolutional network and the second convolutional network in the image upsampling system according to a formula below:
W ij = W ij + uniform ( - 1 , 1 ) m ; W ij = { 1 / ( m ) ( i , j ) are preset anchor pixel 0 other pixel }
where m represents the number of feature images input to the filter unit, and uniform (−1,1) represents a random number selected between (−1,1); and
initializing the biases of respective filter units to 0.
14: A method for performing image upsampling with the image upsampling system according to claim 1, comprising:
converting, by a first convolutional network, an input image input to the first convolutional network into a plurality of feature images and outputting the feature images;
synthesizing, by a muxer layer, every n×n feature images in the feature images input to the muxer layer into a feature image with a resolution amplified by n×n times as larger as the input feature images, and outputting the same; the number of feature images input to the muxer layer being a multiple of n×n, and n being an integer greater than 1.
15: The image upsampling system according to claim 2, further comprising:
a second convolutional network, whose signal input end is connected with a signal output end of the last muxer layer in the at least one muxer layer, and whose signal output end is connected with the signal output end of the image upsampling system;
wherein the second convolutional network is configured for optimizing picture quality of the feature images output by the muxer layer.
16: The image upsampling system according to claim 15, wherein, the first convolutional network and the second convolutional network include at least one convolution layer composed of a plurality of filter units.
17: The image upsampling system according to claim 3, further comprising:
a second convolutional network, whose signal input end is connected with a signal output end of the last muxer layer in the at least one muxer layer, and whose signal output end is connected with the signal output end of the image upsampling system;
wherein the second convolutional network is configured for optimizing picture quality of the feature images output by the muxer layer.
18: The image upsampling system according to claim 17, wherein, the first convolutional network and the second convolutional network include at least one convolution layer composed of a plurality of filter units.
19: The image upsampling system according to claim 4, further comprising:
a second convolutional network, whose signal input end is connected with a signal output end of the last muxer layer in the at least one muxer layer, and whose signal output end is connected with the signal output end of the image upsampling system;
wherein the second convolutional network is configured for optimizing picture quality of the feature images output by the muxer layer.
20: The image upsampling system according to claim 19, wherein, the first convolutional network and the second convolutional network include at least one convolution layer composed of a plurality of filter units.
US15/518,404 2015-09-17 2016-03-02 Image upsampling system, training method thereof and image upsampling method Active US10019642B1 (en)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
CN201510595656 2015-09-17
CN201510595656.7A CN105120130B (en) 2015-09-17 2015-09-17 A kind of image raising frequency system, its training method and image raising frequency method
CN201510595656.7 2015-09-17
PCT/CN2016/075338 WO2017045374A1 (en) 2015-09-17 2016-03-02 Image up-conversion system, training method therefor and image up-conversion method

Publications (2)

Publication Number Publication Date
US10019642B1 US10019642B1 (en) 2018-07-10
US20180197037A1 true US20180197037A1 (en) 2018-07-12

Family

ID=54668007

Family Applications (1)

Application Number Title Priority Date Filing Date
US15/518,404 Active US10019642B1 (en) 2015-09-17 2016-03-02 Image upsampling system, training method thereof and image upsampling method

Country Status (3)

Country Link
US (1) US10019642B1 (en)
CN (1) CN105120130B (en)
WO (1) WO2017045374A1 (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2020062809A1 (en) * 2018-09-26 2020-04-02 Boe Technology Group Co., Ltd. Computer-implemented method, apparatus, and computer-program product
US11049217B2 (en) 2019-04-30 2021-06-29 Beijing Sensetime Technology Development Co., Ltd. Magnifying feature map
US11216910B2 (en) 2017-05-08 2022-01-04 Boe Technology Group Co., Ltd. Image processing system, image processing method and display device
US11321593B2 (en) 2017-02-23 2022-05-03 Beijing Sensetime Technology Development Co., Ltd Method and apparatus for detecting object, method and apparatus for training neural network, and electronic device
US11328184B2 (en) 2017-11-09 2022-05-10 Boe Technology Group Co., Ltd. Image classification and conversion method and device, image processor and training method therefor, and medium

Families Citing this family (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE202016009089U1 (en) 2015-02-19 2022-02-10 Magic Pony Technology Limited Online training of hierarchical algorithms
EP3278559B1 (en) 2015-03-31 2021-05-05 Magic Pony Technology Limited Training end-to-end video processes
GB201604345D0 (en) * 2016-03-14 2016-04-27 Magic Pony Technology Ltd Super resolution using fidelity transfer
GB201604672D0 (en) * 2016-03-18 2016-05-04 Magic Pony Technology Ltd Generative methods of super resolution
CN105120130B (en) 2015-09-17 2018-06-29 京东方科技集团股份有限公司 A kind of image raising frequency system, its training method and image raising frequency method
WO2017139927A1 (en) 2016-02-17 2017-08-24 Intel Corporation Region proposal for image regions that include objects of interest using feature maps from multiple layers of a convolutional neural network model
CN105611303B (en) * 2016-03-07 2019-04-09 京东方科技集团股份有限公司 Image compression system, decompression systems, training method and device, display device
CN105847968B (en) * 2016-03-21 2018-12-21 京东方科技集团股份有限公司 Based on the solution of deep learning as method and system
CN107231566B (en) * 2016-03-25 2020-12-18 阿里巴巴集团控股有限公司 Video transcoding method, device and system
CN109564636B (en) * 2016-05-31 2023-05-02 微软技术许可有限责任公司 Training one neural network using another neural network
RU2698997C1 (en) * 2016-09-06 2019-09-02 Электа, Инк. Neural network for generating synthetic medical images
CN108074215B (en) * 2016-11-09 2020-04-14 京东方科技集团股份有限公司 Image frequency-raising system, training method thereof, and image frequency-raising method
US10410322B2 (en) 2017-04-05 2019-09-10 Here Global B.V. Deep convolutional image up-sampling
US10489887B2 (en) * 2017-04-10 2019-11-26 Samsung Electronics Co., Ltd. System and method for deep learning image super resolution
CN107124609A (en) * 2017-04-27 2017-09-01 京东方科技集团股份有限公司 A kind of processing system of video image, its processing method and display device
JP7146372B2 (en) * 2017-06-21 2022-10-04 キヤノン株式会社 Image processing device, imaging device, image processing method, program, and storage medium
CN107610140A (en) * 2017-08-07 2018-01-19 中国科学院自动化研究所 Near edge detection method, device based on depth integration corrective networks
US10979718B2 (en) 2017-09-01 2021-04-13 Apple Inc. Machine learning video processing systems and methods
CN107729885B (en) * 2017-11-23 2020-12-29 中电科新型智慧城市研究院有限公司 Face enhancement method based on multiple residual error learning
CN109191382B (en) * 2018-10-18 2023-12-05 京东方科技集团股份有限公司 Image processing method, device, electronic equipment and computer readable storage medium
CN109903219B (en) * 2019-02-28 2023-06-30 深圳市商汤科技有限公司 Image processing method and device, electronic equipment and computer readable storage medium
CN110426560B (en) * 2019-06-28 2020-08-25 北京大学 Method for generating space-time upsampler of pulse array signal
CN110717405B (en) * 2019-09-17 2023-11-24 平安科技(深圳)有限公司 Face feature point positioning method, device, medium and electronic equipment
CN112788338B (en) * 2020-12-31 2022-08-26 展讯通信(天津)有限公司 Image compression and decompression method, equipment, device and storage medium

Family Cites Families (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4416288A (en) * 1980-08-14 1983-11-22 The Regents Of The University Of California Apparatus and method for reconstructing subsurface electrophysiological patterns
US5138695A (en) * 1989-10-10 1992-08-11 Hnc, Inc. Systolic array image processing system
US7028271B2 (en) * 2002-11-06 2006-04-11 Canon Kabushiki Kaisha Hierarchical processing apparatus
US7187811B2 (en) * 2003-03-18 2007-03-06 Advanced & Wise Technology Corp. Method for image resolution enhancement
JP4546157B2 (en) * 2004-06-03 2010-09-15 キヤノン株式会社 Information processing method, information processing apparatus, and imaging apparatus
CN101639937B (en) * 2009-09-03 2011-12-14 复旦大学 Super-resolution method based on artificial neural network
US9202144B2 (en) * 2013-10-30 2015-12-01 Nec Laboratories America, Inc. Regionlets with shift invariant neural patterns for object detection
CN104573731B (en) * 2015-02-06 2018-03-23 厦门大学 Fast target detection method based on convolutional neural networks
CN204948182U (en) * 2015-09-17 2016-01-06 京东方科技集团股份有限公司 A kind of image raising frequency system and display unit
CN105120130B (en) 2015-09-17 2018-06-29 京东方科技集团股份有限公司 A kind of image raising frequency system, its training method and image raising frequency method
US10726328B2 (en) * 2015-10-09 2020-07-28 Altera Corporation Method and apparatus for designing and implementing a convolution neural net accelerator
US11074492B2 (en) * 2015-10-07 2021-07-27 Altera Corporation Method and apparatus for performing different types of convolution operations with the same processing elements
US10614354B2 (en) * 2015-10-07 2020-04-07 Altera Corporation Method and apparatus for implementing layers on a convolutional neural network accelerator
US10997496B2 (en) * 2016-08-11 2021-05-04 Nvidia Corporation Sparse convolutional neural network accelerator
WO2018067603A1 (en) * 2016-10-04 2018-04-12 Magic Leap, Inc. Efficient data layouts for convolutional neural networks

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11321593B2 (en) 2017-02-23 2022-05-03 Beijing Sensetime Technology Development Co., Ltd Method and apparatus for detecting object, method and apparatus for training neural network, and electronic device
US11216910B2 (en) 2017-05-08 2022-01-04 Boe Technology Group Co., Ltd. Image processing system, image processing method and display device
US11328184B2 (en) 2017-11-09 2022-05-10 Boe Technology Group Co., Ltd. Image classification and conversion method and device, image processor and training method therefor, and medium
WO2020062809A1 (en) * 2018-09-26 2020-04-02 Boe Technology Group Co., Ltd. Computer-implemented method, apparatus, and computer-program product
CN110956575A (en) * 2018-09-26 2020-04-03 京东方科技集团股份有限公司 Method and device for converting image style and convolution neural network processor
US11842267B2 (en) 2018-09-26 2023-12-12 Boe Technology Group Co., Ltd. Computer-implemented method using a neural network comprising down-sampling segment, densely connected segment, and up-sampling segment, apparatus, and computer-program product
US11049217B2 (en) 2019-04-30 2021-06-29 Beijing Sensetime Technology Development Co., Ltd. Magnifying feature map

Also Published As

Publication number Publication date
CN105120130B (en) 2018-06-29
US10019642B1 (en) 2018-07-10
CN105120130A (en) 2015-12-02
WO2017045374A1 (en) 2017-03-23

Similar Documents

Publication Publication Date Title
US10019642B1 (en) Image upsampling system, training method thereof and image upsampling method
US10515435B2 (en) Apparatus for upscaling an image, method for training the same, and method for upscaling an image
US10311547B2 (en) Image upscaling system, training method thereof, and image upscaling method
US10043113B1 (en) Method and device for generating feature maps by using feature upsampling networks
CN111369440B (en) Model training and image super-resolution processing method, device, terminal and storage medium
US11216910B2 (en) Image processing system, image processing method and display device
CN204948182U (en) A kind of image raising frequency system and display unit
CN105976318A (en) Image super-resolution reconstruction method
KR20210045828A (en) Image processing apparatus and method
CN111986092B (en) Dual-network-based image super-resolution reconstruction method and system
CN112801904B (en) Hybrid degraded image enhancement method based on convolutional neural network
KR20190059157A (en) Method and Apparatus for Improving Image Quality
CN109949223A (en) Image super-resolution reconstructing method based on the dense connection of deconvolution
CN107203969A (en) A kind of high magnification image super-resolution rebuilding method of medium scale constraint
JP2022532669A (en) Methods and equipment for identifying videos
KR102092205B1 (en) Image processing method and apparatus for generating super resolution, inverse tone mapping and joint super resolution-inverse tone mapping processed multiple output image
KR101769042B1 (en) Heteroptics analysis device, heteroptically-altered image generating device for generating heteroptics to create perception approximating original image, heteroptics analysis method, method for generating heteroptically-altered image to create perception approximating original image, and program
EP2153404B1 (en) Method and device for selecting transform matrices for down-sampling dct image using learning-with-forgetting algorithm
CN111223046B (en) Image super-resolution reconstruction method and device
EP2153403B1 (en) Method and device for down-sampling a dct image in the dct domain
CN114638761B (en) Full-color sharpening method, equipment and medium for hyperspectral image
CN112419146B (en) Image processing method and device and terminal equipment
CN116266336A (en) Video super-resolution reconstruction method, device, computing equipment and storage medium
CN111461987B (en) Network construction method, image super-resolution reconstruction method and system
JP2020064557A (en) Image processing device, image processing system, imaging device, image processing method, and program

Legal Events

Date Code Title Description
STCF Information on status: patent grant

Free format text: PATENTED CASE

MAFP Maintenance fee payment

Free format text: PAYMENT OF MAINTENANCE FEE, 4TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1551); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

Year of fee payment: 4