SG11201903269QA - Method and apparatus for the access to bioinformatics data structured in access units - Google Patents

Method and apparatus for the access to bioinformatics data structured in access units

Info

Publication number
SG11201903269QA
SG11201903269QA SG11201903269QA SG11201903269QA SG11201903269QA SG 11201903269Q A SG11201903269Q A SG 11201903269QA SG 11201903269Q A SG11201903269Q A SG 11201903269QA SG 11201903269Q A SG11201903269Q A SG 11201903269QA SG 11201903269Q A SG11201903269Q A SG 11201903269QA
Authority
SG
Singapore
Prior art keywords
pct
access
international
october
data
Prior art date
Application number
SG11201903269QA
Inventor
Claudio Alberti
Giorgio Zoia
Daniele Renzi
Mohamed Baluch
Original Assignee
Genomsys Sa
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from PCT/EP2016/074307 external-priority patent/WO2018068829A1/en
Priority claimed from PCT/EP2016/074311 external-priority patent/WO2018068830A1/en
Priority claimed from PCT/EP2016/074301 external-priority patent/WO2018068828A1/en
Priority claimed from PCT/EP2016/074297 external-priority patent/WO2018068827A1/en
Application filed by Genomsys Sa filed Critical Genomsys Sa
Priority claimed from PCT/US2017/041579 external-priority patent/WO2018071078A1/en
Publication of SG11201903269QA publication Critical patent/SG11201903269QA/en

Links

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B50/00ICT programming tools or database systems specially adapted for bioinformatics
    • G16B50/50Compression of genetic data
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03MCODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/30Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
    • H03M7/3084Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction using adaptive string matching, e.g. the Lempel-Ziv method
    • H03M7/3086Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction using adaptive string matching, e.g. the Lempel-Ziv method employing a sliding window, e.g. LZ77
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B50/00ICT programming tools or database systems specially adapted for bioinformatics
    • G16B50/30Data warehousing; Computing architectures
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6809Methods for determination or identification of nucleic acids involving differential detection
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/10Protecting distributed programs or content, e.g. vending or licensing of copyrighted material ; Digital rights management [DRM]
    • G06F21/12Protecting executable software
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/60Protecting data
    • G06F21/62Protecting access to data via a platform, e.g. using keys or access control rules
    • G06F21/6218Protecting access to data via a platform, e.g. using keys or access control rules to a system of files or objects, e.g. local or distributed file system or database
    • G06F21/6245Protecting personal data, e.g. for financial or medical purposes
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B30/00ICT specially adapted for sequence analysis involving nucleotides or amino acids
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B30/00ICT specially adapted for sequence analysis involving nucleotides or amino acids
    • G16B30/10Sequence alignment; Homology search
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B30/00ICT specially adapted for sequence analysis involving nucleotides or amino acids
    • G16B30/20Sequence assembly
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B40/00ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
    • G16B40/10Signal processing, e.g. from mass spectrometry [MS] or from PCR
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B45/00ICT specially adapted for bioinformatics-related data visualisation, e.g. displaying of maps or networks
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B50/00ICT programming tools or database systems specially adapted for bioinformatics
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B50/00ICT programming tools or database systems specially adapted for bioinformatics
    • G16B50/20Heterogeneous data integration
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03MCODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/30Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03MCODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/30Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
    • H03M7/3084Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction using adaptive string matching, e.g. the Lempel-Ziv method
    • H03M7/3088Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction using adaptive string matching, e.g. the Lempel-Ziv method employing the use of a dictionary, e.g. LZ78
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03MCODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/30Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
    • H03M7/3084Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction using adaptive string matching, e.g. the Lempel-Ziv method
    • H03M7/3091Data deduplication
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03MCODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/30Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
    • H03M7/3084Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction using adaptive string matching, e.g. the Lempel-Ziv method
    • H03M7/3091Data deduplication
    • H03M7/3095Data deduplication using variable length segments
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03MCODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/30Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
    • H03M7/40Conversion to or from variable length codes, e.g. Shannon-Fano code, Huffman code, Morse code
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03MCODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/30Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
    • H03M7/60General implementation details not specific to a particular type of compression
    • H03M7/6005Decoder aspects
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03MCODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/30Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
    • H03M7/70Type of the data to be coded, other than image and sound

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Medical Informatics (AREA)
  • Biophysics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Biotechnology (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Evolutionary Biology (AREA)
  • Bioethics (AREA)
  • Chemical & Material Sciences (AREA)
  • Databases & Information Systems (AREA)
  • Software Systems (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Analytical Chemistry (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Hardware Design (AREA)
  • Computer Security & Cryptography (AREA)
  • Genetics & Genomics (AREA)
  • Data Mining & Analysis (AREA)
  • Organic Chemistry (AREA)
  • Wood Science & Technology (AREA)
  • Technology Law (AREA)
  • Multimedia (AREA)
  • Zoology (AREA)
  • Molecular Biology (AREA)
  • Biochemistry (AREA)
  • Evolutionary Computation (AREA)
  • Immunology (AREA)
  • Signal Processing (AREA)
  • Artificial Intelligence (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Epidemiology (AREA)
  • Microbiology (AREA)
  • Public Health (AREA)

Abstract

PUBLISHEDUNDER THE PATENT COOPERATION TREATY (PCT) () INTERNATIONAL APPLICATION (19) World Intellectual Property Organization International Bureau (43) International Publication Date Iiiimmomiolollmonoloolomoionitiomovoimill (10) International Publication Number WO 2018/071078 Al 0 CC mode %anomie Region Aft Cies 1 M F.11 ,M AURJA 1 t•Uk). AUS.„1 19 April 2018 (19.04.2018) WIPO I PCT (51) International Patent Classification: 11 October 2016 (11.10.2016) EP GO6F 19/18 (2011.01) GO6F 21/12 (2013.01) PCT/EP2016/74301 GOOF 19/20 (2011.01) HO3M 7/30 (2006.01) 11 October 2016 (11.10.2016) EP GOOF 19/22 (2011.01) PCT/EP2017/017842 (21) International Application Number: 14 February 2017 (14.02.2017) PCT/EP2017/017841 US PCT/US2017/041579 14 February 2017 (14.02.2017) US (22) International Filing Date: (71) Applicant: GENOMSYS SA; Chemin De La Raye 13, 11 July 2017 (11.07.2017) 1024 Ecublens Vd (CH). (25) Filing Language: English (72) Inventor; and (26) Publication Language: English (71) Applicant: BALUCH, Mohamed, Khoso [US/US]; 4439 Woodsedge Ct, Chantilly, VA 20151 (US). (30) Priority Data: PCT/EP2016/074307 (72) Inventors: ALBERTI, Claudio; Chemin Des Esserts 1, 11 October 2016 (11.10.2016) EP PCT/EP2016/074297 11 October 2016 (11.10.2016) EP PCT/EP2016/074311 1213 Petit-lancy (CH). ZOIA, Giorgio; Chemin Des Croix- rouges 10, 1007 Lausanne (CH). RENZI, Daniele; Route Aloys-fauquez 105, 1018 Lausanne (CH). (54) Title: METHOD AND APPARATUS FOR THE ACCESS TO BIOINFORMATICS DATA STRUCTURED IN ACCESS UNITS GRC mode AUO_CI AtiiO Region 1 Figure 17 (57) : Method and apparatus for the coding and selective access of compressed genomic sequence data produced by genomic sequencing machines. The coding process is based on aligning sequence reads with respect to pre-existing or constructed reference sequences, on classifying and coding the sequence reads by means of sets of descriptors, and further partitioning the descriptor sets into access units of different types. Efficient selective access to specific genomic regions with the guarantee of retrieving all sequence reads mapped to those regions, is provided by: signaling the type of data mapping configuration used to store or transmit the descriptor sets, determining the minimum number of access units that need to be retrieved and decoded to access a genomic region, providing a master index table that contain all information for optimizing the data access process. [Continued on next page] GC N O N O GC O O WO 2018/071078 Al MIDEDIMOMOIDEIREEMONOMOIONMEMOVOIS (74) Agent: BILICKI, Byron et al.; The Bilicki Law Finn, P.C., 1285 North Main Street, Jamestown, NY 14701 (US). (81) Designated States (unless otherwise indicated, for every kind of national protection available): AE, AG, AL, AM, AO, AT, AU, AZ, BA, BB, BG, BH, BN, BR, BW, BY, BZ, CA, CH, CL, CN, CO, CR, CU, CZ, DE, DJ, DK, DM, DO, DZ, EC, EE, EG, ES, FI, GB, GD, GE, GH, GM, GT, HN, HR, HU, ID, IL, IN, IR, IS, JO, JP, KE, KG, KH, KN, KP, KR, KW, KZ, LA, LC, LK, LR, LS, LU, LY, MA, MD, ME, MG, MK, MN, MW, MX, MY, MZ, NA, NG, NI, NO, NZ, OM, PA, PE, PG, PH, PL, PT, QA, RO, RS, RU, RW, SA, SC, SD, SE, SG, SK, SL, SM, ST, SV, SY, TH, TJ, TM, TN, TR, TT, TZ, UA, UG, US, UZ, VC, VN, ZA, ZM, ZW. (84) Designated States (unless otherwise indicated, for every kind of regional protection available): ARIPO (BW, GH, GM, KE, LR, LS, MW, MZ, NA, RW, SD, SL, ST, SZ, TZ, UG, ZM, ZW), Eurasian (AM, AZ, BY, KG, KZ, RU, TJ, TM), European (AL, AT, BE, BG, CH, CY, CZ, DE, DK, EE, ES, FI, FR, GB, GR, HR, HU, IE, IS, IT, LT, LU, LV, MC, MK, MT, NL, NO, PL, PT, RO, RS, SE, SI, SK, SM, TR), OAPI (BF, BJ, CF, CG, CI, CM, GA, GN, GQ, GW, KM, ML, MR, NE, SN, TD, TG). Declarations under Rule 4.17: — as to the identity of the inventor (Rule 4.17(0) Published: — with international search report (Art. 21(3))
SG11201903269QA 2016-10-11 2017-07-11 Method and apparatus for the access to bioinformatics data structured in access units SG11201903269QA (en)

Applications Claiming Priority (7)

Application Number Priority Date Filing Date Title
PCT/EP2016/074307 WO2018068829A1 (en) 2016-10-11 2016-10-11 Method and apparatus for compact representation of bioinformatics data
PCT/EP2016/074311 WO2018068830A1 (en) 2016-10-11 2016-10-11 Method and system for the transmission of bioinformatics data
PCT/EP2016/074301 WO2018068828A1 (en) 2016-10-11 2016-10-11 Method and system for storing and accessing bioinformatics data
PCT/EP2016/074297 WO2018068827A1 (en) 2016-10-11 2016-10-11 Efficient data structures for bioinformatics information representation
EP2017017841 2017-02-14
EP2017017842 2017-02-14
PCT/US2017/041579 WO2018071078A1 (en) 2016-10-11 2017-07-11 Method and apparatus for the access to bioinformatics data structured in access units

Publications (1)

Publication Number Publication Date
SG11201903269QA true SG11201903269QA (en) 2019-05-30

Family

ID=86772539

Family Applications (2)

Application Number Title Priority Date Filing Date
SG11201903269QA SG11201903269QA (en) 2016-10-11 2017-07-11 Method and apparatus for the access to bioinformatics data structured in access units
SG11201907415SA SG11201907415SA (en) 2016-10-11 2017-12-14 Method and systems for the reconstruction of genomic reference sequences from compressed genomic sequence reads

Family Applications After (1)

Application Number Title Priority Date Filing Date
SG11201907415SA SG11201907415SA (en) 2016-10-11 2017-12-14 Method and systems for the reconstruction of genomic reference sequences from compressed genomic sequence reads

Country Status (12)

Country Link
US (1) US11763918B2 (en)
EP (1) EP3583249B1 (en)
KR (2) KR102421458B1 (en)
AU (2) AU2017342754A1 (en)
CA (2) CA3040057A1 (en)
CL (1) CL2019000971A1 (en)
CO (1) CO2019009919A2 (en)
IL (3) IL268649B1 (en)
MX (3) MX2019004129A (en)
PH (1) PH12019501880A1 (en)
SG (2) SG11201903269QA (en)
WO (1) WO2018151786A1 (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20200058379A1 (en) * 2018-08-20 2020-02-20 The Board Of Trustees Of The Leland Stanford Junior University Systems and Methods for Compressing Genetic Sequencing Data and Uses Thereof
US20200104463A1 (en) * 2018-09-28 2020-04-02 Chris Glode Genomic network service user interface
CN111326216B (en) * 2020-02-27 2023-07-21 中国科学院计算技术研究所 Rapid partitioning method for big data gene sequencing file
KR102497634B1 (en) * 2020-12-21 2023-02-08 부산대학교 산학협력단 Method and apparatus for compressing fastq data through character frequency-based sequence reordering

Family Cites Families (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6303297B1 (en) 1992-07-17 2001-10-16 Incyte Pharmaceuticals, Inc. Database for storage and analysis of full-length sequences
JP3429674B2 (en) 1998-04-28 2003-07-22 沖電気工業株式会社 Multiplex communication system
AU2001255344A1 (en) 2000-04-12 2001-11-12 King Faisal Specialist Hospital And Research Centre System for identifying and analyzing expression of are-containing genes
US20040153255A1 (en) 2003-02-03 2004-08-05 Ahn Tae-Jin Apparatus and method for encoding DNA sequence, and computer readable medium
US7805282B2 (en) 2004-03-30 2010-09-28 New York University Process, software arrangement and computer-accessible medium for obtaining information associated with a haplotype
US8340914B2 (en) 2004-11-08 2012-12-25 Gatewood Joe M Methods and systems for compressing and comparing genomic data
US7424371B2 (en) 2004-12-21 2008-09-09 Helicos Biosciences Corporation Nucleic acid analysis
WO2007132461A2 (en) 2006-05-11 2007-11-22 Ramot At Tel Aviv University Ltd. Classification of protein sequences and uses of classified proteins
US8116988B2 (en) 2006-05-19 2012-02-14 The University Of Chicago Method for indexing nucleic acid sequences for computer based searching
SE531398C2 (en) 2007-02-16 2009-03-24 Scalado Ab Generating a data stream and identifying positions within a data stream
AU2011258875B2 (en) 2010-05-25 2016-05-05 The Regents Of The University Of California Bambam: parallel comparative analysis of high-throughput sequencing data
KR101922129B1 (en) 2011-12-05 2018-11-26 삼성전자주식회사 Method and apparatus for compressing and decompressing genetic information using next generation sequencing(NGS)
US10333547B2 (en) 2012-08-13 2019-06-25 Gurologic Microsystems Oy Encoder and method for encoding input data using a plurality of different transformations or combinations of transformations
US9679104B2 (en) 2013-01-17 2017-06-13 Edico Genome, Corp. Bioinformatics systems, apparatuses, and methods executed on an integrated circuit processing platform
WO2014145503A2 (en) 2013-03-15 2014-09-18 Lieber Institute For Brain Development Sequence alignment using divide and conquer maximum oligonucleotide mapping (dcmom), apparatus, system and method related thereto
JP6054790B2 (en) 2013-03-28 2016-12-27 三菱スペース・ソフトウエア株式会社 Gene information storage device, gene information search device, gene information storage program, gene information search program, gene information storage method, gene information search method, and gene information search system
WO2014186604A1 (en) 2013-05-15 2014-11-20 Edico Genome Corp. Bioinformatics systems, apparatuses, and methods executed on an integrated circuit processing platform
KR101522087B1 (en) 2013-06-19 2015-05-28 삼성에스디에스 주식회사 System and method for aligning genome sequnce considering mismatch
US20150032711A1 (en) 2013-07-06 2015-01-29 Victor Kunin Methods for identification of organisms, assigning reads to organisms, and identification of genes in metagenomic sequences
US10902937B2 (en) 2014-02-12 2021-01-26 International Business Machines Corporation Lossless compression of DNA sequences
GB2527588B (en) 2014-06-27 2016-05-18 Gurulogic Microsystems Oy Encoder and decoder
GB2528460B (en) 2014-07-21 2018-05-30 Gurulogic Microsystems Oy Encoder, decoder and method
US10230390B2 (en) 2014-08-29 2019-03-12 Bonnie Berger Leighton Compressively-accelerated read mapping framework for next-generation sequencing

Also Published As

Publication number Publication date
EP3583249A4 (en) 2020-12-02
CO2019009919A2 (en) 2019-12-10
US20200043570A1 (en) 2020-02-06
WO2018151786A1 (en) 2018-08-23
PH12019501880A1 (en) 2020-06-29
EP3583249A1 (en) 2019-12-25
AU2017398951A1 (en) 2019-10-10
MX2019004131A (en) 2020-01-30
MX2019004129A (en) 2019-08-29
CA3040057A1 (en) 2018-04-19
IL268649B1 (en) 2024-03-01
KR20190062551A (en) 2019-06-05
KR20190113969A (en) 2019-10-08
SG11201907415SA (en) 2019-09-27
IL268650B1 (en) 2024-03-01
CA3052773A1 (en) 2018-08-23
US11763918B2 (en) 2023-09-19
AU2017342754A1 (en) 2019-05-02
KR102421458B1 (en) 2022-07-14
EP3583249B1 (en) 2023-07-19
CL2019000971A1 (en) 2019-08-23
IL268650A (en) 2019-10-31
MX2019009682A (en) 2019-10-09
IL268649A (en) 2019-10-31
IL265956A (en) 2019-05-30

Similar Documents

Publication Publication Date Title
SG11201903271UA (en) Method and systems for the indexing of bioinformatics data
SG11201903269QA (en) Method and apparatus for the access to bioinformatics data structured in access units
SG11201807334SA (en) Methods, compositions, and devices for information storage
SG11201906395PA (en) Blockchain based data processing method and device
SG11201804190YA (en) Method and system for blockchain variant using digital signatures
SG11201901550WA (en) Method and apparatus for data processing
SG11201909950QA (en) Identifying entities in electronic medical records
SG11201805520UA (en) User interface
SG11201904942YA (en) Blockchain-based service execution method and apparatus, and electronic device
SG11201907476XA (en) Advanced signalling of regions of interest in omnidirectional visual media
SG11201906875RA (en) Ultra-reliable low-latency communication indication channelization designs
SG11201903310UA (en) Service control and user identity authentication based on virtual reality
SG11201902667UA (en) Methods and systems for chromatography data analysis
SG11201805562QA (en) Genomic infrastructure for on-site or cloud-based dna and rna processing and analysis
SG11201806853VA (en) Abstracted graphs from social relationship graph
SG11201407486PA (en) Compositions and methods for modulating utrn expression
SG11201803928UA (en) Processing data using dynamic partitioning
SG11201908556UA (en) Methods and devices for providing transaction data to blockchain system for processing
SG11201900331VA (en) Protected indexing and querying of large sets of textual data
SG11201804712PA (en) Biofuel
SG11201806825RA (en) A data source system agnostic fact category partitioned information repository and methods for the insertion and retrieval of data using the information repository
SG11201811425TA (en) Techniques for in-memory key range searches
SG11201909271XA (en) Energy management system
SG11201809758SA (en) Apparatus and method
SG11201903175VA (en) Efficient data structures for bioinformatics information representation