IN2013MU03472A - - Google Patents

Info

Publication number
IN2013MU03472A
Authority
IN
India
Prior art keywords
file
indexing
segments
index
nodes
Inventor
Arun Vasu
Jishnu Kurunthala
Original Assignee
Tata Consultancy Services Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Application filed by Tata Consultancy Services Ltd
Priority to IN3472MU2013
Priority to US14/498,598 (US9846702B2)
Publication of IN2013MU03472A

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10 File systems; File servers
    • G06F16/18 File system types
    • G06F16/182 Distributed file systems
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22 Indexing; Data structures therefor; Storage structures
    • G06F16/2228 Indexing structures

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

INDEXING OF FILE IN A HADOOP CLUSTER

A file indexing system (102) for indexing a file to be stored onto a distributed file system (104) includes a segmentation module (122) to segment the file into a plurality of segments. The file indexing system (102) further includes an index generation module (124) to initiate indexing of the file through a plurality of nodes of a Hadoop cluster, where each of the plurality of nodes indexes one or more segments from amongst the plurality of segments to generate at least one index corresponding to the one or more segments. The file indexing system (102) further includes an index transfer module (126) to store the at least one index onto the distributed file system (104).

<To be published with Figure 1>
IN3472MU2013 2013-10-31 2013-10-31 IN2013MU03472A (en)
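
The three steps named in the abstract (segment the file, index the segments in parallel, store the resulting indexes) can be illustrated with a minimal Python sketch. It is not the patented implementation: a thread pool stands in for the nodes of a Hadoop cluster, a plain dictionary stands in for the distributed file system, and the index is assumed to be a simple inverted index (term to line numbers), which the abstract does not specify. The names `segment_file`, `index_segment`, `index_file`, and the `index/segment-N` keys are hypothetical.

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor


def segment_file(lines, segment_size):
    """Split the file's lines into consecutive segments of at most segment_size lines."""
    return [lines[i:i + segment_size] for i in range(0, len(lines), segment_size)]


def index_segment(segment_id, segment, first_line_no):
    """Build a toy inverted index (term -> sorted line numbers) for one segment.

    The index type is an assumption; the abstract only says "at least one index".
    """
    index = defaultdict(set)
    for offset, line in enumerate(segment):
        for term in line.lower().split():
            index[term].add(first_line_no + offset)
    return segment_id, {term: sorted(nums) for term, nums in index.items()}


def index_file(lines, segment_size=2, workers=4):
    """Index every segment in parallel and store one index per segment."""
    segments = segment_file(lines, segment_size)
    store = {}  # hypothetical stand-in for the distributed file system
    # Each worker thread plays the role of a cluster node indexing a segment.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        futures = [
            pool.submit(index_segment, i, seg, i * segment_size)
            for i, seg in enumerate(segments)
        ]
        for future in futures:
            segment_id, index = future.result()
            store[f"index/segment-{segment_id}"] = index
    return store


if __name__ == "__main__":
    sample = ["alpha beta", "beta gamma", "gamma delta"]
    for path, index in sorted(index_file(sample).items()):
        print(path, index)
```

Keeping one index per segment, rather than merging them, mirrors the abstract's wording that each node generates "at least one index corresponding to the one or more segments"; whether and how the indexes are later merged is not stated in this record.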

Priority Applications (2)

Application Number Priority Date Filing Date Title
IN3472MU2013 IN2013MU03472A (en) 2013-10-31 2013-10-31
US14/498,598 US9846702B2 (en) 2013-10-31 2014-09-26 Indexing of file in a hadoop cluster

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
IN3472MU2013 IN2013MU03472A (en) 2013-10-31 2013-10-31

Publications (1)

Publication Number Publication Date
IN2013MU03472A (en) 2015-07-24

Family

ID=52996626

Family Applications (1)

Application Number Title Priority Date Filing Date
IN3472MU2013 IN2013MU03472A (en) 2013-10-31 2013-10-31

Country Status (2)

Country Link
US (1) US9846702B2 (en)
IN (1) IN2013MU03472A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106294721A (en) * 2016-08-08 2017-01-04 无锡天脉聚源传媒科技有限公司 A kind of company-data statistics and deriving method and device

Families Citing this family (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104834730B (en) * 2015-05-15 2018-06-01 北京京东尚科信息技术有限公司 data analysis system and method
US9961068B2 (en) 2015-07-21 2018-05-01 Bank Of America Corporation Single sign-on for interconnected computer systems
CN105354251B (en) * 2015-10-19 2018-10-30 国家电网公司 Electric power cloud data management indexing means based on Hadoop in electric system
CN105868253A (en) * 2015-12-23 2016-08-17 乐视网信息技术(北京)股份有限公司 Data importing and query methods and apparatuses
CN105740727A (en) * 2016-02-02 2016-07-06 上海斐讯数据通信技术有限公司 Distributed storage method and system of private data
US20200126010A1 (en) * 2016-06-15 2020-04-23 Solix Technologies, Inc. Enterprise Business Record Management System
CN106294842A (en) * 2016-08-19 2017-01-04 浪潮(北京)电子信息产业有限公司 A kind of data interactive method, platform and distributed file system
CN106487582A (en) * 2016-09-21 2017-03-08 努比亚技术有限公司 A kind of method and apparatus of deployment search server
CN106776929A (en) * 2016-11-30 2017-05-31 北京锐安科技有限公司 A kind of method for information retrieval and device
CN106649800A (en) * 2016-12-29 2017-05-10 南威软件股份有限公司 Solr-based Chinese search method
CN106844700A (en) * 2017-02-03 2017-06-13 山东浪潮商用***有限公司 It is a kind of to ask tax system based on Sorl
CN107066595A (en) * 2017-04-19 2017-08-18 济南浪潮高新科技投资发展有限公司 A kind of many application searches method of servicing of big data and system
CN107273515A (en) * 2017-06-21 2017-10-20 国网内蒙古东部电力有限公司信息通信分公司 Power grid data asset resource retrieval and display based on polymorphic data indexing technology
US10936681B2 (en) * 2017-08-03 2021-03-02 International Business Machines Corporation Generalized search engine for abstract data types with skimming and approximate retrieval
US11194804B2 (en) 2017-12-05 2021-12-07 Walmart Apollo, Llc System and method for an index search engine
US11392544B2 (en) * 2018-02-06 2022-07-19 Samsung Electronics Co., Ltd. System and method for leveraging key-value storage to efficiently store data and metadata in a distributed file system
WO2020112993A1 (en) * 2018-11-28 2020-06-04 Jpmorgan Chase Bank, N.A. Systems and methods for data usage monitoring in multi-tenancy enabled hadoop clusters
US11294938B2 (en) 2019-01-03 2022-04-05 International Business Machines Corporation Generalized distributed framework for parallel search and retrieval of unstructured and structured patient data across zones with hierarchical ranking
CN109766360A (en) * 2019-01-09 2019-05-17 北京一览群智数据科技有限责任公司 A kind of list screening method and device
CN110297971B (en) * 2019-05-30 2022-09-20 百度在线网络技术(北京)有限公司 Personalized resource retrieval method, device, equipment and computer readable storage medium
US20220277054A1 (en) * 2021-02-26 2022-09-01 State Farm Mutual Automobile Insurance Company Data migration of search indexes across search-engine deployments

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8856289B2 (en) * 2006-12-29 2014-10-07 Prodea Systems, Inc. Subscription management of applications and services provided through user premises gateway devices
US8082258B2 (en) * 2009-02-10 2011-12-20 Microsoft Corporation Updating an inverted index in a real time fashion
US20110196854A1 (en) * 2010-02-05 2011-08-11 Sarkar Zainul A Providing a www access to a web page
US20120030018A1 (en) * 2010-07-28 2012-02-02 Aol Inc. Systems And Methods For Managing Electronic Content
US8650159B1 (en) * 2010-08-26 2014-02-11 Symantec Corporation Systems and methods for managing data in cloud storage using deduplication techniques
US9092151B1 (en) * 2010-09-17 2015-07-28 Permabit Technology Corporation Managing deduplication of stored data
CN108664555A (en) * 2011-06-14 2018-10-16 慧与发展有限责任合伙企业 Deduplication in distributed file system
US20150112996A1 (en) * 2013-10-23 2015-04-23 Microsoft Corporation Pervasive search architecture

Also Published As

Publication number Publication date
US9846702B2 (en) 2017-12-19
US20150120695A1 (en) 2015-04-30

Similar Documents

Publication Publication Date Title
IN2013MU03472A (en)
IN2015DN03160A (en)
IL252772A0 (en) Generating card stacks with queries on online social networks
MX2015008570A (en) Modifying structured search queries on online social networks.
PH12016500957A1 (en) Data management for connected devices
MX347812B (en) Using inverse operators for queries on online social networks.
WO2014179145A3 (en) Drive level encryption key management in a distributed storage system
MX353716B (en) Structured search queries based on social-graph information.
NZ754204A (en) Object tracking system optimization and tools
CL2015003348A1 (en) Hybrid power / fiber cable
JP2014096164A5 (en)
WO2014165439A3 (en) Automated storage and retrieval system and control system thereof
MX369047B (en) Systems and methods for mapping and routing based on clustering.
GB2525788A (en) Data synchronization
IN2012DE01073A (en)
GB2514275A (en) Identifying and ranking solutions from multiple data sources
ES2722408T3 (en) A wind power plant, and a method to increase the reactive power capacity of a wind power plant
MX361879B (en) Thematic repositories for transaction management.
WO2015167427A3 (en) Data distribution based on network information
GB2530454A (en) Optimization of instruction groups across group boundaries
MX356937B (en) Contact aggregation in a social network.
ES2596662A1 (en) Electrical distribution network (Machine-translation by Google Translate, not legally binding)
MX2015008571A (en) Ambiguous structured search queries on online social networks.
CA2912019C (en) Systems and methods for generating issue networks
MX346840B (en) Vertical-based query optionalizing.