A SURVEY PAPER ON ELASTIC SEARCH SIMILARITY ALGORITHM
DOI:
https://doi.org/10.22159/ajpcr.2017.v10s1.19757
Keywords:
Elasticsearch, Lucene, Big Data, Ranking Algorithm, Indexing, Mapping, Scoring
Abstract
Elasticsearch is a search engine based on Lucene. Apache Lucene is a free and open-source information retrieval software library. Elasticsearch provides a distributed, multitenant-capable full-text search engine with an HTTP web interface and schema-free JSON documents. It is developed in Java and has been released as open source under the terms of the Apache License. Elasticsearch can be used to search all kinds of documents. It provides scalable search, offers near real-time search, and supports multitenancy. It is distributed, which means that indices can be divided into shards and each shard can have zero or more replicas. Each node hosts one or more shards and acts as a coordinator that delegates operations to the correct shard(s). Elasticsearch is effectively a wrapper on top of Lucene. This paper gives a detailed description of how Lucene's scoring algorithm works and how Elasticsearch uses it as its similarity algorithm.
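The abstract does not state the scoring formula itself. As a rough illustration only, the sketch below assumes the classic TF-IDF weighting used by Lucene's practical scoring function (square-root term frequency, logarithmic inverse document frequency, and a field-length norm). The function names, the toy corpus, and the omission of query normalization, the coordination factor, and boosts are simplifications made for this example, not the paper's own code.

```python
import math

def tf(freq):
    # Term frequency factor: square root of the raw term count (assumed classic TF-IDF).
    return math.sqrt(freq)

def idf(num_docs, doc_freq):
    # Inverse document frequency: terms found in fewer documents weigh more.
    return 1.0 + math.log(num_docs / (doc_freq + 1))

def field_norm(num_terms):
    # Field-length norm: matches in shorter fields weigh more.
    return 1.0 / math.sqrt(num_terms)

def score(query_terms, doc_tokens, corpus):
    # Simplified score: sum over matching query terms of tf * idf^2 * norm.
    # Query normalization, the coordination factor, and boosts are left out.
    num_docs = len(corpus)
    total = 0.0
    for term in query_terms:
        freq = doc_tokens.count(term)
        if freq == 0:
            continue
        doc_freq = sum(1 for d in corpus if term in d)
        total += tf(freq) * idf(num_docs, doc_freq) ** 2 * field_norm(len(doc_tokens))
    return total

# Hypothetical toy corpus of pre-tokenized documents.
corpus = [
    ["quick", "brown", "fox"],
    ["quick", "dog"],
    ["lazy", "dog", "dog", "sleeps"],
]
```

With this toy corpus, the rarer term "fox" scores higher against the first document than the more common term "quick", which is the behaviour the inverse document frequency factor is meant to produce.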
The publication is licensed under CC BY and is open access. Copyright remains with the author, who retains publishing rights without restrictions.