search engines

searx

22 May 2020Last Commit6494 (983/yr)Github Stars504Issues

A privacy-respecting, hackable metasearch engine.

Pronunciation: səːks

List of running instances.

See the documentation and the wiki for more information.

Go to the searx-docker project.

For all of the details, follow this step by step installation.

Note: the documentation needs to be updated.

Bugs or suggestions? Visit the issue tracker.

meilisearch

23 May 2020Last Commit5054 (2424/yr)Github Stars87Issues

Lightning Fast, Ultra Relevant, and Typo-Tolerant Search Engine 🔍

MeiliSearch is a powerful, fast, open-source, easy to use and deploy search engine. Both searching and indexing are highly customizable. Features such as typo-tolerance, filters, and synonyms are provided out-of-the-box. For more information about features go to our documentation.

MeiliSearch helps the Rust community find crates on crates.meilisearch.com

If you have the Rust toolchain already installed on your local system, clone the repository and change it to your working directory.

yacy_search_server

05 May 2020Last Commit1829 (354/yr)Github Stars131Issues

The YaCy search engine software provides results from a network of independent peers, instead of a central server. It is a distributed network where no single entity decides what to list or order it appears in.

User privacy is central to YaCy, and it runs on each user's computer, where search terms are hashed before they being sent to the network. Everyone can create their individual search indexes and rankings, and a truly customized search portal.

Each YaCy user is either part of a large search network (search indexes can be exchanged with other installation over a built-in peer-to-peer network protocol) or the user runs YaCy to produce a personal search portal that is either public or private.

ambar

28 Apr 2020Last Commit1459 (418/yr)Github Stars3Issues

Ambar is an open-source document search engine with automated crawling, OCR, tagging and instant full-text search.

Ambar defines a new way to implement full-text document search into your workflow.

Tutorial: Mastering Ambar Search Queries

Ambar 2.0 only supports local fs crawling, if you need to crawl an SMB share of an FTP location - just mount it using standard linux tools. Crawling is automatic, no schedule is needed due to crawlers monitor file system events and automatically process new, changed and removed files.

open-source-search-engine

04 May 2020Last Commit1126 (165/yr)Github Stars67Issues

An open source web and enterprise search engine and spider/crawler. As can be seen on http://www.gigablast.com/ .

See html/faq.html for all administrative documentation including the quick start instructions.

Alternatively, visit http://www.gigablast.com/faq.html

See html/developer.html for all code documentation.

Alternatively, visit http://www.gigablast.com/developer.html

Contact me for feature requests or help in general. I will work for free for good use cases. mattdwells@hotmail.com.

sist2

17 May 2020Last Commit142 (217/yr)Github Stars5Issues

sist2 (Simple incremental search tool)

Warning: sist2 is in early development

* See format support
** See Archive files
*** See OCR

Have an Elasticsearch (>= 6.X.X) instance running

Download sist2 executable

See Usage guide

* Windows users: sist2 runs under WSL

See Usage guide for more details

* See Archive files

sist2 will scan files stored into archive files (zip, tar, 7z...) as if they were directly in the file system. Recursive (archives inside archives) scan is also supported.

Limitations:

To check if a media file can be parsed without seek, execute cat file.mp4 | ffprobe -