Sosse
Self-hosted, Selenium-based search engine and crawler with offline archiving.
Overview
Selenium Open Source Search Engine & crawler. Crawl, archive, and search web pages, including JavaScript-heavy sites, with Sosse. It is open source, flexible, and Selenium-powered. The project has 402 GitHub stars and is licensed under AGPL-3.0.
Key Features
Source: GitHub README
- 🌍 Web Page Search: Search the content of web pages, including dynamically rendered ones, with advanced queries.
- 🕑 Recurring Crawling: Crawl pages at fixed intervals or adapt the rate based on content changes.
- 🔖 Web Page Archiving: Archive HTML content, adjust links for local use, download required assets, and support
- 🏷️ Tags: Organize and filter crawled or archived pages using tags for better search and management.
- 📂 File Downloads: Batch download binary files from web pages.
- 📡 Webhooks: Integrate with external services using highly flexible webhooks. Connect to proprietary AI platforms
- 🔔 Atom Feeds: Generate content feeds for websites that don’t have them, or receive updates when a new page
- 🔒 Authentication: The crawler can authenticate to access private pages and retrieve content.
- 👥 Permissions: Admins can configure crawlers and view statistics, while search is available to authenticated users or anonymous visitors.
- 👤 Search Features: Includes private search history (doc),
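
The "Recurring Crawling" item above mentions adapting the crawl rate to content changes. As a rough illustration of that idea, here is a minimal Python sketch. It is not Sosse's actual algorithm: the halve/double rule, the `next_interval` function, and the `MIN_INTERVAL` / `MAX_INTERVAL` bounds are all assumptions made for this example.

```python
from datetime import timedelta

# Hypothetical bounds for this sketch; not taken from Sosse's configuration.
MIN_INTERVAL = timedelta(hours=1)
MAX_INTERVAL = timedelta(days=30)


def next_interval(current: timedelta, content_changed: bool) -> timedelta:
    """Return the wait before the next crawl of a page.

    Assumed rule: halve the wait when the page changed since the last
    crawl, double it when it did not, and clamp the result to the bounds.
    """
    candidate = current / 2 if content_changed else current * 2
    return max(MIN_INTERVAL, min(MAX_INTERVAL, candidate))
```

With a rule of this shape, frequently updated pages drift toward the minimum interval and static pages toward the maximum, so crawl effort follows how often content actually changes.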
Normalized Features
Source: tool-features-normalized.json
Docker, PostgreSQL, RSS/Atom, tags, webhooks.
Features
Integrations & APIs
- RSS / Atom Feeds
- Webhooks
Search & Discovery
- Tags / Labels
Related Databases & Data Tools
- Supabase (99K): The open-source Firebase alternative, with a Postgres database, Auth, instant APIs, Realtime subscriptions, Edge Functions, Storage, and Vector embeddings.
- Prometheus (63K): An open-source monitoring system with a dimensional data model, flexible query language, efficient time series database, and modern alerting approach.
- NocoDB (62K): Turn your existing database into a collaborative spreadsheet interface without moving a single row of data.
- Meilisearch (56K): Lightning-fast, typo-tolerant search engine with an intuitive API. Drop-in replacement for Algolia that you can self-host for free.
- DBeaver (49K): Free universal database management tool for developers, DBAs, and analysts. Supports 100+ databases including PostgreSQL, MySQL, SQLite, MongoDB, and more.
- Milvus (43K): A high-performance open-source vector database built for AI applications, supporting billion-scale similarity search with sub-second latency.