unsubbed.co

Bacalhau

Bacalhau lets you run platform for fast entirely on your own server.

Overview

Community-driven, simple, yet powerful framework for fast, cost-effective distributed Compute over Data. Globally Distributed Compute Orchestrator ⚡Compute Over Data (CoD) The project has 851 GitHub stars and is licensed under Apache-2.0.

Key Features

Source: GitHub README

  • Single Binary Simplicity: Bacalhau is a single self-contained binary that functions as a client, orchestrator, and compute node—making it incredibly easy to set up and scale
  • Modular Architecture: Support for multiple execution engines (Docker, WebAssembly) and storage providers through clean interfaces
  • Orchestrator-Compute Model: A dedicated orchestrator coordinates job scheduling, while compute nodes run tasks
  • Flexible Storage Integrations: Integrates with S3, HTTP/HTTPS, IPFS, and local storage systems
  • Multiple Job Types: Support for batch, ops, daemon, and service job types for different workflow requirements
  • Declarative & Imperative Submissions: Define jobs in YAML (declarative) or pass arguments via CLI (imperative)
  • Publisher Support: Output results to local volumes, S3, or other storage backends

Getting Started

Source: GitHub README

curl -sL https://get.bacalhau.org/install.sh | bash

Normalized Features

Source: tool-features-normalized.json

docker.