Warning
This repo was archived because Protocol Labs no longer operates Hydra Boosters for the IPFS network.
For more information, see: https://discuss.ipfs.tech/t/dht-hydra-peers-dialling-down-non-bridging-functionality-on-2022-12-01/15567
A DHT Indexer node & Peer Router
A new type of DHT node designed to accelerate the Content Resolution & Content Providing on the IPFS Network. A (cute) Hydra with one belly full of records and many heads (Peer IDs) to tell other nodes about them, charged with rocket boosters to transport other nodes to their destination faster.
[openssl support (lower CPU usage)]
go get -tags=openssl github.com/libp2p/hydra-booster
[standard (sub-optimal)]
go get github.com/libp2p/hydra-booster
Run a hydra booster with a single head:
go run ./main.go
Pass the -nheads=N
option to run N heads at a time in the same process. It periodically prints out a status line with information about total peers, uptime, and memory usage.
go run ./main.go -nheads=5
Alternatively you can use the HYDRA_NHEADS
environment var to specify the number of heads.
There's also a nicer UI option, this is intended to be run in a tmux window or something so you can see statistics about your contribution to the network. Use the -ui-theme
flag to switch to it:
go run ./main.go -ui-theme=gooey # also "none" to turn off logging
Usage of hydra-booster:
-bootstrap-conc int
How many concurrent bootstraps to run (default 32)
-bootstrap-peers string
A CSV list of peer addresses to bootstrap from.
-bucket-size int
Specify the bucket size, note that for some protocols this must be a specific value i.e. for "/ipfs" it MUST be 20 (default 20)
-db string
Datastore directory (for LevelDB store) or postgresql:// connection URI (for PostgreSQL store)
-provider-store
A non-default provider store to use (currently only supports "dynamodb,table=<string>,ttl=<ttl-in-seconds>,queryLimit=<int>").
-disable-db-create
Don't create table and index in the target database (default false).
-disable-prefetch
Disables pre-fetching of discovered provider records (default false).
-disable-prov-counts
Disable counting provider records for metrics reporting (default false).
-disable-prov-gc
Disable provider record garbage collection (default false).
-disable-providers
Disable storing and retrieving provider records, note that for some protocols, like "/ipfs", it MUST be false (default false).
-disable-values
Disable storing and retrieving value records, note that for some protocols, like "/ipfs", it MUST be false (default false).
-enable-relay
Enable libp2p circuit relaying for this node (default false).
-httpapi-addr string
Specify an IP and port to run the HTTP API server on (default "127.0.0.1:7779")
-idgen-addr string
Address of an idgen HTTP API endpoint to use for generating private keys for heads
-mem
Use an in-memory database. This overrides the -db option
-metrics-addr string
Specify an IP and port to run Prometheus metrics and pprof HTTP server on (default "127.0.0.1:9758")
-name string
A name for the Hydra (for use in metrics)
-nheads int
Specify the number of Hydra heads to create. (default -1)
-port-begin int
If set, begin port allocation here (default -1)
-protocol-prefix string
Specify the DHT protocol prefix (default "/ipfs") (default "/ipfs")
-pstore string
Peerstore directory for LevelDB store (defaults to in-memory store)
-random-seed string
Seed to use to generate IDs (useful if you want to have persistent IDs). Should be Base64 encoded and 256bits
-id-offset
What offset in the sequence of keys generated from random-seed to start from
-stagger duration
Duration to stagger nodes starts by
-ui-theme string
UI theme, "logey", "gooey" or "none" (default "logey")
Alternatively, some flags can be set via environment variables. Note that flags take precedence over environment variables.
HYDRA_BOOTSTRAP_PEERS string
A CSV list of peer addresses to bootstrap from.
HYDRA_DB string
Datastore directory (for LevelDB store) or postgresql:// connection URI (for PostgreSQL store)
HYDRA_PSTORE string
Peerstore directory for LevelDB store (defaults to in-memory store)
HYDRA_PROVIDER_STORE string
A non-default provider store to use (currently only supports "dynamodb,table=<string>,ttl=<ttl-in-seconds>,queryLimit=<int>").
HYDRA_DISABLE_DBCREATE
Don't create table and index in the target database (default false).
HYDRA_DISABLE_PREFETCH
Disables pre-fetching of discovered provider records (default false).
HYDRA_DISABLE_PROV_COUNTS
Disable counting provider records for metrics reporting (default false).
HYDRA_DISABLE_PROV_GC
Disable provider record garbage collection (default false).
HYDRA_IDGEN_ADDR string
Address of an idgen HTTP API endpoint to use for generating private keys for heads
HYDRA_NAME string
A name for the Hydra (for use in metrics)
HYDRA_NHEADS int
Specify the number of Hydra heads to create. (default -1)
HYDRA_PORT_BEGIN int
If set, begin port allocation here (default -1)
HYDRA_RANDOM_SEED string
Seed to use to generate IDs (useful if you want to have persistent IDs). Should be Base64 encoded and 256bits
HYDRA_ID_OFFSET int
What offset in the sequence of keys generated from random-seed to start from
Only run a hydra-booster
on machines with public IP addresses. Having more DHT nodes behind NATs makes DHT queries in general slower, as connecting in generally takes longer and sometimes doesnt even work (resulting in a timeout).
When running with -nheads
, please make sure to bump the ulimit to something fairly high. Expect ~500 connections per node you're running (so with -nheads=10
, try setting ulimit -n 5000
)
The total number of heads a single Hydra can have depends on the resources of the machine it's running on. To get the desired number of heads you may need to run multiple Hydras on multiple machines. There's a couple of challenges with this:
- Peer IDs of Hydra heads are balanced in the DHT. When running multiple Hydras it's necessary to designate one of the Hydras to be the "idgen server" and the rest to be "idgen clients" so that all Peer IDs in the Hydra swarm are balanced. Use the
-idgen-addr
flag orHYDRA_IDGEN_ADDR
environment variable to ensure all Peer IDs in the Hydra swarm are balanced perfectly. - A datastore is shared by all Hydra heads but not by all Hydras. Use the
-db
flag orHYDRA_DB
environment variable to specify a PostgreSQL database connection string that can be shared by all Hydras in the swarm. - When sharing a datastore between multiple Hydras, ensure only one Hydra in the swarm is performing GC on provider records by using the
-disable-prov-gc
flag orHYDRA_DISABLE_PROV_GC
environment variable, and ensure only one Hydra is counting the provider records in the datastore by using the-disable-prov-counts
flag orHYDRA_DISABLE_PROV_COUNTS
environment variable.
If the "dynamodb" provider store is specified, then provider records will not be stored in the datastore, but in a DynamoDB table that must conform with the following schema:
key
- type: bytes
- primary key
ttl
- type: int
- sort key
The command line / environment variable requires various arguments for configuring the provider store:
- table
- string, required
- the DynamoDB table name
- ttl
- int, required
- the duration in seconds for the provider record TTL, after which DynamoDB will evict the entry
- queryLimit
- int, required
- limit for the # records to retrieve from DynamoDB for a single GET_PROVIDERS DHT query
A GET_PROVIDERS DHT query will result in >=1 DynamoDB queries. The provider store will follow the pagination until the query limit is reached, or no more records are available. DynamoDB will return up to 1 MB of records in a single query page. The providers are sorted by descending TTL, so the most-recently-added providers will be returned first. When the query limit is reached, the remaining providers are truncated.
The provider store uses the default AWS SDK credential store, which will search for credentials in environment variables, ~/.aws, the EC2 instance metadata service, ECS agent, etc.
Some notes and caveats:
- This does not use consistent reads, so read-after-write is eventually consistent. Consistency is usually achieved so quickly that it's unnoticeable.
- If the system receives two ADD_PROVIDER messages for the same multihash in the same millisecond, they will race and only one will win, since records are keyed on (multihash, ttl). This should be rare. The
prov_ddb_collisions
counter is incremented when this happens.
- Update version number in
version.go
. - Create a semver tag with "v" prefix e.g.
git tag v0.1.7
. - Publish a new image to docker hub
- Scale the hydras down and then back up to pick up the change
# Build your container
docker build -t hydra-booster .
# Get it to run
docker run hydra-booster
# Commit new version
docker commit -m="some commit message" <CONTAINER_ID> libp2p/hydra-booster
# Push to docker hub (must be logged in, do docker login)
docker push libp2p/hydra-booster
Install Prometheus and then start it using the provided config:
prometheus --config.file=promconfig.yaml --storage.tsdb.path=prometheus-data
Next start the Hydra Booster, specifying the IP and port to run metrics on:
go run ./main.go -nheads=5 -metrics-addr=127.0.0.1:9090
You should now be able to access metrics at http://127.0.0.1:9090.
By default the HTTP API is available at http://127.0.0.1:7779.
Returns an ndjson list of peers created by the Hydra: their IDs and mulitaddrs. Example output:
{"Addrs":["/ip4/127.0.0.1/tcp/50277","/ip4/192.168.0.3/tcp/50277"],"ID":"12D3KooWHacdCMnm4YKDJHn72HPTxc6LRGNzbrbyVEnuLFA3FXCZ"}
{"Addrs":["/ip4/127.0.0.1/tcp/50280","/ip4/192.168.0.3/tcp/50280","/ip4/90.198.150.147/tcp/50280"],"ID":"12D3KooWQnUpnw6xS2VrJw3WuCP8e92fsEDnh4tbqyrXW5AVJ7oe"}
...
Returns an ndjson list of provider records stored by the Hydra Booster node.
Fetches provider record(s) available on the network by CID. Use the nProviders
query string parameter to signal the number of provider records to find. Returns an ndjson list of provider peers: their IDs and mulitaddrs. Will return HTTP status code 404 if no records were found.
Generate and add a balanced Peer ID to the server's xor trie and return it for use by another Hydra Booster peer. Returns a base64 encoded JSON string. Example output:
"CAESQNcYNr0ENfml2IaiE97Kf3hGTqfB5k5W+C2/dW0o0sJ7b7zsvxWMedz64vKpS2USpXFBKKM9tWDmcc22n3FBnow="
Remove a balanced Peer ID from the server's xor trie. Accepts a base64 encoded JSON string.
Returns a list of ndjson peers with open connections optionally filtered by Hydra head. Example output:
{"ID":"12D3KooWKdEMLcKJWk8Swc3KbBJSjpJfNMKZUhcG8LnYPA3XH8Bh","Peer":{"ID":"QmbLHAnMoJPWSCR5Zhtx6BHJX9KiKNN6tpvbUcqanj75Nb","Addr":"/ip4/147.75.83.83/tcp/4001","Direction":2}}
{"ID":"12D3KooWKdEMLcKJWk8Swc3KbBJSjpJfNMKZUhcG8LnYPA3XH8Bh","Peer":{"ID":"QmQCU2EcMqAqQPR2i9bChDtGNJchTbq5TbXJJ16u19uLTa","Addr":"/ip6/2604:1380:0:c100::1/tcp/4001","Direction":2}}
{"ID":"12D3KooWKdEMLcKJWk8Swc3KbBJSjpJfNMKZUhcG8LnYPA3XH8Bh","Peer":{"ID":"QmNnooDu7bfjPFoTZYxMNLWUQJyrVwtbZg5gBMjTezGAJN","Addr":"/ip4/147.75.69.143/tcp/4001","Direction":2}}
{"ID":"12D3KooWKdEMLcKJWk8Swc3KbBJSjpJfNMKZUhcG8LnYPA3XH8Bh","Peer":{"ID":"QmaCpDMGvV2BGHeYERUEnRQAwe3N8SzbUtfsmvsqQLuvuJ","Addr":"/ip4/104.131.131.82/tcp/4001","Direction":2}}
{"ID":"12D3KooWKdEMLcKJWk8Swc3KbBJSjpJfNMKZUhcG8LnYPA3XH8Bh","Peer":{"ID":"QmcZf59bWwK5XFi76CZX8cbJ4BhTzzA3gU1ZjYZcYW3dwt","Addr":"/ip6/2604:1380:3000:1f00::1/tcp/4001","Direction":2}}
{"ID":"12D3KooWA6MQcQhLAWDJFqWAUNyQf9MuFUGVf3LMo232x8cnrK3p","Peer":{"ID":"QmbLHAnMoJPWSCR5Zhtx6BHJX9KiKNN6tpvbUcqanj75Nb","Addr":"/ip6/2604:1380:2000:7a00::1/tcp/4001","Direction":2}}
{"ID":"12D3KooWA6MQcQhLAWDJFqWAUNyQf9MuFUGVf3LMo232x8cnrK3p","Peer":{"ID":"QmQCU2EcMqAqQPR2i9bChDtGNJchTbq5TbXJJ16u19uLTa","Addr":"/ip6/2604:1380:0:c100::1/tcp/4001","Direction":2}}
{"ID":"12D3KooWA6MQcQhLAWDJFqWAUNyQf9MuFUGVf3LMo232x8cnrK3p","Peer":{"ID":"QmaCpDMGvV2BGHeYERUEnRQAwe3N8SzbUtfsmvsqQLuvuJ","Addr":"/ip4/104.131.131.82/tcp/4001","Direction":2}}
{"ID":"12D3KooWA6MQcQhLAWDJFqWAUNyQf9MuFUGVf3LMo232x8cnrK3p","Peer":{"ID":"QmNnooDu7bfjPFoTZYxMNLWUQJyrVwtbZg5gBMjTezGAJN","Addr":"/ip6/2604:1380:1000:6000::1/tcp/4001","Direction":2}}
{"ID":"12D3KooWA6MQcQhLAWDJFqWAUNyQf9MuFUGVf3LMo232x8cnrK3p","Peer":{"ID":"QmcZf59bWwK5XFi76CZX8cbJ4BhTzzA3gU1ZjYZcYW3dwt","Addr":"/ip4/147.75.94.115/tcp/4001","Direction":2}}
The hydra-booster project is dual-licensed under Apache 2.0 and MIT terms:
- Apache License, Version 2.0, (LICENSE-APACHE or http://www.apache.org/licenses/LICENSE-2.0)
- MIT license (LICENSE-MIT or http://opensource.org/licenses/MIT)