
v2.0 #185

Merged
merged 205 commits into from
Apr 4, 2024
Commits
205 commits
612c9eb
initial changes to hf miner
p-ferreira Jan 29, 2024
afe3993
adds 8bit and 4bit config
p-ferreira Jan 30, 2024
a424e6e
simplifies torch type logic
p-ferreira Jan 30, 2024
843f9a2
adapt zephyr miner to hf miner
p-ferreira Jan 30, 2024
e51ff26
update README for hf miner
p-ferreira Jan 30, 2024
65f3b74
adds should_force_model_loading flag to miners
p-ferreira Jan 30, 2024
1ae7540
miners module refactor
p-ferreira Jan 31, 2024
c2d40f6
runs black on miners code
p-ferreira Jan 31, 2024
9669cab
properly adds miner integration with system prompt args
p-ferreira Jan 31, 2024
dde3840
adds check verification on global logger definition
p-ferreira Feb 1, 2024
9b5b91f
clean main func from hf miner
p-ferreira Feb 1, 2024
a8e3bc8
refactor agent code + adds react agent
p-ferreira Feb 5, 2024
369820b
agents adjustments
p-ferreira Feb 5, 2024
53ca503
adds max iteration to agents
p-ferreira Feb 5, 2024
cc2b5c9
fix load llm issue
p-ferreira Feb 5, 2024
ce160ca
fix max tokens param
p-ferreira Feb 5, 2024
b8dd71b
adds toolminer for experiments
p-ferreira Feb 6, 2024
df7638b
fix error case for non wiki match
p-ferreira Feb 7, 2024
8dc340d
increase wiki retries and fix tool_miner bug
p-ferreira Feb 8, 2024
32e2fbf
dsstore gitignore, change miner base
mccrindlebrian Feb 13, 2024
cd45132
edit StreamPromptingSynapse, and add _forward to BaseStreamMinerNeuron
mccrindlebrian Feb 13, 2024
4c1902b
OpenAIUtils
mccrindlebrian Feb 13, 2024
c5066ce
change llm pipeline for streaming
mccrindlebrian Feb 13, 2024
76b7806
hf_miner to streaming
mccrindlebrian Feb 13, 2024
0cd3820
openai stream miner
mccrindlebrian Feb 13, 2024
7f983b4
echo miner
mccrindlebrian Feb 13, 2024
2b30a33
phrase miner
mccrindlebrian Feb 13, 2024
898b4b8
fix bugs
mccrindlebrian Feb 13, 2024
85c46a2
black entire workspace
mccrindlebrian Feb 13, 2024
9ad977a
add try except in langchain miners
mccrindlebrian Feb 13, 2024
4b4af19
format_send on HuggingFaceMiner
mccrindlebrian Feb 13, 2024
f1e5e57
mock miner
mccrindlebrian Feb 13, 2024
4438f15
add handle_response for StreamMiners
mccrindlebrian Feb 13, 2024
b6cf4a5
remove streaming docs
mccrindlebrian Feb 13, 2024
0735046
remove streaming docs
mccrindlebrian Feb 14, 2024
5fe465d
add client-side querying
mccrindlebrian Feb 14, 2024
1e5e9cb
black, debugging
mccrindlebrian Feb 14, 2024
5478ecd
remove format send bc broke streaming
mccrindlebrian Feb 14, 2024
41d3b5c
remove format_send
mccrindlebrian Feb 14, 2024
443f831
black
mccrindlebrian Feb 14, 2024
aa58e8d
Merge branch 'features/hf-miner' into features/streaming
mccrindlebrian Feb 14, 2024
ae8392e
remove .DS_Store
mccrindlebrian Feb 14, 2024
a129564
add tool miner
mccrindlebrian Feb 14, 2024
da90d8e
add timeout checking to stop streaming tokens
mccrindlebrian Feb 14, 2024
1b63bcd
add timeout checking to all miners
mccrindlebrian Feb 14, 2024
78a8281
add return_streamer logic
mccrindlebrian Feb 15, 2024
c7b52c6
remove force
mccrindlebrian Feb 15, 2024
5eeabb7
add thread closing and queue clearing for huggingface models
mccrindlebrian Feb 16, 2024
1f5e54f
add docstrings, more logic for cleaning
mccrindlebrian Feb 16, 2024
f3ec3fc
add timeout check to openai
mccrindlebrian Feb 16, 2024
ab81674
add streaming_batch_size to config
mccrindlebrian Feb 21, 2024
723b64c
remove dep
mccrindlebrian Feb 21, 2024
5792dfc
add wandb logging if wandb.on
mccrindlebrian Feb 21, 2024
3b4df22
remove get_openai_callback, black repo
mccrindlebrian Feb 21, 2024
784b100
fix batch size bug in config
mccrindlebrian Feb 21, 2024
516fb25
add try except to hf miners
mccrindlebrian Feb 21, 2024
1abe949
add try except to tool_miner
mccrindlebrian Feb 21, 2024
38b9a85
Merge branch 'features/streaming' of github.com:opentensor/prompting …
mccrindlebrian Feb 21, 2024
e1f6ddb
add try except to tool_miner
mccrindlebrian Feb 21, 2024
c86431f
add back in blacklist and priorty, test working
mccrindlebrian Feb 21, 2024
6bae36a
change back logging time
mccrindlebrian Feb 21, 2024
22d562a
develop mock dendrite for testing
mccrindlebrian Feb 22, 2024
f0de609
remove dep
mccrindlebrian Feb 22, 2024
425b31d
add tests for stream miners
mccrindlebrian Feb 22, 2024
5f31d16
add docstrings
mccrindlebrian Feb 23, 2024
b7bdf0b
Merge branch 'main' into features/hf-miner
p-ferreira Feb 23, 2024
13c4bea
Merge branch 'features/hf-miner' into features/streaming
mccrindlebrian Feb 23, 2024
157a9f9
change process timing
mccrindlebrian Feb 23, 2024
d3bfcfb
add constants
mccrindlebrian Feb 23, 2024
38232b1
add default port/id
mccrindlebrian Feb 23, 2024
07a73f3
merge staging
mccrindlebrian Feb 28, 2024
8f12375
remove todo
mccrindlebrian Feb 28, 2024
d1255aa
depreciation on agent miners
mccrindlebrian Feb 28, 2024
e94efae
merge main
mccrindlebrian Feb 28, 2024
46f798e
add depreciation to reqs
mccrindlebrian Feb 28, 2024
51de23f
add log_status
mccrindlebrian Feb 28, 2024
39d8ad0
change version to 1.1.2
mccrindlebrian Feb 28, 2024
8957ff9
add docstrings
mccrindlebrian Feb 29, 2024
4f573fb
move exception clause
mccrindlebrian Feb 29, 2024
d31ec32
remove all agents from PR to separate from features/hf-miners
mccrindlebrian Feb 29, 2024
64d67e7
merge pre-staging, resolve conflicts
mccrindlebrian Feb 29, 2024
860ec69
black
mccrindlebrian Feb 29, 2024
f9863b7
test precommit hook
mccrindlebrian Mar 1, 2024
4b0d76f
add .DS_Store and test precommit hook
mccrindlebrian Mar 1, 2024
cb529ee
Merge branch 'pre-staging' into features/hf-miner
p-ferreira Mar 1, 2024
493e7e4
Update prompting/miners/agents/react_agent.py
p-ferreira Mar 1, 2024
7226fa1
Update prompting/miners/openai_miner.py
p-ferreira Mar 1, 2024
6d34528
update docstring
p-ferreira Mar 1, 2024
70ab932
adds deprecation tags to all agent classes
p-ferreira Mar 1, 2024
1ad9571
Manually remove .DS_Store files
steffencruz Mar 1, 2024
41fd140
Remove .DS_Store in all directories
steffencruz Mar 1, 2024
9186c8b
fix flake8 warnings
p-ferreira Mar 1, 2024
7719a48
drops unnecessary run code for base miner
p-ferreira Mar 1, 2024
10fc0ce
Update prompting/miners/openai_miner.py
p-ferreira Mar 1, 2024
acba249
deprecates tool miner
p-ferreira Mar 1, 2024
cb99e73
Merge pull request #91 from opentensor/features/hf-miner
steffencruz Mar 1, 2024
e12e2fc
merge pre-staging, resolve conflicts, delete stream_tutorial docs
mccrindlebrian Mar 2, 2024
10aae24
change test output type
mccrindlebrian Mar 2, 2024
e72b60a
fix breaking tests
mccrindlebrian Mar 4, 2024
e826bfa
fix streaming test with percentage comparison on timeout:
mccrindlebrian Mar 4, 2024
15e6c91
rework where process_time is calculated to minimize time inconsistancies
mccrindlebrian Mar 4, 2024
c13bfff
check if process_time >= timeout
mccrindlebrian Mar 4, 2024
3567259
samples uids only on candidate uids
mccrindlebrian Mar 5, 2024
e19229d
add additional test when k > number of available uids
mccrindlebrian Mar 5, 2024
d55e953
Update neuron.py
steffencruz Mar 6, 2024
01255d1
black entire repo
mccrindlebrian Mar 6, 2024
bf71611
Merge pull request #145 from opentensor/hotfix/black-pre-staging
steffencruz Mar 7, 2024
d44228c
black
mccrindlebrian Mar 7, 2024
c72b1e4
Merge pull request #143 from opentensor/hotfix/fix-set-weights
mccrindlebrian Mar 7, 2024
ab5f6b2
merge pre-staging
mccrindlebrian Mar 11, 2024
66c7dbd
remove tokenizer mapping for models that use different tokenizers tha…
mccrindlebrian Mar 11, 2024
acf3f0a
remove unfinished wording is docstring
mccrindlebrian Mar 11, 2024
bbfdcfe
remove debugging
mccrindlebrian Mar 12, 2024
d969897
add new readme for pre-staging experiments
mccrindlebrian Mar 12, 2024
e316e93
remove old docs
mccrindlebrian Mar 12, 2024
b847daa
add docs for streaming, and change output types of miners
mccrindlebrian Mar 12, 2024
fe6782c
fix typo
mccrindlebrian Mar 12, 2024
c7caf1a
fix client script
mccrindlebrian Mar 12, 2024
5f9c591
add more to readme
mccrindlebrian Mar 12, 2024
aaed470
format docs to stream_miner_template
mccrindlebrian Mar 12, 2024
8b0a3aa
Update prompting/mock.py
mccrindlebrian Mar 12, 2024
9885128
Update prompting/miners/hf_miner.py
mccrindlebrian Mar 12, 2024
deacce0
Update prompting/utils/config.py
steffencruz Mar 12, 2024
ef911db
Update prompting/utils/uids.py
steffencruz Mar 12, 2024
ad9e1ff
Update prompting/utils/uids.py
steffencruz Mar 12, 2024
29680ec
Update prompting/utils/uids.py
steffencruz Mar 12, 2024
6095539
Merge branch 'pre-staging' into features/change-uid-sampling
mccrindlebrian Mar 12, 2024
03c1df4
fix bug with brackets
mccrindlebrian Mar 12, 2024
9b490b4
remove partial kwargs
mccrindlebrian Mar 12, 2024
326a910
Merge pull request #142 from opentensor/features/change-uid-sampling
steffencruz Mar 12, 2024
f879b48
Merge branch 'pre-staging' into features/streaming
mccrindlebrian Mar 12, 2024
720ec70
Apply suggestions from code review
mccrindlebrian Mar 12, 2024
439fb15
remove depreciated miners
mccrindlebrian Mar 12, 2024
e6359ff
Merge branch 'features/streaming' of github.com:opentensor/prompting …
mccrindlebrian Mar 12, 2024
91330cf
Merge pull request #103 from opentensor/features/streaming
steffencruz Mar 12, 2024
d8cbc16
remove miner from init
mccrindlebrian Mar 12, 2024
3c17dc7
Merge pull request #155 from opentensor/hotfix/remove-tool-miner
mccrindlebrian Mar 12, 2024
de1f84c
add torch type to load_pipeline
mccrindlebrian Mar 12, 2024
7395643
add torch type to load_pipeline in miner
mccrindlebrian Mar 12, 2024
94b80c5
Merge pull request #156 from opentensor/hotfix/add-torch-type
mccrindlebrian Mar 12, 2024
aebea20
merge prestaging into vllm
mccrindlebrian Mar 14, 2024
51660a6
fix test_tasks
mccrindlebrian Mar 14, 2024
6e72729
fix broken mockpipeline
mccrindlebrian Mar 14, 2024
28d3984
add customstreamiterator and remove docs
mccrindlebrian Mar 14, 2024
cfa4014
remove deps in openai
mccrindlebrian Mar 14, 2024
3d9eaef
remove old miners
mccrindlebrian Mar 14, 2024
ef221ae
add return streamer back into load_hf_pipeline for mocking
mccrindlebrian Mar 14, 2024
438fd93
black
mccrindlebrian Mar 14, 2024
48c8363
add vllm section
mccrindlebrian Mar 14, 2024
1324eeb
merge pre-staging into vllm streaming
mccrindlebrian Mar 14, 2024
663ed07
adds asyncio semaphore
p-ferreira Mar 14, 2024
f53427a
change docs for axon port
mccrindlebrian Mar 16, 2024
5e3e4bc
Merge pull request #161 from opentensor/hotfix/readme
steffencruz Mar 16, 2024
cb79147
remove return streamer from vllm
mccrindlebrian Mar 18, 2024
a245e02
test code for semaphore
mccrindlebrian Mar 18, 2024
6462df8
refactors streaming miner with asyncio loop approach
p-ferreira Mar 20, 2024
ea37a31
adds custom simple_hf miner with pseudo streaming
p-ferreira Mar 21, 2024
96a155a
overall logging adjustments
p-ferreira Mar 21, 2024
e8a92dd
Merge branch 'main' into pre-staging
p-ferreira Mar 21, 2024
6036fae
Merge branch 'pre-staging' into features/thread_lock_stream
p-ferreira Mar 21, 2024
314544c
fix merge conflicts
p-ferreira Mar 21, 2024
a2404e6
minor updates
p-ferreira Mar 21, 2024
53b60c1
merge pre-staging into vllm streaming
mccrindlebrian Mar 21, 2024
76dc85f
Merge branch 'features/thread_lock_stream' into features/vllm-streaming
p-ferreira Mar 21, 2024
6f7280f
improves stream encapsulation
p-ferreira Mar 21, 2024
915593a
blacks the repo
p-ferreira Mar 21, 2024
7764676
Merge branch 'features/thread_lock_stream' into features/vllm-streaming
p-ferreira Mar 21, 2024
4b63675
Merge pull request #159 from opentensor/features/vllm-streaming
p-ferreira Mar 21, 2024
7af0793
pipeline adjustments
p-ferreira Mar 21, 2024
856675c
precommit-hook-test
p-ferreira Mar 21, 2024
cfed494
black
mccrindlebrian Mar 21, 2024
c8c47bb
Merge pull request #171 from opentensor/hotfix/black
p-ferreira Mar 21, 2024
0e9d53f
drops debug from repo
p-ferreira Mar 21, 2024
0a2084b
Merge pull request #170 from opentensor/features/thread_lock_stream
p-ferreira Mar 21, 2024
10cd731
merge main into prestaing
mccrindlebrian Mar 27, 2024
1c3de3d
adapts forward test to streaming + bugfix on mock metagraph
p-ferreira Mar 28, 2024
5372f27
remove unneeded parsing in protocol
mccrindlebrian Mar 28, 2024
e103c2b
Remove <|im_start|> with other roles
bkb2135 Apr 2, 2024
b6499f6
Update all_cleaners.py
bkb2135 Apr 2, 2024
40ff2f7
Update all_cleaners.py
bkb2135 Apr 2, 2024
c5fb529
Update all_cleaners.py
bkb2135 Apr 2, 2024
8e84844
Update all_cleaners.py
bkb2135 Apr 2, 2024
123070e
Merge pull request #184 from opentensor/bkb2135-patch-1
mccrindlebrian Apr 2, 2024
6331f30
quick dirty fix on forward
p-ferreira Apr 2, 2024
7b3d082
Filter CS Math Questions
bkb2135 Apr 3, 2024
8233c45
first attempt on parallelizing stream calls
p-ferreira Apr 3, 2024
601b6b7
Merge pull request #186 from opentensor/hotfix/reduce-cs-hallucinations
mccrindlebrian Apr 3, 2024
411b932
change timing logs from dendrite process_time to be timeout
mccrindlebrian Apr 3, 2024
718daae
Merge pull request #187 from opentensor/fix-dendrite-timing
mccrindlebrian Apr 3, 2024
5f463aa
brians refactoring on dendrite
p-ferreira Apr 3, 2024
f826d2b
parallelize streaming with results wrapper
p-ferreira Apr 3, 2024
778620e
Merge branch 'pre-staging' into from-main
p-ferreira Apr 3, 2024
fa29bd4
updates versioning
p-ferreira Apr 3, 2024
f022a0a
adds safe failure clause on forward
p-ferreira Apr 3, 2024
9551663
adapts unit test + fix typo
p-ferreira Apr 4, 2024
65354fe
add doc strings and typing
mccrindlebrian Apr 4, 2024
18f7496
adds timeout on forward pass
p-ferreira Apr 4, 2024
195d011
runs black on validator code
p-ferreira Apr 4, 2024
ddade62
Merge branch 'staging' into pre-staging
p-ferreira Apr 4, 2024
55c0f2f
Merge branch 'pre-staging' into from-main
p-ferreira Apr 4, 2024
802eaad
replace readme for streaming, simplify
mccrindlebrian Apr 4, 2024
83ad35b
fix variable missing from merge
p-ferreira Apr 4, 2024
c5277ed
Merge pull request #190 from opentensor/features/improve-readme-for-s…
mccrindlebrian Apr 4, 2024
ec10276
Merge pull request #182 from opentensor/from-main
mccrindlebrian Apr 4, 2024
0da2f4b
Merge pull request #136 from opentensor/pre-staging
p-ferreira Apr 4, 2024
3 changes: 3 additions & 0 deletions .gitignore
@@ -2,6 +2,9 @@
__pycache__/
*.py[cod]
*$py.class
.DS_Store
**/.DS_Store


# C extensions
*.so
182 changes: 38 additions & 144 deletions README.md
@@ -19,110 +19,13 @@ This repository is the **official codebase for Bittensor Subnet 1 (SN1) v1.0.0+,

# Introduction

This repo defines an incentive mechanism to create a distributed conversational AI.
This repo defines an incentive mechanism to create a distributed conversational AI for Subnet 1 (SN1).

Validators and miners are based on large language models (LLM). The [validation process](#validation) uses **[internet-scale datasets](#tools)** and **[goal-driven](#tasks)** behaviour to drive **[human-like conversations](#agents)**.
Validators and miners are based on large language models (LLM). The validation process uses **internet-scale datasets and goal-driven behaviour to drive human-like conversations**.

</div>

# Compute Requirements

1. To run a **validator**, you will need at least 24GB of VRAM.
2. To run the default Zephyr **miner**, you will need at least 18GB of VRAM.

</div>

# Validation
The design of the network's incentive mechanism is based on two important requirements:

### 1. Validation should mimic human interactions

It is imperative that the validation process engages with miners in the same way as real users. The reasons for this are as follows:
- Miners will compete and continuously improve at performing the validation task(s), and so this fine tuning should be aligned with the goals of the subnet.
- It should not be possible to distinguish between validation and API client queries so that miners always serve requests (even when they do not receive emissions for doing so).

In the context of this subnet, miners are required to be intelligent AI assistants that provide helpful and correct responses to a range of queries.

### 2. Reward models should mimic human preferences

In our experience, we have found that it is tricky to evaluate whether miner responses are high quality. Existing methods typically rely on using LLMs to score completions given a prompt, but this is often exploited and gives rise to many adversarial strategies.

In the present version, the validator produces one or more **reference** answers which all miner responses are compared to. Those which are most similar to the reference answer will attain the highest rewards and ultimately gain the most incentive.

**We presently use a combination of string literal similarity and semantic similarity as the basis for rewarding.**

# Tools
Contexts, which are the basis of conversations, are from external APIs (which we call tools) which ensure that conversations remain grounded in factuality. Contexts are also used to obtain ground-truth answers.

Currently, the tooling stack includes:
1. Wikipedia API
2. StackOverflow
3. mathgenerator

More tooling will be included in future releases.

# Tasks
The validation process supports an ever-growing number of tasks. Tasks drive agent behaviour based on specific goals, such as;
- Question answering
- Summarization
- Code debugging
- Mathematics
and more.

Tasks contain a **query** (basic question/problem) and a **reference** (ideal answer), where a downstream HumanAgent creates a more nuanced version of the **query**.

# Agents

In order to mimic human interactions, validators participate in a roleplaying game where they take on the persona of **random** human users. Equipped with this persona and a task, validators prompt miners in a style and tone that is similar to humans and drive the conversation in order to reach a pre-defined goal. We refer to these prompts as **challenges**.

Challenges are based on the query by wrapping the query in an agent persona which results in a lossy "one-way" function. This results in challenges that are overall more interesting, and less predictable.

The [diagram below](#validation-diagram) illustrates the validation flow.

#### Our approach innovatively transforms straightforward queries into complex challenges, a process akin to a 'hash function', requiring advanced NLP for resolution. This transformation is crucial for preventing simple lookups in source documents, ensuring that responses necessitate authentic analytical effort.


# Validation Diagram
![sn1 overview](assets/sn1-overview.png)

# Running Validators
These validators are designed to run and update themselves automatically. To run a validator, follow these steps:

1. Install this repository, you can do so by following the steps outlined in [the installation section](#installation).
2. Install [Weights and Biases](https://docs.wandb.ai/quickstart) and run `wandb login` within this repository. This will initialize Weights and Biases, enabling you to view KPIs and Metrics on your validator. (Strongly recommended to help the network improve from data sharing)
3. Install [PM2](https://pm2.io/docs/runtime/guide/installation/) and the [`jq` package](https://jqlang.github.io/jq/) on your system.
**On Linux**:
```bash
sudo apt update && sudo apt install jq && sudo apt install npm && sudo npm install pm2 -g && pm2 update
```
**On Mac OS**
```bash
brew update && brew install jq && brew install npm && sudo npm install pm2 -g && pm2 update
```
4. Run the `run.sh` script which will handle running your validator and pulling the latest updates as they are issued.
```bash
pm2 start run.sh --name s1_validator_autoupdate -- --wallet.name <your-wallet-name> --wallet.hotkey <your-wallet-hot-key>
```

This will run **two** PM2 processes: one for the validator which is called `s1_validator_main_process` by default (you can change this in `run.sh`), and one for the run.sh script (in step 4, we named it `s1_validator_autoupdate`). The script will check for updates every 30 minutes, if there is an update then it will pull it, install it, restart `s1_validator_main_process` and then restart itself.


> Important: vLLM currently faces a [notable limitation](https://github.com/vllm-project/vllm/issues/3012) in designating a specific GPU for model execution via code. Consequently, to employ a particular CUDA device for your model's operations, it's necessary to manually adjust your environment variable `CUDA_VISIBLE_DEVICES`. For instance, setting `export CUDA_VISIBLE_DEVICES=1,2` will explicitly define the CUDA devices available for use.



# Available Miners

Miners are scored based on the similarity between their completions and the reference answer. Furthermore, they should utilize the same API tools as the validators in order to be able to closely reproduce the reference answer. We currently provide the following miners out-of-the-box:
1. [Zephyr 7B](https://huggingface.co/HuggingFaceH4/zephyr-7b-beta)
2. [OpenAI](https://platform.openai.com/docs/introduction) (GPT variants)
3. wiki-agent ([GPT ReAct agent with langchain](https://python.langchain.com/docs/modules/agents/agent_types/react))


</div>

---

# Installation
This repository requires python3.8 or higher. To install it, simply clone this repository and run the [install.sh](./install.sh) script.
```bash
@@ -131,70 +34,61 @@ cd prompting
bash install.sh
```

Alternatively, if you are running on a clean Ubuntu machine, you can run `python scripts/setup_ubuntu_machine.sh` to effortlessly install everything you need. If you want to run an OpenAI miner, you will need to place your OpenAI API key in the `OPENAI_API_KEY` variable in the script.

</div>

---
# Running

We encourage miners to use testnet as this gives you a risk-free playground before running on mainnet. If you require test tao, please reach out to [email protected]
# Compute Requirements

Prior to running a miner or validator, you must [create a wallet](https://github.com/opentensor/docs/blob/main/reference/btcli.md) and [register the wallet to a netuid](https://github.com/opentensor/docs/blob/main/subnetworks/registration.md). Once you have done so, you can run the miner and validator with the following commands.
1. To run a **validator**, you will need at least 24GB of VRAM.
2. To run the default huggingface **miner**, you will need at least 18GB of VRAM.

For miners and validators running on mainnet we **strongly encourage** you to use a [local subtensor](https://github.com/opentensor/subtensor).
</div>

# How to Run
You can use the following command to run a miner or a validator.

```bash
# To run the validator
python neurons/validator.py
python <SCRIPT_PATH>
--netuid 1
--subtensor.network <finney/local/test>
--neuron.device cuda
--wallet.name <your validator wallet> # Must be created using the bittensor-cli
--wallet.hotkey <your validator hotkey> # Must be created using the bittensor-cli
--logging.debug # Run in debug mode, alternatively --logging.trace for trace mode

```

```bash
# To run the miner
python neurons/miners/BASE_MINER/miner.py
--netuid 1
--subtensor.network <finney/local/test>
--wallet.name <your miner wallet> # Must be created using the bittensor-cli
--wallet.hotkey <your validator hotkey> # Must be created using the bittensor-cli
--wallet.name <your wallet> # Must be created using the bittensor-cli
--wallet.hotkey <your hotkey> # Must be created using the bittensor-cli
--logging.debug # Run in debug mode, alternatively --logging.trace for trace mode
--axon.port #VERY IMPORTANT: set the port to be one of the open TCP ports on your machine
```
where `BASE_MINER` is [zephyr](https://huggingface.co/HuggingFaceH4/zephyr-7b-beta), which is a fine-tuned Mistral-7B, however you can choose any of the supplied models found in `neurons/miners`.

</div>
where `SCRIPT_PATH` is one of:
1. neurons/miners/huggingface/miner.py
2. neurons/miners/openai/miner.py
3. neurons/validator.py

---
For ease of use, you can also run the scripts with PM2. To install PM2:
**On Linux**:
```bash
sudo apt update && sudo apt install jq && sudo apt install npm && sudo npm install pm2 -g && pm2 update
```

Example of running a SOLAR miner:
```bash
pm2 start neurons/miners/huggingface/miner.py --interpreter python3 --name solar_miner -- --netuid 1 --subtensor.network finney --wallet.name my_wallet --wallet.hotkey m1 --neuron.model_id NousResearch/Nous-Hermes-2-SOLAR-10.7B --axon.port 21988 --logging.debug
```

# Testnet
We highly recommend that you run your miners on testnet before deploying on main. This will give you an opportunity to debug your systems, and ensure that you will not lose valuable immunity time. The SN1 testnet is **netuid 61**.

In order to run on testnet, you will need to go through the same hotkey registration procedure as on main, but using **test tao**. You will need to ask for some in the community discord if you do not have any.

To run:

```bash
pm2 start neurons/miners/huggingface/miner.py --interpreter python3 --name solar_miner -- --netuid 61 --subtensor.network test --wallet.name my_test_wallet --wallet.hotkey m1 --neuron.model_id NousResearch/Nous-Hermes-2-SOLAR-10.7B --axon.port 21988 --logging.debug
```

# Limitations
> Important: vLLM currently faces a [notable limitation](https://github.com/vllm-project/vllm/issues/3012) in designating a specific GPU for model execution via code. Consequently, to employ a particular CUDA device for your model's operations, it's necessary to manually adjust your environment variable `CUDA_VISIBLE_DEVICES`. For instance, setting `export CUDA_VISIBLE_DEVICES=1,2` will explicitly define the CUDA devices available for use.

# Resources
The architecture and methodology of SN1 is complex, and as such we have created a comprehensive resource to outline our design. Furthermore, we have strict requirements for how miners should interact with the network. Below are the currently available resources for additional information:

# Real-time monitoring with wandb integration

Check out real-time public logging by looking at the project [here](https://wandb.ai/opentensor-dev/alpha-validators)

## License
This repository is licensed under the MIT License.
```text
# The MIT License (MIT)
# Copyright © 2024 Yuma Rao

# Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated
# documentation files (the “Software”), to deal in the Software without restriction, including without limitation
# the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software,
# and to permit persons to whom the Software is furnished to do so, subject to the following conditions:

# The above copyright notice and this permission notice shall be included in all copies or substantial portions of
# the Software.

# THE SOFTWARE IS PROVIDED “AS IS”, WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO
# THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL
# THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION
# OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER
# DEALINGS IN THE SOFTWARE.
```
53 changes: 53 additions & 0 deletions docs/SN1_validation.md
@@ -0,0 +1,53 @@

# Validation
The design of the network's incentive mechanism is based on two important requirements:

### 1. Validation should mimic human interactions

It is imperative that the validation process engages with miners in the same way as real users. The reasons for this are as follows:
- Miners will compete and continuously improve at performing the validation task(s), and so this fine tuning should be aligned with the goals of the subnet.
- It should not be possible to distinguish between validation and API client queries so that miners always serve requests (even when they do not receive emissions for doing so).

In the context of this subnet, miners are required to be intelligent AI assistants that provide helpful and correct responses to a range of queries.

### 2. Reward models should mimic human preferences

In our experience, we have found that it is tricky to evaluate whether miner responses are high quality. Existing methods typically rely on using LLMs to score completions given a prompt, but this is often exploited and gives rise to many adversarial strategies.

In the present version, the validator produces one or more **reference** answers which all miner responses are compared to. Those which are most similar to the reference answer will attain the highest rewards and ultimately gain the most incentive.

**We presently use a combination of string literal similarity and semantic similarity as the basis for rewarding.**
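
As a rough illustration of how these two signals might be blended, here is a minimal sketch; the `difflib`/`sentence-transformers` choices, the embedding model, and the weights are assumptions for illustration, not the subnet's actual reward implementation.

```python
# Minimal sketch: blend string-literal and semantic similarity into one reward.
# The helper names, the embedding model, and the weights below are illustrative
# assumptions, not SN1's actual reward stack.
from difflib import SequenceMatcher

from sentence_transformers import SentenceTransformer, util

embedder = SentenceTransformer("all-MiniLM-L6-v2")  # assumed embedding model


def string_similarity(reference: str, completion: str) -> float:
    """Literal character-level overlap in [0, 1]."""
    return SequenceMatcher(None, reference, completion).ratio()


def semantic_similarity(reference: str, completion: str) -> float:
    """Cosine similarity of sentence embeddings, clipped to [0, 1]."""
    ref_emb, comp_emb = embedder.encode([reference, completion], convert_to_tensor=True)
    return max(0.0, float(util.cos_sim(ref_emb, comp_emb)))


def reward(reference: str, completion: str, w_literal: float = 0.3, w_semantic: float = 0.7) -> float:
    """Weighted blend of both signals (weights are illustrative)."""
    return w_literal * string_similarity(reference, completion) + w_semantic * semantic_similarity(
        reference, completion
    )
```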

# Tools
Contexts, which are the basis of conversations, are drawn from external APIs (which we call tools) that ensure conversations remain grounded in factuality. Contexts are also used to obtain ground-truth answers.

Currently, the tooling stack includes:
1. Wikipedia API
2. StackOverflow
3. mathgenerator

More tooling will be included in future releases.
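
For illustration, a Wikipedia-backed tool might look roughly like the sketch below; the `wikipedia` package usage, the `Context` shape, and the function name are assumptions, not the repository's actual tool interface.

```python
# Sketch of a Wikipedia-backed tool that produces a grounded context.
# The `wikipedia` package usage and the Context shape are illustrative
# assumptions; the repository's actual tool interface may differ.
from dataclasses import dataclass

import wikipedia


@dataclass
class Context:
    title: str
    text: str
    source: str = "wikipedia"


def fetch_wikipedia_context(topic: str, sentences: int = 5) -> Context:
    """Fetch a short, factual summary to ground task and reference generation."""
    page = wikipedia.page(topic, auto_suggest=False)
    summary = wikipedia.summary(topic, sentences=sentences, auto_suggest=False)
    return Context(title=page.title, text=summary)


# Example usage (network access required):
# ctx = fetch_wikipedia_context("Bittensor")
# print(ctx.title, ctx.text[:100])
```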

# Tasks
The validation process supports an ever-growing number of tasks. Tasks drive agent behaviour based on specific goals, such as:
- Question answering
- Summarization
- Code debugging
- Mathematics
and more.

Tasks contain a **query** (basic question/problem) and a **reference** (ideal answer), where a downstream HumanAgent creates a more nuanced version of the **query**.
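
A minimal sketch of that query/reference pairing is shown below; the field names are illustrative assumptions rather than the repository's actual task classes.

```python
# Illustrative shape of a task: a basic query plus the reference (ideal answer)
# that miner completions are scored against. Field names are assumptions, not
# the repository's actual task classes.
from dataclasses import dataclass, field


@dataclass
class Task:
    name: str          # e.g. "question-answering", "summarization", "math"
    query: str         # basic question/problem handed to the agent
    reference: str     # ground-truth answer derived from the tool context
    topic: str = ""    # optional: the context topic the task was built from
    tags: list = field(default_factory=list)


qa_task = Task(
    name="question-answering",
    query="When was the subject of the article founded?",
    reference="It was founded in 1998.",
    topic="Example topic",
)
```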

# Agents

In order to mimic human interactions, validators participate in a roleplaying game where they take on the persona of **random** human users. Equipped with this persona and a task, validators prompt miners in a style and tone that is similar to humans and drive the conversation in order to reach a pre-defined goal. We refer to these prompts as **challenges**.

Challenges are based on the query by wrapping the query in an agent persona which results in a lossy "one-way" function. This results in challenges that are overall more interesting, and less predictable.

The [diagram below](#validation-diagram) illustrates the validation flow.

#### Our approach innovatively transforms straightforward queries into complex challenges, a process akin to a 'hash function', requiring advanced NLP for resolution. This transformation is crucial for preventing simple lookups in source documents, ensuring that responses necessitate authentic analytical effort.
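
A rough sketch of this persona-wrapping step is shown below; the persona fields and the prompt template are illustrative assumptions, not the actual agent implementation.

```python
# Sketch of wrapping a task query in a random persona to form a "challenge"
# prompt. The Persona fields and the template are illustrative assumptions,
# not the actual agent implementation.
import random
from dataclasses import dataclass


@dataclass
class Persona:
    mood: str
    tone: str


PERSONAS = [
    Persona(mood="curious", tone="casual"),
    Persona(mood="impatient", tone="terse"),
    Persona(mood="confused", tone="rambling"),
]


def challenge_prompt(query: str) -> str:
    """Build the prompt a validator LLM could use to rewrite the query as a challenge.

    Wrapping the query in a persona is deliberately lossy ("one-way"): miners
    only ever see the resulting challenge, never the raw query or source document.
    """
    persona = random.choice(PERSONAS)
    return (
        f"Roleplay a {persona.mood} human user writing in a {persona.tone} tone. "
        "Rewrite the following request in your own words, without quoting it verbatim:\n\n"
        f"{query}"
    )
```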


# Validation Diagram
![sn1 overview](../assets/sn1-overview.png)