This script processes data from an Azure Blob Storage container and uploads the results to another container.
- Connects to Azure Blob Storage using a connection string
- Downloads a CSV file ('iris.csv') from the 'raw' container
- Loads the CSV data into a pandas DataFrame
- Filters the DataFrame to include only 'setosa' species
- Uploads the filtered data as a new CSV file ('iris_setosa.csv') to the 'out' container
- Azure Storage Blob library
- pandas library
Ensure the connection string and container/blob names are correctly set before running the script.
Handle the connection string securely and do not expose it in public repositories.
generated with perplexity.ai with the following prompt:
generate a very short and simple doc for this script to be kept as markdown in the repo: manipulate_csv.py