Python HOW: Run Prodigy with a Remote MySQL Database using Docker
Learn how to run Prodigy in a Docker container connected to a remote database. This lets you deploy it to any web app service with ease (e.g., Azure App Service or AWS Elastic Beanstalk).
Prerequisites
This article assumes that you have already migrated the default Prodigy database (prodigy.db) schema to a remote MySQL server. If you have not done so yet, the process is described in detail in 👉 article
Project structure 👷🏻
Your project structure should look like this once you are done:
For our demo, we will use the news_headlines dataset provided by Prodigy. Open the link and copy the data into data/dataset.jsonl.
Copy your proprietary Prodigy wheel to the wheel folder. Make sure it’s the Linux version.
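Since the dataset is copied by hand, a quick sanity check of data/dataset.jsonl can catch a broken line before it ends up inside the Docker image. The snippet below is only a minimal sketch: the function name count_jsonl_records is made up for this article, and it assumes every record is a JSON object with a "text" key, which is how the news_headlines examples appear to be shaped. Adjust required_key if your data differs.

```python
import json
from pathlib import Path


def count_jsonl_records(path, required_key="text"):
    """Return the number of valid records in a JSONL file.

    Raises ValueError on the first line that is not a JSON object
    or that lacks `required_key` (an assumption about the data shape).
    """
    count = 0
    with Path(path).open(encoding="utf-8") as handle:
        for line_number, line in enumerate(handle, start=1):
            line = line.strip()
            if not line:  # tolerate blank lines left over from copy-paste
                continue
            try:
                record = json.loads(line)
            except json.JSONDecodeError as error:
                raise ValueError(
                    f"line {line_number}: invalid JSON ({error})"
                ) from error
            if not isinstance(record, dict) or required_key not in record:
                raise ValueError(
                    f"line {line_number}: missing key {required_key!r}"
                )
            count += 1
    return count
```

Calling count_jsonl_records("data/dataset.jsonl") from the project root returns the number of records, or raises a ValueError that names the first offending line.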
prodigy.json 📜
By default, Prodigy looks for its configuration in a global configuration file, $HOME/.prodigy/prodigy.json. It also checks for a local…