Python HOW: Run Prodigy with a Remote MySQL Database using Docker

Gabriel Harris Ph.D.
3 min readJun 15, 2021

Learn how to run Prodigy with a connection to a remote database as a Docker container. This would allow you to deploy it to any Web App service with ease (e.g., Azure Web App Service or AWS Beanstalk)

source: author

Prerequisites

This article assumes that you have already migrated the default Prodigy database prodigy.db schema to a remote MySQL server. If you have not done this already, this is described in detail in 👉 article

Project structure 👷🏻

Your project structure should look like this once you are done:

For our demo, we will use the news_headlines dataset provided by Prodigy. Open the link and copy the data into data/dataset.jsonl

Copy your proprietary prodigy wheel to wheel. Make sure it’s the Linux version

prodigy.json 📜

By default, prodigy looks for its configuration in a global configuration file $HOME/.prodigy/prodigy.json. It also checks for a local…

--

--

Gabriel Harris Ph.D.

I’m an End-to-End Lead Data Scientist and Data Science Manager. My articles are love letters to my future self