Aim is to build a system which is able to handle long running processes in a distributed fashion. It imports products from a CSV file and load it into a database. There are half a million product details to be imported into the database.
Technology- Python and MySQL
- Download the project using-
https://github.com/piyushdua50/large_file_processor.git - Place the
products.csvfile in src folder. - Setup the database.
- Login to your MySQL database
- Go to the src/database folder
- Execute the
SCHEMA.sqlscript in order to create and use the schema - Execute the
PRODUCTS.sqlin order the create the PRODUCTS table - Execute the
PRODUCT_COUNT_DTL.sqlin order to create the PRODUCT_COUNT_DTL table
- Now install the python dependencies
- pandas-
pip install pandas - os-sys-
pip install os-sys - mysql-connector-
pip install mysql-connector - python-dotenv-
pip install python-dotenv
- pandas-
- Now provide the database user, database password and database name in .env file.
- Execute this command-
python src/server.py