Covid-19 Per Capita New Cases in Dash

Dash app showing Covid new cases by state/county.

This app implements per-capita normalized plots of linear Covid-19 case growth, using data from the New York Times. It is running live on my personal website.
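The core transformation is simple: take cumulative case counts, difference them into daily new cases, and divide by population. A minimal sketch with pandas is below; the column names (`state`, `date`, `cases`, `population`) are assumptions for illustration, not the exact schema the app uses.

```python
import pandas as pd

def per_capita_new_cases(df: pd.DataFrame, per: int = 100_000) -> pd.DataFrame:
    """Convert cumulative case counts into per-capita daily new cases."""
    df = df.sort_values(["state", "date"])
    # daily new cases from the cumulative totals; clip handles downward corrections
    df["new_cases"] = df.groupby("state")["cases"].diff().clip(lower=0)
    df["new_cases_per_capita"] = df["new_cases"] / df["population"] * per
    return df
```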

However, it runs on a very small DigitalOcean droplet, so reducing memory usage has proven important.

The download_and_merge.py script checks the most recent commit time of the data in the NY Times repository, then downloads and processes the files. The repository is checked hourly. The state data file is relatively small, so it is processed with pandas, with all the population data merged at once, before being uploaded to sqlite3 with SQLAlchemy.
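A rough sketch of that state-data path is below. The commits endpoint is the real GitHub API, but the file paths, local population file, and table name are assumptions rather than the exact ones download_and_merge.py uses.

```python
import pandas as pd
import requests
from sqlalchemy import create_engine

COMMITS_URL = "https://api.github.com/repos/nytimes/covid-19-data/commits"

def latest_commit_time(path="us-states.csv"):
    """Return the timestamp of the most recent commit touching the data file."""
    resp = requests.get(COMMITS_URL, params={"path": path, "per_page": 1})
    resp.raise_for_status()
    return resp.json()[0]["commit"]["committer"]["date"]

def update_state_table(engine_url="sqlite:///covid.db"):
    """Download the state data, merge in population, and write it to sqlite."""
    states = pd.read_csv(
        "https://raw.githubusercontent.com/nytimes/covid-19-data/master/us-states.csv"
    )
    population = pd.read_csv("state_population.csv")  # hypothetical local file
    merged = states.merge(population, on="state", how="left")
    engine = create_engine(engine_url)
    merged.to_sql("state_data", engine, if_exists="replace", index=False)
```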

The county data is much larger, and the pandas merges are more memory intensive, at least relative to the memory available on my virtual server. I do some light processing on the raw csv file line by line in Python, then use the sqlite3 command line utility to load the csv file efficiently into the sqlite database with county_processing.sql. This is much faster than pandas + SQLAlchemy, and uses less memory and less CPU.
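The sketch below shows the general shape of that approach, assuming the cleaned csv is bulk-loaded by piping county_processing.sql into the sqlite3 CLI. The per-line filter is a placeholder; the real processing and the contents of county_processing.sql are the project's own.

```python
import subprocess

def preprocess_counties(src="us-counties.csv", dest="us-counties-clean.csv"):
    """Stream the raw csv line by line so memory stays flat regardless of size."""
    with open(src) as infile, open(dest, "w") as outfile:
        outfile.write(infile.readline())  # copy the header row
        for line in infile:
            # placeholder cleanup step, e.g. dropping rows without a usable county
            if ",Unknown," not in line:
                outfile.write(line)

def load_counties(db="covid.db", script="county_processing.sql"):
    """Let the sqlite3 CLI bulk-load the cleaned csv via the SQL script."""
    with open(script) as f:
        subprocess.run(["sqlite3", db], stdin=f, check=True)
```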

sqlite3 then joins the county data with the county population and stores a new enhanced table, so no join is required at runtime. The GitHub API is used to check the NY Times repository every hour, and the database is only updated if the files have changed.
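Precomputing the join looks roughly like the sketch below; the table and column names are assumptions, and the real statement lives in county_processing.sql.

```python
import sqlite3

JOIN_SQL = """
DROP TABLE IF EXISTS county_enhanced;
CREATE TABLE county_enhanced AS
SELECT c.*, p.population
FROM county_data AS c
JOIN county_population AS p
  ON c.fips = p.fips;
"""

def build_enhanced_table(db="covid.db"):
    """Materialize the county/population join so queries at runtime need no join."""
    with sqlite3.connect(db) as conn:
        conn.executescript(JOIN_SQL)
```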

The rest_api.py file contains a simple Flask app that serves the sqlite data over a REST API, returning JSON data. This is used to share the data with a Streamlit version of the app (source), running on Streamlit Cloud.
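A minimal sketch of the kind of endpoint rest_api.py could expose is below; the route, table, and column names are assumptions, not the actual API.

```python
import sqlite3

from flask import Flask, jsonify

app = Flask(__name__)
DB_PATH = "covid.db"

@app.route("/states/<state>")
def state_data(state):
    """Return the per-capita case history for one state as JSON."""
    with sqlite3.connect(DB_PATH) as conn:
        conn.row_factory = sqlite3.Row
        rows = conn.execute(
            "SELECT date, cases, new_cases_per_capita FROM state_data WHERE state = ?",
            (state,),
        ).fetchall()
    return jsonify([dict(row) for row in rows])
```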