Basic modeling with Real-World Data.
Author: Zachary Levonian
A good entrypoint to this analyisis is the Jupyter notebook that explores the dataset or the Jupyter notebook that analyses the data by fitting multiple logistic regression models.
Otherwise, the directory layout is:
notebookcontains the analysis notebooks.srccontains thecovid_modelingPython package with helper functions and classes to support the analysis.testscontainspytesttests for thecovid_modelingpackage.datais presumed to be the location of the input data... see the Data section below for more details.figurescontains any images produced within the analysis notebooks.
The data is a representative but fabricated sample of research data provided by ConcertAI. I don't have permission to share it publicly. Note that all PII is random in the data (e.g. names and addresses are random).
The structure of the data is visible in the analysis notebooks, and a sample is contained in the test resources (tests/resources/).
This repository uses poetry as its package manager, coordinated by make.
To install poetry and needed dependencies, run make install.
To run tests, run make test.
A few useful commands:
poetry run <command>- Run the given command, e.g.poetry run pytestinvokes the tests.source $(poetry env info --path)/bin/activate- An alternative topoetry shellthat's less buggy in conda environments.
