- Functions tag_table_with_age, tag_table_period_length, tag_table_by_type
- Quartiles computed from plot_concepts_set does not depend on value selection anymore
- Pyarrow fix now work on spark executors.
- Fix OMOP _date columns issue
- omop teva module
- Support for pyarrow > 0.17.0
- biology module refacto
- load_koalas() not by default in init.py but called in the improve_performance function
- adding app_name in improve_performances to facilitate app monitoring
- Generation of an inclusion/exclusion flowchart in plotting
- improve_performance moved from init.py to io/improve_performance.py file
- Caching in spark instead of koalas to improve speed
- Module
event_sequences
to visualize individual sequences of events. - Module
age_pyramid
to quickly visualize the age and gender distributions in a cohort.
- BaseData class as a parent class for HiveData, PandasData and PostgresData.
- Phentyping class with 4 implemented phenotyes.
- Custom logger to display useful information during computation.
- Add caching to speedup computations.
- Updated method to persist tables as parquet locally, with a support for ORC-stored I2B2 database.
- Allow saving DB locally in client or cluster mode.
- Add data cleaning function to handle incorrect datetime in spark.
- Filter biology config on care site.
- Adding person-dependent
datetime_ref
toplot_age_pyramid
.
- Consultations date for OMOP & I2B2
- New BackendDispatcher to handle framework-specific functions
- I2B2 to OMOP connector
- Adding CITATION.cff
- Using
mike
as a documentation provider
- Correct build to PyPI
- Renaming from
EDS-Scikit
toeds-scikit
- Various project metadata
- Full CI pipeline
- License checker in CI
- BackendDispatcher object to help with pandas / koalas manipulation
- Broken links in documentation and badges
- Initial commit to GitHub