Skip to content

skrub (previously dirty-cat) related dataset files. Includes script, raw datasets, etc.

Notifications You must be signed in to change notification settings

skrub-data/datasets

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

38 Commits
 
 
 
 
 
 

Repository files navigation

Datasets

Download and denormalization scripts for skrub datasets.

Contains also:

  • Correspondence table between KEN Embeddings and their figshare download ID[1].
  • Happiness score dataset from the World Happiness Report 2022[2].
  • Bike sharing dataset from the UCI Machine Learning Repository[3].

References

[1]
https://soda-inria.github.io/ken_embeddings/

[2]
Helliwell, J. F., Layard, R., Sachs, J. D., De Neve, J.-E., Aknin, L. B., & Wang, S. (Eds.). (2022). World Happiness Report 2022. New York: Sustainable Development Solutions Network.

[2]
Fanaee-T,Hadi. (2013). Bike Sharing. UCI Machine Learning Repository. https://doi.org/10.24432/C5W894.

About

skrub (previously dirty-cat) related dataset files. Includes script, raw datasets, etc.

Resources

Stars

Watchers

Forks

Languages