Datasets

Open-source datasets developed by HALE Lab for advancing research in healthcare analytics, natural language processing, and medical artificial intelligence.

Healthcare Text & NLP Datasets

Dataset of trash objects for waste classification and detection. It contains about 17785 waste object images divided into seven classes (glass, plastic, metal, e-waste, cardboard, paper, medical waste), which are further subdivided into several sub-classes.

Dataset Usage Guidelines

Attribution: Please cite our relevant publications when using these datasets in your research.

License: Most datasets are released under Creative Commons or MIT licenses. Check individual dataset repositories for specific terms.

Ethics: All datasets have been anonymized and comply with healthcare data privacy regulations. Use responsibly for research purposes.