Amazon Web Services (AWS) datasets: https://registry.opendata.aws/
Awesome Public Datasets: https://github.com/awesomedata/awesome-public-datasets
BigQuery Public Datasets: https://cloud.google.com/bigquery/public-data/
Bureau of Labor Statistics: https://www.bls.gov/data/
Eurostat (open data from the EU statistical office): https://ec.europa.eu/eurostat/data/database
Google Dataset Search: https://datasetsearch.research.google.com/
Harvard Dataverse: https://dataverse.harvard.edu/
Kaggle Datasets: https://www.kaggle.com/datasets
Large Health Data Sets: https://www.ehdp.com/links/datasets.htm
UC Irvine Machine Learning Repository: http://archive.ics.uci.edu/ml/index.php
U.S. Government open data: https://www.data.gov/
World Bank Open Data: https://data.worldbank.org/
"Preprints are complete and public drafts of scientific documents, not yet certified by peer review. These documents ensure that the findings of the research community are widely disseminated, priorities of discoveries are established and they invite feedback and discussion to help improve the work.
Certification by peer review is the key distinction between a preprint and an accepted author manuscript or published article. Many preprints are submitted to journals for publication, and as a result, subsequent versions of the paper may also be made available after peer review. Readers of preprints should be aware that any aspect of the research, including the results and conclusions, may change as a result of peer review (see PMC Disclaimer). Authors may also revise preprints and post updated versions to the preprint server." (NIH 2024, "Preprint Pilot")
To learn more about preprints available through PubMed, see the NIH Preprint Pilot.