公开的数据集汇总(持续更新中...)

合适的数据集对于深层神经网络的训练至关重要,今天我们一起来看看现在已经公开的数据集下载汇总,本文中的内容来源于网络。主要是方便自己以后学习工作中使用,本数据集定期更新。

Table of Contents

Agriculture
Biology
Climate+Weather
ComplexNetworks
ComputerNetworks
DataChallenges
EarthScience
Economics
Education
Energy
Finance
GIS
Government
Healthcare
ImageProcessing
MachineLearning
Museums
NaturalLanguage
Neuroscience
Physics
ProstateCancer
Psychology+Cognition
PublicDomains
SearchEngines
SocialNetworks
SocialSciences
Software
Sports
TimeSeries
Transportation
eSports
Complementary Collections
Agriculture
OK_ICON Hyperspectral benchmark dataset on soil moisture
OK_ICON U.S. Department of Agriculture's Nutrient Database
OK_ICON U.S. Department of Agriculture's PLANTS Database
Biology
FIXME_ICON 1000 Genomes [fixme]
OK_ICON American Gut (Microbiome Project)
OK_ICON Broad Bioimage Benchmark Collection (BBBC)
OK_ICON Broad Cancer Cell Line Encyclopedia (CCLE)
OK_ICON Cell Image Library
OK_ICON Complete Genomics Public Data
OK_ICON EBI ArrayExpress
OK_ICON EBI Protein Data Bank in Europe
OK_ICON ENCODE project
OK_ICON Electron Microscopy Pilot Image Archive (EMPIAR)
OK_ICON Ensembl Genomes
OK_ICON Gene Expression Omnibus (GEO)
OK_ICON Gene Ontology (GO) - GO annotation files
OK_ICON Global Biotic Interactions (GloBI)
OK_ICON Harvard Medical School (HMS) LINCS Project
OK_ICON Human Genome Diversity Project
OK_ICON Human Microbiome Project (HMP)
OK_ICON ICOS PSP Benchmark
OK_ICON International HapMap Project
FIXME_ICON Journal of Cell Biology DataViewer [fixme]
OK_ICON KEGG - KEGG is a database resource for understanding high-level functions [...]
OK_ICON MIT Cancer Genomics Data
OK_ICON NCBI Proteins
OK_ICON NCBI Taxonomy
OK_ICON NCI Genomic Data Commons
OK_ICON NIH Microarray data
OK_ICON OpenSNP genotypes data
OK_ICON Pathguid - Protein-Protein Interactions Catalog
OK_ICON Protein Data Bank
OK_ICON Psychiatric Genomics Consortium
OK_ICON PubChem Project
OK_ICON PubGene (now Coremine Medical)
OK_ICON Sanger Catalogue of Somatic Mutations in Cancer (COSMIC)
OK_ICON Sanger Genomics of Drug Sensitivity in Cancer Project (GDSC)
OK_ICON Sequence Read Archive(SRA)
OK_ICON Stanford Microarray Data
OK_ICON Stowers Institute Original Data Repository
OK_ICON Systems Science of Biological Dynamics (SSBD) Database
OK_ICON The Cancer Genome Atlas (TCGA), available via Broad GDAC
OK_ICON The Catalogue of Life
OK_ICON The Personal Genome Project
OK_ICON UCSC Public Data
OK_ICON UniGene
OK_ICON Universal Protein Resource (UnitProt)
Climate+Weather
OK_ICON Actuaries Climate Index
OK_ICON Australian Weather
OK_ICON Aviation Weather Center - Consistent, timely and accurate weather [...]
OK_ICON Brazilian Weather - Historical data (In Portuguese) - Data related to [...]
OK_ICON Canadian Meteorological Centre
OK_ICON Climate Data from UEA (updated monthly)
OK_ICON Dutch Weather - The KNMI Data Center (KDC) portal provides access to KNMI [...]
OK_ICON European Climate Assessment & Dataset
OK_ICON Global Climate Data Since 1929
OK_ICON NASA Global Imagery Browse Services
OK_ICON NOAA Bering Sea Climate
OK_ICON NOAA Climate Datasets
OK_ICON NOAA Realtime Weather Models
OK_ICON NOAA SURFRAD Meteorology and Radiation Datasets
OK_ICON The World Bank Open Data Resources for Climate Change
OK_ICON UEA Climatic Research Unit
OK_ICON WU Historical Weather Worldwide
OK_ICON WorldClim - Global Climate Data
ComplexNetworks
OK_ICON AMiner Citation Network Dataset
OK_ICON CrossRef DOI URLs
OK_ICON DBLP Citation dataset
OK_ICON DIMACS Road Networks Collection
OK_ICON NBER Patent Citations
OK_ICON NIST complex networks data collection
OK_ICON Network Repository with Interactive Exploratory Analysis Tools
OK_ICON Protein-protein interaction network
OK_ICON PyPI and Maven Dependency Network
OK_ICON Scopus Citation Database
OK_ICON Small Network Data
OK_ICON Stanford GraphBase
OK_ICON Stanford Large Network Dataset Collection
FIXME_ICON Stanford Longitudinal Network Data Sources [fixme]
OK_ICON The Koblenz Network Collection
OK_ICON The Laboratory for Web Algorithmics (UNIMI)
OK_ICON UCI Network Data Repository
OK_ICON UFL sparse matrix collection
FIXME_ICON WSU Graph Database [fixme]
ComputerNetworks
OK_ICON 3.5B Web Pages from CommonCrawl 2012
OK_ICON 53.5B Web clicks of 100K users in Indiana Univ.
OK_ICON CAIDA Internet Datasets
OK_ICON CRAWDAD Wireless datasets from Dartmouth Univ.
OK_ICON ClueWeb09 - 1B web pages
OK_ICON ClueWeb12 - 733M web pages
OK_ICON CommonCrawl Web Data over 7 years
OK_ICON Criteo click-through data
OK_ICON Internet-Wide Scan Data Repository
OK_ICON MIRAGE-2019 - MIRAGE-2019 is a human-generated dataset for mobile traffic [...]
OK_ICON OONI: Open Observatory of Network Interference - Internet censorship data
OK_ICON Open Mobile Data by MobiPerf
OK_ICON The Peer-to-Peer Trace Archive - Real-world measurements play a key role [...]
OK_ICON Rapid7 Sonar Internet Scans
OK_ICON UCSD Network Telescope, IPv4 /8 net
DataChallenges
OK_ICON Bruteforce Database
OK_ICON Challenges in Machine Learning
OK_ICON CrowdANALYTIX dataX
FIXME_ICON D4D Challenge of Orange [fixme]
OK_ICON DrivenData Competitions for Social Good
OK_ICON ICWSM Data Challenge (since 2009)
OK_ICON KDD Cup by Tencent 2012
OK_ICON Kaggle Competition Data
OK_ICON Localytics Data Visualization Challenge
OK_ICON Netflix Prize
OK_ICON Space Apps Challenge
OK_ICON Telecom Italia Big Data Challenge
OK_ICON TravisTorrent Dataset - MSR'2017 Mining Challenge
OK_ICON TunedIT - Data mining & machine learning data sets, algorithms, challenges
OK_ICON Yelp Dataset Challenge
EarthScience
OK_ICON 38-Cloud (Cloud Detection) - Contains 38 Landsat 8 scene images and their [...]
OK_ICON AQUASTAT - Global water resources and uses
OK_ICON BODC - marine data of ~22K vars
OK_ICON EOSDIS - NASA's earth observing system data
OK_ICON Earth Models
OK_ICON Integrated Marine Observing System (IMOS) - roughly 30TB of ocean measurements
OK_ICON Marinexplore - Open Oceanographic Data
OK_ICON Alabama Real-Time Coastal Observing System
OK_ICON National Estuarine Research Reserves System-Wide Monitoring Program - [...]
OK_ICON Oil and Gas Authority Open Data - The dataset covers 12,500 offshore [...]
OK_ICON Smithsonian Institution Global Volcano and Eruption Database
OK_ICON USGS Earthquake Archives
Economics
OK_ICON American Economic Association (AEA)
OK_ICON EconData from UMD
OK_ICON Economic Freedom of the World Data
OK_ICON Historical MacroEconomic Statistics
OK_ICON INFORUM - Interindustry Forecasting at the University of Maryland
OK_ICON DBnomics – the world's economic database - Aggregates hundreds of [...]
OK_ICON International Trade Statistics
OK_ICON Internet Product Code Database
OK_ICON Joint External Debt Data Hub
OK_ICON Jon Haveman International Trade Data Links
OK_ICON OpenCorporates Database of Companies in the World
OK_ICON Our World in Data
OK_ICON SciencesPo World Trade Gravity Datasets
OK_ICON The Atlas of Economic Complexity
OK_ICON The Center for International Data
OK_ICON The Observatory of Economic Complexity
FIXME_ICON UN Commodity Trade Statistics [fixme]
OK_ICON UN Human Development Reports
Education
OK_ICON College Scorecard Data
OK_ICON Student Data from Free Code Camp
Energy
OK_ICON AMPds
OK_ICON BLUEd
OK_ICON COMBED
OK_ICON ECO
OK_ICON EIA
OK_ICON Global Power Plant Database - The Global Power Plant Database is a [...]
OK_ICON HES - Household Electricity Study, UK
OK_ICON HFED
OK_ICON PLAID - The Plug Load Appliance Identification Dataset
OK_ICON REDD
OK_ICON Smart Meter Data Portal - The Smart Meter Data Portal is part of the [...]
OK_ICON Tracebase
FIXME_ICON UK-DALE - UK Domestic Appliance-Level Electricity [fixme]
OK_ICON WHITED
OK_ICON iAWE
Finance
OK_ICON Blockmodo Coin Registry - A registry of JSON formatted information files [...]
OK_ICON CBOE Futures Exchange
OK_ICON Google Finance
OK_ICON Google Trends
OK_ICON NASDAQ
OK_ICON NYSE Market Data
OK_ICON OANDA
FIXME_ICON OSU Financial data [fixme]
OK_ICON Quandl
OK_ICON St Louis Federal
OK_ICON Yahoo Finance
GIS
OK_ICON ArcGIS Open Data portal
OK_ICON Cambridge, MA, US, GIS data on GitHub
OK_ICON Factual Global Location Data
OK_ICON IEEE Geoscience and Remote Sensing Society DASE Website
OK_ICON Geo Maps - High Quality GeoJSON maps programmatically generated
OK_ICON Geo Spatial Data from ASU
OK_ICON Geo Wiki Project - Citizen-driven Environmental Monitoring
OK_ICON GeoFabrik - OSM data extracted to a variety of formats and areas
OK_ICON GeoNames Worldwide
OK_ICON Global Administrative Areas Database (GADM) - Geospatial data organized [...]
OK_ICON Homeland Infrastructure Foundation-Level Data
OK_ICON Landsat 8 on AWS
OK_ICON List of all countries in all languages
OK_ICON National Weather Service GIS Data Portal
OK_ICON Natural Earth - vectors and rasters of the world
OK_ICON OpenAddresses
OK_ICON OpenStreetMap (OSM)
OK_ICON Pleiades - Gazetteer and graph of ancient places
OK_ICON Reverse Geocoder using OSM data
OK_ICON Robin Wilson - Free GIS Datasets
OK_ICON TIGER/Line - U.S. boundaries and roads
OK_ICON TZ Timezones shapfiles
OK_ICON TwoFishes - Foursquare's coarse geocoder
OK_ICON UN Environmental Data
OK_ICON World boundaries from the U.S. Department of State
OK_ICON World countries in multiple formats
Government
OK_ICON Alberta, Province of Canada
OK_ICON Antwerp, Belgium
OK_ICON Argentina (non official)
OK_ICON Datos Argentina - Portal de datos abiertos de la República Argentina. [...]
OK_ICON Austin, TX, US
OK_ICON Australia (abs.gov.au)
OK_ICON Australia (data.gov.au)
OK_ICON Austria (data.gv.at)
OK_ICON Baton Rouge, LA, US
OK_ICON Beersheba, Israel - Open Data Portal (Smart7 OpenData)
OK_ICON Belgium
OK_ICON Brazil
OK_ICON Buenos Aires, Argentina
OK_ICON Calgary, AB, Canada
OK_ICON Cambridge, MA, US
OK_ICON Canada
OK_ICON Chicago
OK_ICON Chile
OK_ICON China
OK_ICON Dallas Open Data
OK_ICON DataBC - data from the Province of British Columbia
OK_ICON Denver Open Data
OK_ICON Durham, NC Open Data
OK_ICON Edmonton, AB, Canada
OK_ICON England LGInform
OK_ICON EuroStat
OK_ICON EveryPolitician - Ongoing project collating and sharing data on every [...]
OK_ICON Federal Committee on Statistical Methodology (FCSM) (formerly FedStats)
OK_ICON Finland
OK_ICON France
OK_ICON Fredericton, NB, Canada
OK_ICON Gatineau, QC, Canada
OK_ICON Germany
OK_ICON Ghent, Belgium
OK_ICON Glasgow, Scotland, UK
OK_ICON Greece
OK_ICON Guardian world governments
OK_ICON Halifax, NS, Canada
OK_ICON Helsinki Region, Finland
FIXME_ICON Hong Kong, China [fixme]
OK_ICON Houston, TX, US
OK_ICON Indian Government Data
OK_ICON Indonesian Data Portal
OK_ICON Ireland's Open Data Portal
OK_ICON Israel's Open Data Portal
OK_ICON Italy - Il Portale dati.gov.it è il catalogo nazionale dei metadati [...]
OK_ICON Japan
OK_ICON Laval, QC, Canada
OK_ICON Lexington, KY
OK_ICON London Datastore, UK
OK_ICON London, ON, Canada
OK_ICON Los Angeles Open Data
OK_ICON Luxembourg - Luxembourgish Open Data Portal
OK_ICON MassGIS, Massachusetts, U.S.
OK_ICON Metropolitain Transportation Commission (MTC), California, US
OK_ICON Mexico
OK_ICON Missisauga, ON, Canada
OK_ICON Moldova
OK_ICON Moncton, NB, Canada
OK_ICON Montreal, QC, Canada
OK_ICON Mountain View, California, US (GIS)
FIXME_ICON NYC Open Data [fixme]
OK_ICON NYC betanyc
OK_ICON Netherlands
OK_ICON New Zealand
OK_ICON OECD
OK_ICON Oakland, California, US
OK_ICON Oklahoma
OK_ICON Open Data for Africa
OK_ICON Open Government Data (OGD) Platform India
OK_ICON OpenDataSoft's list of 1,600 open data
OK_ICON Oregon
OK_ICON Ottawa, ON, Canada
OK_ICON Palo Alto, California, US
OK_ICON OpenDataPhilly - OpenDataPhilly is a catalog of open data in the [...]
OK_ICON Portland, Oregon
OK_ICON Portugal - Pordata organization
OK_ICON Puerto Rico Government
OK_ICON Quebec City, QC, Canada
OK_ICON Quebec Province of Canada
OK_ICON Regina SK, Canada
OK_ICON Rio de Janeiro, Brazil
OK_ICON Romania
OK_ICON Russia
OK_ICON San Diego, CA
FIXME_ICON San Antonio, TX - Community Information Now - CI:Now is a nonprofit [...] [fixme]
OK_ICON San Francisco Data sets
OK_ICON San Jose, California, US
OK_ICON San Mateo County, California, US
OK_ICON Saskatchewan, Province of Canada
OK_ICON Seattle
OK_ICON Singapore Government Data
OK_ICON South Africa Trade Statistics
OK_ICON South Africa
OK_ICON State of Utah, US
OK_ICON Switzerland
OK_ICON Taiwan gov
OK_ICON Taiwan
OK_ICON Tel-Aviv Open Data
OK_ICON Texas Open Data
FIXME_ICON The World Bank [fixme]
OK_ICON Toronto, ON, Canada
OK_ICON Tunisia
FIXME_ICON U.K. Government Data [fixme]
FIXME_ICON U.S. American Community Survey [fixme]
OK_ICON U.S. CDC Public Health datasets
OK_ICON U.S. Census Bureau
OK_ICON U.S. Department of Housing and Urban Development (HUD)
OK_ICON U.S. Federal Government Agencies
OK_ICON U.S. Federal Government Data Catalog
OK_ICON U.S. Food and Drug Administration (FDA)
OK_ICON U.S. National Center for Education Statistics (NCES)
OK_ICON U.S. Open Government
OK_ICON UK 2011 Census Open Atlas Project
OK_ICON U.S. Patent and Trademark Office (USPTO) Bulk Data Products
OK_ICON Uganda Bureau of Statistics
OK_ICON Ukraine
OK_ICON United Nations
FIXME_ICON Uruguay [fixme]
FIXME_ICON Valley Transportation Authority (VTA), California, US [fixme]
OK_ICON Vancouver, BC Open Data Catalog
OK_ICON Victoria, BC, Canada
OK_ICON Vienna, Austria
OK_ICON U.S. Congressional Research Service (CRS) Reports
Healthcare
OK_ICON Composition of Foods Raw, Processed, Prepared USDA National Nutrient Database for Standard [...]
OK_ICON EHDP Large Health Data Sets
OK_ICON GDC - GDC supports several cancer genome programs for CCG, TCGA, TARGET etc.
OK_ICON Gapminder World demographic databases
OK_ICON MeSH, the vocabulary thesaurus used for indexing articles for PubMed
OK_ICON Medicare Coverage Database (MCD), U.S.
OK_ICON Medicare Data Engine of medicare.gov Data
OK_ICON Medicare Data File
OK_ICON Number of Ebola Cases and Deaths in Affected Countries (2014)
OK_ICON Open-ODS (structure of the UK NHS)
OK_ICON OpenPaymentsData, Healthcare financial relationship data
OK_ICON PhysioBank Databases - A large and growing archive of physiological data.
OK_ICON The Cancer Imaging Archive (TCIA)
OK_ICON The Cancer Genome Atlas project (TCGA)
OK_ICON World Health Organization Global Health Observatory
OK_ICON Informatics for Integrating Biology & the Bedside
ImageProcessing
OK_ICON 10k US Adult Faces Database
OK_ICON 2GB of Photos of Cats
OK_ICON Adience Unfiltered faces for gender and age classification
OK_ICON Affective Image Classification
OK_ICON Animals with attributes
OK_ICON CADDY Underwater Stereo-Vision Dataset of divers' hand gestures - [...]
OK_ICON Caltech Pedestrian Detection Benchmark
OK_ICON Chars74K dataset - Character Recognition in Natural Images (both English [...]
OK_ICON Danbooru Tagged Anime Illustration Dataset - A large-scale anime image [...]
FIXME_ICON DukeMTMC Data Set - DukeMTMC aims to accelerate advances in multi-target [...] [fixme]
OK_ICON Face Recognition Benchmark
FIXME_ICON Flickr: 32 Class Brand Logos [fixme]
OK_ICON GDXray - X-ray images for X-ray testing and Computer Vision
OK_ICON HumanEva Dataset - The HumanEva-I dataset contains 7 calibrated video [...]
OK_ICON ImageNet (in WordNet hierarchy)
OK_ICON Indoor Scene Recognition
OK_ICON International Affective Picture System, UFL
OK_ICON KITTI Vision Benchmark Suite
OK_ICON Labeled Information Library of Alexandria - Biology and Conservation - [...]
OK_ICON MNIST database of handwritten digits, near 1 million examples
OK_ICON Massive Visual Memory Stimuli, MIT
OK_ICON Open Images From Google - Pictures with segmentation masks for 2.8 [...]
OK_ICON SUN database, MIT
FIXME_ICON Several Shape-from-Silhouette Datasets [fixme]
OK_ICON Stanford Dogs Dataset
OK_ICON The Action Similarity Labeling (ASLAN) Challenge
OK_ICON The Oxford-IIIT Pet Dataset
OK_ICON Violent-Flows - Crowd Violence / Non-violence Database and benchmark
OK_ICON Visual genome
OK_ICON YouTube Faces Database
MachineLearning
OK_ICON All-Age-Faces Dataset - Contains 13'322 Asian face images distributed [...]
OK_ICON Context-aware data sets from five domains
OK_ICON Delve Datasets for classification and regression
OK_ICON Discogs Monthly Data
OK_ICON Free Music Archive
OK_ICON IMDb Database
FIXME_ICON Keel Repository for classification, regression and time series [fixme]
OK_ICON Labeled Faces in the Wild (LFW)
OK_ICON Lending Club Loan Data
OK_ICON Machine Learning Data Set Repository
OK_ICON Million Song Dataset
OK_ICON More Song Datasets
OK_ICON MovieLens Data Sets
OK_ICON New Yorker caption contest ratings
OK_ICON RDataMining - "R and Data Mining" ebook data
OK_ICON Registered Meteorites on Earth
OK_ICON Restaurants Health Score Data in San Francisco
OK_ICON UCI Machine Learning Repository
OK_ICON Yahoo! Ratings and Classification Data
OK_ICON YouTube-BoundingBoxes
OK_ICON Youtube 8m
OK_ICON eBay Online Auctions (2012)
Museums
OK_ICON Canada Science and Technology Museums Corporation's Open Data
OK_ICON Cooper-Hewitt's Collection Database
OK_ICON Minneapolis Institute of Arts metadata
OK_ICON Natural History Museum (London) Data Portal
OK_ICON Rijksmuseum Historical Art Collection
OK_ICON Tate Collection metadata
OK_ICON The Getty vocabularies
NaturalLanguage
OK_ICON Automatic Keyphrase Extraction
OK_ICON Blizzard Challenge Speech - The speech + text data comes from [...]
OK_ICON Blogger Corpus
OK_ICON CLiPS Stylometry Investigation Corpus
OK_ICON ClueWeb09 FACC
OK_ICON ClueWeb12 FACC
OK_ICON DBpedia - 4.58M things with 583M facts
OK_ICON Flickr Personal Taxonomies
OK_ICON Freebase of people, places, and things
OK_ICON German Political Speeches Corpus - Collection of political speeches from [...]
OK_ICON Google Books Ngrams (2.2TB)
OK_ICON Google MC-AFP - Generated based on the public available Gigaword dataset [...]
OK_ICON Google Web 5gram (1TB, 2006)
OK_ICON Gutenberg eBooks List
OK_ICON Hansards text chunks of Canadian Parliament
OK_ICON LJ Speech - Speech dataset consisting of 13,100 short audio clips of a [...]
FIXME_ICON M-AILabs Speech - The M-AILABS Speech Dataset is the first large dataset [...] [fixme]
OK_ICON Microsoft MAchine Reading COmprehension Dataset (or MS MARCO)
OK_ICON Machine Comprehension Test (MCTest) of text from Microsoft Research
OK_ICON Machine Translation of European languages
FIXME_ICON Making Sense of Microposts 2013 - Concept Extraction [fixme]
OK_ICON Making Sense of Microposts 2016 - Named Entity rEcognition and Linking
OK_ICON Multi-Domain Sentiment Dataset (version 2.0)
OK_ICON Noisy speech database for training speech enhancement algorithms and TTS [...]
OK_ICON Open Multilingual Wordnet
OK_ICON POS/NER/Chunk annotated data
OK_ICON Personae Corpus
OK_ICON SMS Spam Collection in English
OK_ICON SaudiNewsNet Collection of Saudi Newspaper Articles (Arabic, 30K articles)
OK_ICON Stanford Question Answering Dataset (SQuAD)
OK_ICON USENET postings corpus of 2005~2011
OK_ICON Universal Dependencies
OK_ICON Webhose - News/Blogs in multiple languages
OK_ICON Wikidata - Wikipedia databases
OK_ICON Wikipedia Links data - 40 Million Entities in Context
OK_ICON WordNet databases and tools
OK_ICON WorldTree Corpus of Explanation Graphs for Elementary Science Questions - [...]
Neuroscience
OK_ICON Allen Institute Datasets
OK_ICON Brain Catalogue
OK_ICON Brainomics
FIXME_ICON CodeNeuro Datasets [fixme]
OK_ICON Collaborative Research in Computational Neuroscience (CRCNS)
OK_ICON FCP-INDI
OK_ICON Human Connectome Project
OK_ICON NDAR
OK_ICON NIMH Data Archive
OK_ICON NeuroData
OK_ICON NeuroMorpho - NeuroMorpho.Org is a centrally curated inventory of [...]
OK_ICON Neuroelectro
OK_ICON OASIS
OK_ICON OpenNEURO
OK_ICON OpenfMRI
OK_ICON Study Forrest
Physics
OK_ICON CERN Open Data Portal
OK_ICON Crystallography Open Database
OK_ICON IceCube - South Pole Neutrino Observatory
OK_ICON Ligo Open Science Center (LOSC) - Gravitational wave data from the LIGO [...]
OK_ICON NASA Exoplanet Archive
OK_ICON NSSDC (NASA) data of 550 space spacecraft
OK_ICON Sloan Digital Sky Survey (SDSS) - Mapping the Universe
ProstateCancer
OK_ICON EOPC-DE-Early-Onset-Prostate-Cancer-Germany - Early Onset Prostate Cancer [...]
OK_ICON GENIE - Data from the Genomics Evidence Neoplasia Information Exchange [...]
OK_ICON Genomic-Hallmarks-Prostate-Adenocarcinoma-CPC-GENE - Comprehensive [...]
OK_ICON MSK-IMPACT-Clinical-Sequencing-Cohort-MSKCC-Prostate-Cancer - Targeted [...]
OK_ICON Metastatic-Prostate-Adenocarcinoma-MCTP - Comprehensive profiling of 61 [...]
OK_ICON Metastatic-Prostate-Cancer-SU2CPCF-Dream-Team - Comprehensive analysis of [...]
OK_ICON NPCR-2001-2015 - Database from CDC's National Program of Cancer [...]
OK_ICON NPCR-2005-2015 - Database from CDC's National Program of Cancer [...]
OK_ICON NaF-Prostate - NaF Prostate is a collection of F-18 NaF positron emission [...]
OK_ICON Neuroendocrine-Prostate-Cancer - Whole exome and RNA Seq data of [...]
OK_ICON PLCO-Prostate-Diagnostic-Procedures - The Prostate Diagnostic Procedures [...]
OK_ICON PLCO-Prostate-Medical-Complications - The Prostate Medical Complications [...]
OK_ICON PLCO-Prostate-Screening-Abnormalities - The Prostate Screening [...]
OK_ICON PLCO-Prostate-Screening - The Prostate Screening dataset (177,315 [...]
OK_ICON PLCO-Prostate-Treatments - The Prostate Treatments dataset (13,409 [...]
OK_ICON PLCO-Prostate - The Prostate dataset is a comprehensive dataset that [...]
OK_ICON PRAD-CA-Prostate-Adenocarcinoma-Canada - Prostate Adenocarcinoma - [...]
OK_ICON PRAD-FR-Prostate-Adenocarcinoma-France - Prostate Adenocarcinoma - [...]
OK_ICON PRAD-UK-Prostate-Adenocarcinoma-United-Kingdom - Prostate Adenocarcinoma [...]
OK_ICON PROSTATEx-Challenge - Retrospective set of prostate MR studies. All [...]
OK_ICON Prostate-3T - The Prostate-3T project provided imaging data to TCIA as [...]
OK_ICON Prostate-Adenocarcinoma-Broad-Cornell-2012 - Comprehensive profiling of [...]
OK_ICON Prostate-Adenocarcinoma-Broad-Cornell-2013 - Comprehensive profiling of [...]
OK_ICON Prostate-Adenocarcinoma-CNA-study-MSKCC - Copy-number profiling of 103 [...]
OK_ICON Prostate-Adenocarcinoma-Fred-Hutchinson-CRC - Comprehensive profiling of [...]
OK_ICON Prostate Adenocarcinoma (MSKCC/DFCI) - Whole Exome Sequencing of 1013 [...]
OK_ICON Prostate-Adenocarcinoma-MSKCC - MSKCC Prostate Oncogenome Project. 181 [...]
OK_ICON Prostate-Adenocarcinoma-Organoids-MSKCC - Exome profiling of prostate [...]
OK_ICON Prostate-Adenocarcinoma-Sun-Lab - Whole-genome and Transcriptome [...]
OK_ICON Prostate-Adenocarcinoma-TCGA-PanCancer-Atlas - Comprehensive TCGA [...]
OK_ICON Prostate-Adenocarcinoma-TCGA - Integrated profiling of 333 primary [...]
OK_ICON Prostate-Diagnosis - PCa T1- and T2-weighted magnetic resonance images [...]
OK_ICON Prostate-Fused-MRI-Pathology - The Prostate Fused-MRI-Pathology [...]
OK_ICON Prostate-MRI - The Prostate-MRI collection of prostate Magnetic Resonance [...]
OK_ICON Prostate-R - The popular statistical package R contains a prostate cancer [...]
OK_ICON QIN-PROSTATE-Repeatability - The QIN-PROSTATE-Repeatability dataset is a [...]
OK_ICON QIN-PROSTATE - The QIN PROSTATE collection of the Quantitative Imaging [...]
OK_ICON SEER-YR1973_2015.SEER9 - The SEER November 2017 Research Data files from [...]
OK_ICON SEER-YR1992_2015.SJ_LA_RG_AK - The SEER November 2017 Research Data files [...]
OK_ICON SEER-YR2000_2015.CA_KY_LO_NJ_GA - The SEER November 2017 Research Data [...]
OK_ICON SEER-YR2000_2015.CA_KY_LO_NJ_GA - The July - December 2005 diagnoses for [...]
OK_ICON TCGA-PRAD-US - TCGA Prostate Adenocarcinoma (499 samples).
Psychology+Cognition
FIXME_ICON OSU Cognitive Modeling Repository Datasets [fixme]
PublicDomains
OK_ICON Amazon
OK_ICON Archive.org Datasets
OK_ICON Archive-it from Internet Archive
OK_ICON CMU JASA data archive
OK_ICON CMU StatLab collections
FIXME_ICON Data.World [fixme]
OK_ICON Data360
OK_ICON Enigma Public
OK_ICON Google
OK_ICON Grand Comics Database - The Grand Comics Database (GCD) is a nonprofit, [...]
FIXME_ICON Infochimps [fixme]
OK_ICON KDNuggets Data Collections
OK_ICON Microsoft Azure Data Market Free DataSets
OK_ICON Microsoft Data Science for Research
OK_ICON Microsoft Research Open Data
OK_ICON Numbray
OK_ICON Open Library Data Dumps
OK_ICON Reddit Datasets
OK_ICON RevolutionAnalytics Collection
OK_ICON Sample R data sets
OK_ICON StatSci.org
OK_ICON Stats4Stem R data sets (archived)
OK_ICON The Washington Post List
OK_ICON UCLA SOCR data collection
OK_ICON UFO Reports
OK_ICON Wikileaks 911 pager intercepts
OK_ICON Yahoo Webscope
SearchEngines
OK_ICON Academic Torrents of data sharing from UMB
FIXME_ICON DataMarket (Qlik) [fixme]
OK_ICON Datahub.io
OK_ICON Harvard Dataverse Network of scientific data
OK_ICON ICPSR (UMICH)
OK_ICON Institute of Education Sciences
OK_ICON National Technical Reports Library
OK_ICON Open Data Certificates (beta)
OK_ICON OpenDataNetwork - A search engine of all Socrata powered data portals
OK_ICON Statista.com - statistics and Studies
OK_ICON Zenodo - An open dependable home for the long-tail of science
SocialNetworks
OK_ICON 72 hours #gamergate Twitter Scrape
OK_ICON Ancestry.com Forum Dataset over 10 years
OK_ICON CMU Enron Email of 150 users
OK_ICON Cheng-Caverlee-Lee September 2009 - January 2010 Twitter Scrape
OK_ICON EDRM Enron EMail of 151 users, hosted on S3
OK_ICON Facebook Data Scrape (2005)
OK_ICON Facebook Social Networks from LAW (since 2007)
OK_ICON Foursquare from UMN/Sarwat (2013)
OK_ICON GitHub Collaboration Archive
FIXME_ICON Google Scholar citation relations [fixme]
OK_ICON High-Resolution Contact Networks from Wearable Sensors
OK_ICON Indie Map: social graph and crawl of top IndieWeb sites
OK_ICON Mobile Social Networks from UMASS
OK_ICON Network Twitter Data
OK_ICON Reddit Comments
OK_ICON Skytrax' Air Travel Reviews Dataset
OK_ICON Social Twitter Data
OK_ICON SourceForge.net Research Data
OK_ICON Twitter Data for Online Reputation Management
OK_ICON Twitter Data for Sentiment Analysis
OK_ICON Twitter Graph of entire Twitter site
FIXME_ICON Twitter Scrape Calufa May 2011 [fixme]
OK_ICON UNIMI/LAW Social Network Datasets
OK_ICON United States Congress Twitter Data - Daily datasets with tweets of 1100+ [...]
OK_ICON Yahoo! Graph and Social Data
OK_ICON Youtube Video Social Graph in 2007,2008
SocialSciences
OK_ICON ACLED (Armed Conflict Location & Event Data Project)
OK_ICON Canadian Legal Information Institute
FIXME_ICON Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc [fixme]
OK_ICON Correlates of War Project
OK_ICON Cryptome Conspiracy Theory Items
FIXME_ICON Datacards [fixme]
OK_ICON European Social Survey
OK_ICON FBI Hate Crime 2013 - aggregated data
FIXME_ICON Fragile States Index [fixme]
OK_ICON GDELT Global Events Database
OK_ICON General Social Survey (GSS) since 1972
OK_ICON German Social Survey
OK_ICON Global Religious Futures Project
OK_ICON Gun Violence Data - A comprehensive, accessible database that contains [...]
OK_ICON Humanitarian Data Exchange
FIXME_ICON INFORM Index for Risk Management [fixme]
OK_ICON Institute for Demographic Studies
OK_ICON International Networks Archive
OK_ICON International Social Survey Program ISSP
OK_ICON International Studies Compendium Project
OK_ICON James McGuire Cross National Data
OK_ICON MIT Reality Mining Dataset
OK_ICON MacroData Guide by Norsk samfunnsvitenskapelig datatjeneste
OK_ICON Microsoft Academic Knowledge Graph - The Microsoft Academic Knowledge [...]
OK_ICON Minnesota Population Center
OK_ICON Notre Dame Global Adaptation Index (ND-GAIN)
OK_ICON Open Crime and Policing Data in England, Wales and Northern Ireland
OK_ICON OpenSanctions - A global database of persons and companies of political, [...]
OK_ICON Paul Hensel General International Data Page
OK_ICON PewResearch Internet Survey Project
OK_ICON PewResearch Society Data Collection
FIXME_ICON Political Polarity Data [fixme]
OK_ICON StackExchange Data Explorer
OK_ICON Terrorism Research and Analysis Consortium
OK_ICON Texas Inmates Executed Since 1984
OK_ICON Titanic Survival Data Set
FIXME_ICON UCB's Archive of Social Science Data (D-Lab) [fixme]
OK_ICON UCLA Social Sciences Data Archive
OK_ICON UN Civil Society Database
OK_ICON UPJOHN for Labor Employment Research
OK_ICON Universities Worldwide
OK_ICON Uppsala Conflict Data Program
OK_ICON World Bank Open Data
OK_ICON WorldPop project - Worldwide human population distributions
Software
OK_ICON FLOSSmole data about free, libre, and open source software development
OK_ICON GHTorrent - Scalable, queriable, offline mirror of data offered through [...]
OK_ICON Libraries.io Open Source Repository and Dependency Metadata
OK_ICON Public Git Archive - a Big Code dataset for all – dataset of 182,014 top- [...]
OK_ICON Code duplicates - 2k Java file and 600 Java function pairs labeled as [...]
OK_ICON Commit messages - 1.3 billion GitHub commit messages till March 2019
OK_ICON Pull Request review comments - 25.3 million GitHub PR review comments [...]
OK_ICON Source Code Identifiers - 41.7 million distinct splittable identifiers [...]
Sports
OK_ICON American Ninja Warrior Obstacles - Contains every obstacle in the history [...]
OK_ICON Betfair Historical Exchange Data
OK_ICON Cricsheet Matches (cricket)
OK_ICON Ergast Formula 1, from 1950 up to date (API)
OK_ICON Football/Soccer resources (data and APIs)
OK_ICON Lahman's Baseball Database
OK_ICON Pinhooker: Thoroughbred Bloodstock Sale Data
OK_ICON Pro Kabadi season 1 to 7 - Pro Kabadi League is a professional-level [...]
OK_ICON Retrosheet Baseball Statistics
OK_ICON Tennis database of rankings, results, and stats for ATP
OK_ICON Tennis database of rankings, results, and stats for WTA
TimeSeries
OK_ICON 3W dataset - To the best of its authors' knowledge, this is the first [...]
OK_ICON Databanks International Cross National Time Series Data Archive
OK_ICON Hard Drive Failure Rates
OK_ICON Heart Rate Time Series from MIT
FIXME_ICON Time Series Data Library (TSDL) from MU [fixme]
OK_ICON UC Riverside Time Series Dataset
Transportation
OK_ICON Airlines OD Data 1987-2008
OK_ICON Ford GoBike Data (formerly Bay Area Bike Share Data)
OK_ICON Bike Share Systems (BSS) collection
OK_ICON Dutch Traffic Information
OK_ICON GeoLife GPS Trajectory from Microsoft Research
OK_ICON German train system by Deutsche Bahn
FIXME_ICON Hubway Million Rides in MA [fixme]
OK_ICON Montreal BIXI Bike Share
OK_ICON NYC Taxi Trip Data 2009-
OK_ICON NYC Taxi Trip Data 2013 (FOIA/FOILed)
OK_ICON NYC Uber trip data April 2014 to September 2014
OK_ICON Open Traffic collection
OK_ICON OpenFlights - airport, airline and route data
OK_ICON Philadelphia Bike Share Stations (JSON)
OK_ICON Plane Crash Database, since 1920
OK_ICON RITA Airline On-Time Performance data
OK_ICON RITA/BTS transport data collection (TranStat)
OK_ICON Renfe (Spanish National Railway Network) dataset
OK_ICON Toronto Bike Share Stations (JSON and GBFS files)
OK_ICON Transport for London (TFL)
OK_ICON Travel Tracker Survey (TTS) for Chicago
OK_ICON U.S. Bureau of Transportation Statistics (BTS)
OK_ICON U.S. Domestic Flights 1990 to 2009
OK_ICON U.S. Freight Analysis Framework since 2007
eSports
OK_ICON OpenDota data dump
Complementary Collections
Data Packaged Core Datasets
Database of Scientific Code Contributions
A growing collection of public datasets: CoolDatasets.
DataWrangling: Some Datasets Available on the Web
Inside-r: Finding Data on the Internet
OpenDataMonitor: An overview of available open data resources in Europe
Quora: Where can I find large datasets open to the public?
RS.io: 100+ Interesting Data Sets for Statistics
StaTrek: Leveraging open data to understand urban lives

下载地址:https://github.com/awesomedata/awesome-public-datasets

你可能感兴趣的:(算法)