- Visi komputer
- List of datasets for machine-learning research
- Automated machine learning
- Active learning (machine learning)
- Boosting (machine learning)
- Online machine learning
- Statistical classification
- List of datasets in computer vision and image processing
- Supervised learning
- Leakage (machine learning)
- Applications of artificial intelligence
Terminator 3: Rise of the Machines (2003)
Hot Tub Time Machine (2010)
The Last Samurai (2003)
Escape Plan (2013)
List of datasets for machine-learning research GudangMovies21 Rebahinxxi LK21
These datasets are used in machine learning (ML) research and have been cited in peer-reviewed academic journals. Datasets are an integral part of the field of machine learning. Major advances in this field can result from advances in learning algorithms (such as deep learning), computer hardware, and, less-intuitively, the availability of high-quality training datasets. High-quality labeled training datasets for supervised and semi-supervised machine learning algorithms are usually difficult and expensive to produce because of the large amount of time needed to label the data. Although they do not need to be labeled, high-quality datasets for unsupervised learning can also be difficult and costly to produce.
Many organizations, including governments, publish and share their datasets. The datasets are classified, based on the licenses, as Open data and Non-Open data.
The datasets from various governmental-bodies are presented in List of open government data sites. The datasets are ported on open data portals. They are made available for searching, depositing and accessing through interfaces like Open API. The datasets are made available as various sorted types and subtypes.
List of sorting used for datasets
The data portal is classified based on its type of license. The open source license based data portals are known as open data portals which are used by many government organizations and academic institutions.
List of open data portals
List of portals suitable for multiple types of applications
The data portal sometimes lists a wide variety of subtypes of datasets pertaining to many machine learning applications.
List of portals suitable for a specific subtype of applications
The data portals which are suitable for a specific subtype of machine learning application are listed in the subsequent sections.
Image data
Text data
These datasets consist primarily of text for tasks such as natural language processing, sentiment analysis, translation, and cluster analysis.
= Reviews
== News articles
== Messages
== Twitter and tweets
== Dialogues
== Legal
== Other text
=Sound data
These datasets consist of sounds and sound features used for tasks such as speech recognition and speech synthesis.
= Speech
== Music
== Other sounds
=Signal data
Datasets containing electric signal information requiring some sort of signal processing for further analysis.
= Electrical
== Motion-tracking
== Other signals
=Physical data
Datasets from physical systems.
= High-energy physics
== Systems
== Astronomy
== Earth science
== Other physical
=Biological data
Datasets from biological systems.
= Human
== Animal
== Fungi
== Plant
== Microbe
== Drug discovery
=Anomaly data
Question answering data
This section includes datasets that deals with structured data.
Dialog or instruction prompted data
This section includes datasets that ...
Cybersecurity
Climate and sustainability
Code data
Multivariate data
= Financial
== Weather
== Census
== Transit
== Internet
== Games
== Other multivariate
=Curated repositories of datasets
As datasets come in myriad formats and can sometimes be difficult to use, there has been considerable work put into curating and standardizing the format of datasets to make them easier to use for machine learning research.
OpenML: Web platform with Python, R, Java, and other APIs for downloading hundreds of machine learning datasets, evaluating algorithms on datasets, and benchmarking algorithm performance against dozens of other algorithms.
PMLB: A large, curated repository of benchmark datasets for evaluating supervised machine learning algorithms. Provides classification and regression datasets in a standardized format that are accessible through a Python API.
Metatext NLP: https://metatext.io/datasets web repository maintained by community, containing nearly 1000 benchmark datasets, and counting. Provides many tasks from classification to QA, and various languages from English, Portuguese to Arabic.
Appen: Off The Shelf and Open Source Datasets hosted and maintained by the company. These biological, image, physical, question answering, signal, sound, text, and video resources number over 250 and can be applied to over 25 different use cases.
See also
Comparison of deep learning software
List of manual image annotation tools
List of biological databases
References
Kata Kunci Pencarian:
GitHub - ihaagrawal/Machine-learning-datasets
![Machine Learning Datasets | Various Types of Datasets for Data Scientists](https://res.cloudinary.com/dyadcr1f1/image/fetch/f_auto,q_auto/https%3A%2F%2Fcdn.educba.com%2Facademy%2Fwp-content%2Fuploads%2F2020%2F01%2Fmachine-learning-datasets.jpg)
Machine Learning Datasets | Various Types of Datasets for Data Scientists
![Datasets for Machine Learning - PostNetwork Academy](https://res.cloudinary.com/dyadcr1f1/image/fetch/f_auto,q_auto/https%3A%2F%2Fwww.postnetwork.co%2Fwp-content%2Fuploads%2Fdatasets.png)
Datasets for Machine Learning - PostNetwork Academy
![14 Best Datasets for Machine Learning - HashDork](https://res.cloudinary.com/dyadcr1f1/image/fetch/f_auto,q_auto/https%3A%2F%2Fhashdork.com%2Fwp-content%2Fuploads%2F2021%2F11%2Fdatasets_1-768x481.jpg)
14 Best Datasets for Machine Learning - HashDork
![Top Machine Learning Datasets - The Ultimate Guide](https://res.cloudinary.com/dyadcr1f1/image/fetch/f_auto,q_auto/https%3A%2F%2Fintellipaat.com%2FmediaFiles%2F2018%2F07%2FHow-Do-We-Get-the-Right-Data-for-Machine-Learning.png)
Top Machine Learning Datasets - The Ultimate Guide
![Best Public Datasets for Machine Learning | 365 Data Science](https://res.cloudinary.com/dyadcr1f1/image/fetch/f_auto,q_auto/https%3A%2F%2F365datascience.com%2Fwp-content%2Fuploads%2F2020%2F08%2FBest-Public-DataSets-for-Machine-Learning.jpg)
Best Public Datasets for Machine Learning | 365 Data Science
![Best free datasets for machine learning and data science. What are the ...](https://res.cloudinary.com/dyadcr1f1/image/fetch/f_auto,q_auto/https%3A%2F%2Fi.pinimg.com%2Foriginals%2F3c%2Fae%2F2f%2F3cae2f906d9d4b01c2d9f3457e1b65be.jpg)
Best free datasets for machine learning and data science. What are the ...
![Free Datasets for Machine Learning & Deep Learning - Analytics Yogi](https://res.cloudinary.com/dyadcr1f1/image/fetch/f_auto,q_auto/https%3A%2F%2Fvitalflux.com%2Fwp-content%2Fuploads%2F2021%2F02%2Fdataset_publicly_available_free_machine_learning.png)
Free Datasets for Machine Learning & Deep Learning - Analytics Yogi
![15 Best Machine Learning Datasets For Free](https://res.cloudinary.com/dyadcr1f1/image/fetch/f_auto,q_auto/https%3A%2F%2Fwww.blog.duomly.com%2Fwp-content%2Fuploads%2F2019%2F07%2Fblog_machine_learning_datasets_duomly_programming_courses.png)
15 Best Machine Learning Datasets For Free
![Information on 14 machine learning datasets | Download Scientific Diagram](https://res.cloudinary.com/dyadcr1f1/image/fetch/f_auto,q_auto/https%3A%2F%2Fwww.researchgate.net%2Fpublication%2F327138203%2Ffigure%2Ftbl1%2FAS%3A852876633059350%401580353043542%2FInformation-on-14-machine-learning-datasets.png)
Information on 14 machine learning datasets | Download Scientific Diagram
![70+ Machine Learning Datasets & Project Ideas – Work on real-time Data ...](https://res.cloudinary.com/dyadcr1f1/image/fetch/f_auto,q_auto/https%3A%2F%2Fdata-flair.training%2Fblogs%2Fwp-content%2Fuploads%2Fsites%2F2%2F2019%2F11%2F70-machine-learning-datasets-projects.jpg)
70+ Machine Learning Datasets & Project Ideas – Work on real-time Data ...
![70+ Machine Learning Datasets & Project Ideas – Work on real-time Data ...](https://res.cloudinary.com/dyadcr1f1/image/fetch/f_auto,q_auto/https%3A%2F%2Fdata-flair.training%2Fblogs%2Fwp-content%2Fuploads%2Fsites%2F2%2F2019%2F11%2Fmachine-learning-datasets-for-beginners-1.jpg)
70+ Machine Learning Datasets & Project Ideas – Work on real-time Data ...