As we all know, a dataset acts as a cheap local copy of the data we care about, so that we do not have to keep making expensive, high-latency calls to the database.
Here are some commonly used datasets:
IMAGE SEGMENTATION: It is typically used to locate objects and boundaries (lines, curves, etc.) in images. More precisely, image segmentation is the process of assigning a label to every pixel in an image such that pixels with the same label share certain characteristics.
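To make the "one label per pixel" idea concrete, here is a minimal sketch using plain Python lists. The tiny image and the threshold of 128 are made up for illustration; real segmentation methods are far more elaborate than a single brightness cutoff.

```python
# Toy grayscale image (values 0-255). The pixel values are invented for illustration.
image = [
    [10, 12, 200, 210],
    [11, 190, 205, 15],
    [9, 14, 13, 12],
]

def segment_by_threshold(img, threshold=128):
    """Assign label 1 to pixels brighter than `threshold`, and label 0 otherwise."""
    return [[1 if px > threshold else 0 for px in row] for row in img]

# Every pixel gets exactly one label; pixels with the same label
# share one characteristic (here: being on the same side of the threshold).
labels = segment_by_threshold(image)
for row in labels:
    print(row)
```

The result has the same shape as the input image, which is the property that separates segmentation from whole-image classification.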
IRIS: It is also known as Fisher's Iris data set and is often used as a test case for classification and cluster analysis. The data set contains two clusters with a rather obvious separation: one contains Iris setosa, while the other contains both Iris virginica and Iris versicolor.
The data set can also be approximated by the closest tree, with some penalty for an excessive number of nodes, bending and stretching. The result is the so-called "Metro Tree".
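The two-cluster structure can be seen even with a single measurement. The sketch below is only an illustration: the six rows are quoted from memory of the commonly distributed Iris file (check them against your own copy), and the 2.5 cm petal-length cutoff is an assumed value, not part of the data set.

```python
# (sepal length, sepal width, petal length, petal width, species), all in cm.
# Rows quoted from memory of the commonly distributed file; verify before relying on them.
rows = [
    (5.1, 3.5, 1.4, 0.2, "setosa"),
    (4.9, 3.0, 1.4, 0.2, "setosa"),
    (7.0, 3.2, 4.7, 1.4, "versicolor"),
    (6.4, 3.2, 4.5, 1.5, "versicolor"),
    (6.3, 3.3, 6.0, 2.5, "virginica"),
    (5.8, 2.7, 5.1, 1.9, "virginica"),
]

PETAL_LENGTH_CUTOFF = 2.5  # assumed cutoff in cm, chosen for illustration

def split_by_petal_length(data, cutoff=PETAL_LENGTH_CUTOFF):
    """Return the species found below and at-or-above the petal-length cutoff."""
    short_petals, long_petals = set(), set()
    for sepal_len, sepal_wid, petal_len, petal_wid, species in data:
        if petal_len < cutoff:
            short_petals.add(species)
        else:
            long_petals.add(species)
    return short_petals, long_petals

short_petals, long_petals = split_by_petal_length(rows)
print(sorted(short_petals))
print(sorted(long_petals))
```

On these rows, one group holds only setosa and the other holds both versicolor and virginica, which matches the separation described above.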
IONOSPHERE: This dataset is used to distinguish between good and bad radar returns. A good return is one showing evidence of some type of structure in the ionosphere. A bad return is one whose signal simply passes through the ionosphere.
These are used in computational experiments.
TIME SERIES: Machine learning can be applied to time series datasets. These are problems where a numeric or categorical value must be predicted, but the rows of data are ordered by time. A common problem when getting started in time series forecasting with machine learning is finding good-quality standard datasets on which to practice.
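Because the rows are ordered by time, a common way to prepare such data for a learning algorithm is a sliding window: the previous few values become the inputs and the next value becomes the target. The sketch below assumes a window of 3 lags and uses a made-up series; the function name `make_supervised` is hypothetical.

```python
def make_supervised(series, n_lags=3):
    """Turn an ordered series into (inputs, target) pairs using a sliding window."""
    X, y = [], []
    for i in range(n_lags, len(series)):
        X.append(series[i - n_lags:i])  # the n_lags values just before position i
        y.append(series[i])             # the value to predict
    return X, y

# Made-up observations, listed in time order.
series = [10, 12, 13, 15, 18, 21]
X, y = make_supervised(series, n_lags=3)
for inputs, target in zip(X, y):
    print(inputs, "->", target)
```

Note that the order of the rows is never shuffled here. With time series, the training data should come from earlier in time than the data used for testing.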
MNIST: This is a large database of handwritten digits that is commonly used for training various image processing systems. It is also widely used for training and testing in the field of machine learning.