Welcome to the UC Irvine Machine Learning Repository

We currently maintain 284 data sets as a service to the machine learning community. You may view all data sets through our searchable interface. Our old web site is still available for those who prefer the old format. For a general overview of the Repository, please visit our About page. For information about citing data sets in publications, please read our citation policy. If you wish to donate a data set, please consult our donation policy. For any other questions, feel free to contact the Repository librarians. We have also set up a mirror site for the Repository.

Default Task

Classification (202), Regression (36), Clustering (31), Other (50)

Attribute Type

Categorical (36), Numerical (152), Mixed (56)

Data Type

Multivariate (217), Univariate (15), Sequential (24), Time-Series (40), Text (26), Domain-Theory (20), Other (21)


Area

Life Sciences (73), Physical Sciences (40), CS / Engineering (73), Social Sciences (18), Business (14), Game (9), Other (55)

# Attributes

Less than 10 (69), 10 to 100 (125), Greater than 100 (42)

# Instances

Less than 100 (13), 100 to 1000 (109), Greater than 1000 (132)

Format Type

Matrix (202), Non-Matrix (82)

