Monotone Learning Datasets

The purpose of this dataset collection is to provide a test bed for monotone classifiers. The original data were collected from various resources, such as UCI and WEKA. A general preprocessing procedure was applied, taking the following aspects into consideration:

Instances with missing values are ignored.
Nominal values are transformed to numeric ones, according to the semantic.
Features are normalized between 0 and 1.
Datasets are formatted as .csv files with the class information in the last column.

We are consistently searching for real-world monotone learning datasets. If you have anything in mind, please do not hesitate to contact Weiwei Cheng.

Dataset	# Instances	# Features	# Classes
breast cancer	278	7	2
car evaluation	1728	6	4
car mpg	392	7	36
cpu	209	6	4
den bosch	119	8	2
employee rejection acceptance (ERA)	1000	4	4
employee selection (ESL)	488	4	9
journal ranking	172	5	4
lectures evaluation (LEV)	1000	4	5
mammographic	830	6	2
social workers decisions (SWD)	1000	10	4

Download (in .zip-format)

These datasets have been used in:

Ali Fallah Tehrani, Weiwei Cheng, Eyke Hüllermeier
Choquistic regression: generalizing logistic regression using the Choquet integral [pdf][bibtex][slides]
Proceedings of the 7th Conference of the European Society for Fuzzy Logic and Technology (EUSFLAT-2011): 868-875, Atlantis Press
Aix-les-Bains, France, July 2011 Nomination for Best Student Paper Award
Ali Fallah Tehrani, Weiwei Cheng, Eyke Hüllermeier
Preference learning using the Choquet integral: the case of multipartite ranking [pdf][bibtex]
IEEE Transactions on Fuzzy Systems, 2012.

Monotone Learning Datasets

Further information: