Computer Science

126 Datasets


KDD Cup 1999 Data

Please see task description.

classification, multivariate

MSNBC.com Anonymous Web Data

The data comes from Internet Information Server (IIS) logs for msnbc.com and news-related portions of msn.com for the entire day of September 28, 1999 (P…

sequential

Pioneer-1 Mobile Robot Data

The data were collected over a series of specifically designed trials. Our hope was to cover most of the types of sensory interactions that a Pioneer migh…

multivariate, time-series

Syskill and Webert Web Page R…

The HTML source of a web page is given. Users looked at each web page and rated it on a 3-point scale (hot, medium, cold), 50-100 pages per domain. However, …

classification, multivariate, text

UNIX User Data

This file contains 9 sets of sanitized user data drawn from the command histories of 8 UNIX computer users at Purdue over the course of up to 2 years (USE…

sequential, text

banknote authentication

Data were extracted from images that were taken from genuine and forged banknote-like specimens. For digitization, an industrial camera usually used for …

classification, multivariate

Gas Sensor Array Drift Datase…

This data set contains 13,910 measurements from 16 chemical sensors exposed to 6 gases at different concentration levels. This dataset is an extension of …

causal-discovery, classification, clustering, multivariate, regression, time-series

UJI Pen Characters

We created a character database by collecting samples from 11 writers. Each writer contributed letters (lowercase and uppercase), digits, and other chara…

classification, multivariate, sequential

OpinRank Review Dataset

Car reviews: full reviews of cars for model years 2007, 2008, and 2009. There are about 140-250 cars for each model year. Extracted fields in…

text

Gisette

The digits have been size-normalized and centered in a fixed-size image of dimension 28x28. The original data were modified for the purpose of the feature…

classification, multivariate