Social Sciences

23 Datasets

US Census Data (1990)

The data was collected as part of the 1990 census. There are 68 categorical attributes. This data set was derived from the USCensus1990raw data set. The…

clustering, multivariate

Census-Income (KDD)

This data set contains weighted census data extracted from the 1994 and 1995 Current Population Surveys conducted by the U.S. Census Bureau. The data cont…

classification, multivariate

Insurance Company Benchmark (…

Information about customers consists of 86 variables and includes product usage data and socio-demographic data derived from zip area codes. The data was …

description, multivariate, regression

IPUMS Census Database

The original source for this data set is the IPUMS project (Ruggles & Sobek, 1997). The IPUMS project is a large collection of federal census data which has …

multivariate

Communities and Crime

Many variables are included so that algorithms that select or learn weights for attributes could be tested. However, clearly unrelated attributes were n…

multivariate, regression

Communities and Crime Unnorma…

The source datasets needed to be combined via programming. Many variables are included so that algorithms that select or learn weights for attributes coul…

multivariate, regression

NYSK

Documents are first obtained via a Web search using AMIEI: an integrated platform for delivering enterprise intelligence, developed by AMI Software ([Web …

clustering, multivariate, sequential, text

Bike Sharing Dataset

Bike sharing systems are a new generation of traditional bike rentals in which the whole process, from membership to rental and return, has become automatic. Thro…

regression, univariate

Twitter Data set for Arabic S…

By using a tweet crawler, we collect 2000 labelled tweets (1000 positive tweets and 1000 negative ones) on various topics such as: politics an…

classification, text

BlogFeedback

This data originates from blog posts. The raw HTML documents of the blog posts were crawled and processed. The prediction task associated with the data …

multivariate, regression