Predict the match percentage

★★★★★

2 votes

Machine Learning

Details

Dataset description

The dataset folder contains a data.csv file that contains the following structure:

Column name	Description
user_id	Represents unique user IDs
username	Represents the name of a user
age	Represents the age of a user
status	Represents the relationship status of a user (Single, available, and so on)
sex	Represents the gender of a user
orientation	Represents the sexual orientation of a user (gay, bisexual, or straight)
drinks	Represents if a user likes to drink or not
drugs	Represents if a user consumes drugs or not
height	Represents the height of a user in inches
job	Represents the profession that a user
location	Represents where a user resides
pets	Represents if a user likes pets or not
smokes	Represents if a user smokes or not
language	Represents the languages spoken by a user
new_languages	Represents if a user is interested to learn a new language
body_profile	Represents the type of body a user has
education_level	Represents the educational level of a user
dropped_out	Represents if a user dropped out of school or college
bio	Represents a user's description
interests	Represents the interests of a user
other_interests	Represents other interests of a user
location_preference	Represents the preferred location to find a date

Submission file format

The submission file is required to be in a matrix format. For example, if the number of users in the dataset provided is \(1000\), then the submission.csv file must contain a matrix of size \(1000 \times 1000\).

You can refer to the 'sample submission.csv' file for the sample dataset provided in the dataset folder.

Note: Ensure that 'user_id' of the users is mentioned correctly in the submission.csv file.

Evaluation metric

The evaluation metric that is used is the root mean square error metric. The score is calculated using the following:

\(score = max(0, 100 - root\_mean\_squared\_error(actual, predicted))\)

Download dataset

Time Limit: 5

Memory Limit: 256

Source Limit:

Contributers:

Syed