On The Plague Trail

2.7

3 votes
Medium
Problem

PROBLEM STATEMENT

Predict the total number of people infected by the 7 different pathogens.

Plague is an epidemic event caused by Bacteria. A group of senior scientists misplaced a package containing fatal plague bacteria during one of their trips. With no means of tracking where the package is, scientists are now trying to come up with a solution to stop the plague. This plague has 7 different strains that are unique for each continent. This strain is expanding rapidly in each continent.

The dataset contains escalations of the plague for all the seven strains. The dataset is a time series in which the training set contains the number of individuals that are infected by the plague over a defined period of time.

Your mission, should you choose to accept it, is to defend the world against this plague by building an algorithm that can minimize the damage.

DATA DESCRIPTION

You can find the dataset here.

You have to predict the columns of PA, PB, PC, PD, PE, PF, PG and please submit in the format given in the “sample.csv” file.
COLUMN DESCRIPTION

Sl. No

Column Label

Column Description

1

ID

A calculated unique ID for each research.

2

DateTime

Represents the data and time on which the event is recorded

3

TempOut

Outside Temperature

4

HiTemp

Highest Temperature

5

LowTemp

Lowest Temperature

6

OutHum

Outside Humidity

7

DewPt

Dew Point

8

WindSpeed

Wind Speed

9

WindDir

Wind Direction

10

WindRun

Wind Run Flow

11

HiSpeed

Highest Speed of the wind

12

HiDir

Direction of the wind which has highest speed

13

WindChill

Chillness of the wind

14

HeatIndex

Heat Index

15

THWIndex

THW Index

16

Bar

Barometer Reading

17

Rain

Rain

18

RainRate

Frequency of Rain

19

HeatDD

Heat DD

20

CoolDD

Cool DD

21

InTemp

Temperature Inside

22

InHum

Humidity Inside

23

InDew

Dew Inside

24

InHeat

Heat Inside

25

InEMC

EMC Inside

26

InAirDensity

Air Density

27

WindSamp

Wind - Attribute 1

28

WindTx

Wind - Attribute 2

29

ISSRecpt

Reception

30

ArcInt

Attribute

31

PA

Total No of People infected by Pathogen A

32

PB

Total No of People infected by Pathogen B

33

PC

Total No of People infected by Pathogen C

34

PD

Total No of People infected by Pathogen D

35

PE

Total No of People infected by Pathogen E

36

PF

Total No of People infected by Pathogen F

37

PG

Total No of People infected by Pathogen G

EVALUATION CRITERIA

Your submissions wil be evaluated on Root Mean Squared Error (RMSE). Leaderboard score is calculated as:

leaderboard score=max(0,(100rmse))

Time Limit: 5
Memory Limit: 256
Source Limit:
Editor Image

?