Predict the total number of people infected by the 7 different pathogens.
Plague is an epidemic event caused by Bacteria. A group of senior scientists misplaced a package containing fatal plague bacteria during one of their trips. With no means of tracking where the package is, scientists are now trying to come up with a solution to stop the plague. This plague has 7 different strains that are unique for each continent. This strain is expanding rapidly in each continent.
The dataset contains escalations of the plague for all the seven strains. The dataset is a time series in which the training set contains the number of individuals that are infected by the plague over a defined period of time.
Your mission, should you choose to accept it, is to defend the world against this plague by building an algorithm that can minimize the damage.
You can find the dataset here.
You have to predict the columns of PA, PB, PC, PD, PE, PF, PG and please submit in the format given in the “sample.csv” file.
COLUMN DESCRIPTION
Sl. No |
Column Label |
Column Description |
1 |
ID |
A calculated unique ID for each research. |
2 |
DateTime |
Represents the data and time on which the event is recorded |
3 |
TempOut |
Outside Temperature |
4 |
HiTemp |
Highest Temperature |
5 |
LowTemp |
Lowest Temperature |
6 |
OutHum |
Outside Humidity |
7 |
DewPt |
Dew Point |
8 |
WindSpeed |
Wind Speed |
9 |
WindDir |
Wind Direction |
10 |
WindRun |
Wind Run Flow |
11 |
HiSpeed |
Highest Speed of the wind |
12 |
HiDir |
Direction of the wind which has highest speed |
13 |
WindChill |
Chillness of the wind |
14 |
HeatIndex |
Heat Index |
15 |
THWIndex |
THW Index |
16 |
Bar |
Barometer Reading |
17 |
Rain |
Rain |
18 |
RainRate |
Frequency of Rain |
19 |
HeatDD |
Heat DD |
20 |
CoolDD |
Cool DD |
21 |
InTemp |
Temperature Inside |
22 |
InHum |
Humidity Inside |
23 |
InDew |
Dew Inside |
24 |
InHeat |
Heat Inside |
25 |
InEMC |
EMC Inside |
26 |
InAirDensity |
Air Density |
27 |
WindSamp |
Wind - Attribute 1 |
28 |
WindTx |
Wind - Attribute 2 |
29 |
ISSRecpt |
Reception |
30 |
ArcInt |
Attribute |
31 |
PA |
Total No of People infected by Pathogen A |
32 |
PB |
Total No of People infected by Pathogen B |
33 |
PC |
Total No of People infected by Pathogen C |
34 |
PD |
Total No of People infected by Pathogen D |
35 |
PE |
Total No of People infected by Pathogen E |
36 |
PF |
Total No of People infected by Pathogen F |
37 |
PG |
Total No of People infected by Pathogen G |
Your submissions wil be evaluated on Root Mean Squared Error (RMSE). Leaderboard score is calculated as:
leaderboard score=max(0,(100−rmse))