Shell.ai Hackathon 2023

Challenge Over

General Edition

2 years ago

General Edition Discussion

Replies (41)

If anyone has a great Cost_forecast but is struggling with the oprimization problem have a look at my optimization strategy:https://github.com/raubenheimer/Genetic-Opt-for-Shell-Hackathon-2023/tree/main

Regrettably, I wasn't able to reduce my Cost_forecast sufficiently to be competitive.

2 years ago

arolharlem

Hello everyone,

Please, I am looking to join some team If there is still place available in your groupe. my email: arolharlem@yahoo.fr. I am currentely at the end of the nanodegree AI programing with Python course.

2 years ago

Prabin Kumar

Doubt about finding the supply chain elements especially finding the Overall cost:

Based on the formulae for finding out the cost of transportation, we need the quantity of biomass, number of pellets between location and dist between the locations. But how to find the number of pellets?
And as we dont have True Biomass values for both 2018 and 2019 , how would we calculate the "Cost of biomass forecast mismatch"?

2 years ago

AlbertoBT

Any hint about how to reduce the dimensionality of the optimization problem? :)

2 years ago

Zephaniah Qamar

Coming in quite late but if anyone is interested in collabing email me at zeph.qamar4@gmail.com. I am a beginner with some experience in Python ML and a little in qgis.

2 years ago

Edith Cheruiyot

Hello. How can I also apply to the startup edition after applying to the general edition?

2 years ago

Aniket Kumar

Can someone help me with the datasets? I am unable to find it on the hackathon page,

2 years ago

Harshil Patel

Please follow the instructions given on the main page.

2 years ago

SOHAN MISHRA

Hi there everyone! I am ready to form a team. Do reach out through my LinkedIn: https://www.linkedin.com/in/sohan-mishra-17b010227/

2 years ago

Shubham Goyal

If your forecasted info is ready, here is the alto you nsider to find the cities you want to install J's at https://chat.openai.com/share/9fefae9c-8b1a-47b0-abfa-9e093fc0edce

2 years ago

Abhishek Kumar

Hello,

I'm Abhishek, having a background in Agriculture science(don't know if that'll help), familiar with basic concepts of ML & Python.

I'm a total beginner but highly motivated and want to participate and submit a relevant submission.

If anyone wants to collaborate and give it a try at least can contact me at my mail: abhi.kumar0799@gmail.com.

2 years ago

Aniket Kumar

Mail me at - akumar3_be19@thapar.edu

2 years ago

Yan Teixeira

Hello everybody,

I just entered this competition. If anyone wants to collaborate, please reach out to me

https://www.linkedin.com/in/yanteix/

2 years ago

Abhishek Kumar

I'm a total beginner, but highly motivated. If you want we can collaborate.PS. I have a bachelors degree in Agriculture.

2 years ago

Navoneel Chakrabarty

I am a Data Scientist at Eindhoven University of Technology, Eindhoven, The Netherlands. I have 2 years of work experience as a Professional Data Scientist along with a Masters in Machine Learning and AI from Liverpool John Moores University, UK. I am open for collaboration. Please share your email address and contact details at nc2012@cse.jgec.ac.in

2 years ago

arolharlem

Hi Sir, I am interested to collaborate and join your team. my mail is arolharlem@yahoo.fr

2 years ago

Navoneel Chakrabarty

I have some questions regarding the Datasets for the hackathon:

Firstly, In the distance matrix dataset, how can I understand which is a harvesting site, depot or biorefinery?

Secondly, What is the formula for calculating the demand-supply matrix of Biomass and Pellet? I have no clue how to combine the forecast of biomass availability and distance (supply-chain).

Thirdly, In the submission file, there will be 3 years: 2018, 2019 and 20182019. I do not understand the meaning of 20182019.

2 years ago

Harshil Patel

Please go through the detailed problem statement (provided on main page of the hackathon): https://apse1-uc.hackerearth.com/he-public-data/Detailed%20Problem%20Statementb5c7c96.pdf. You will get answers of all your queries in it.

2 years ago

Navoneel Chakrabarty

I have very well gone through the PDF containing the detailed problem statement and these are the questions I have post that. If these questions are not answered properly, I will choose to drop out of this hackathon.

2 years ago

Harshil Patel

Let me try to address your queries: Note that the objective is to i) forecast the biomass for year 2018 and 2019 at harvesting sites and then ii) using your forecasted values to place supply-chain elements like depots and refinaries.
There are total 2418 locations. Each location is a harvesting site. You are free to place depots and refinaries at any location given that you follow the constraints. Distance matrix (2418 x 2418) is the distance from any source location to any destination location.
Demand-supply matrix of Biomass and Pellet is the solution you are supposed to provide. It will depend on your forecasted biomass and placement of your supply chain elements (depots and refinaries).
So in summary, forecasted biomass alongwith Biomass and Pellet demand-supply matrices are year dependent. Whereas, supply-chain elements (depots and refineries) are invariant with year. Hence, a special key 20182019 is used for depots and refineries location in solution.

2 years ago

Ebuka Amadi-Obi

So to clarify, it's solely the 2019 score that will determine whether each team is shortlisted or not?

2 years ago

Harshil Patel

Yes. but you will only be able to see your 2019 score (on private leaderboard) after the competition is over. During the competition, the public leaderboard shows your score for year 2018.

2 years ago

Ebuka Amadi-Obi

got it, thanks!

2 years ago

Dakare Shrinath

Hello!! This is Shrinath Dakare. I work as an Operations Research Scientist at Optym. I'm proficient in OR concepts, C# and python. Also, have knowledge of basic ML. Looking for team members. Please reach out to https://www.linkedin.com/in/shrinath-dakare-iitkgp/.

2 years ago

Eduardo Andree

Hello Dakare, if you're still looking for teammates I'm available and looking for a team.

I am a chemical engineer with basic knowledge in ML and DL, and a solid background in math and statistics.

Please let me know if you're interested in forming a team. https://www.linkedin.com/in/eedx/

2 years ago

Hamad Aziz

I didn't find any dataset not even under instructions

2 years ago

Harshil Patel

Instructions to access Data Set :

Register to the hackathon
Create a team with team name(individual team is allowed)
Back to challenge page
Start now
"Data set " is seen under Instructions , click on it to download the ZIP file.

2 years ago

James Raubenheimer

Hi Massimiliano, I agree. The problem statement does not specify how the costs for both years will be aggregated. I.e. will the two costs just be summed, or will they be weighted differently? It would be nice to get some clarity about this.

2 years ago

Harshil Patel

Hi James, We have included following in the detailed problem statement. Sorry you coundn't find it. It's in the Notes section: a) Your solution will be eligible for ranking only if it satisfies all the constraints for 2018 and 2019. b) We will keep the first year (2018) of your solution for the public leaderboard. You can test your solution any time and see how it ranks.c) We will keep the second year (2019) of your solution for the private leaderboard and it willbe used to determine the finalists. So, cost (and hence leaderboard score) for year 2018 and 2019 will be calculated individually and used for public and private leaderboards, respectively.

2 years ago

James Raubenheimer

Thanks for the clarification Harshil

2 years ago

Folasayo Ogundipe

Hi Guys, I am in for collaboration guys. I currently do not have any team, but I hope to join or form a formidable one. I am completing my MSc in AI and Data Science at Keele University, UK. I am also starting a role as A Data Scientist with the Leeds Institute of Data Analytics, University of Leeds as a Data Scientist. I am available on LinkedIn at hhtp://linkedin.com/in/sayo2rule, on Twitter @sayo2rule or by email at sayo2rule@yahoo.com.

2 years ago

tejasva soni

hi, we can work together, https://www.linkedin.com/in/tejasva-soni-0a81121ab/

2 years ago

LUCAS DE

Hello. I am surching for a team too.

2 years ago

arolharlem

Hi, It's a pleasure if i can join your group. arolharlem@yahoo.fr

2 years ago

Massimiliano Porzio

"Optimized supply chain infrastructure proposed in your solution must be the same for bothyear 2018 and 2019" But forecast errors for 2018 would be different from 2019 forecast so we have to optimize for BOTH year simultaneously?

IMHO the problem is not so clearly described.

2 years ago

Harshil Patel

Hi Massimiliano, i) you are supposed to forecast the biomass for year 2018 and 2019 and then ii) using your forecasted values place supply-chain elements like depots and refinaries. Of couse, biomass forecast for year 2018 and 2019 will be different. However, the supply-chain elements locations are constant irrespective of the year. So, idea is to design a robust supply-chain for a rapildly changing biomass-forecast.

2 years ago

Massimiliano Porzio

Hello everyone, in the sample submission file there are no biomass_demand_supply,pellet_demand_supply . So what are those columns?

2 years ago

Nommie Kashani

Hi Massililiano, I think you mean you dont see them in the dataset (while they are expected in the sample file) .... I am not sure how we are expected to predict biomass_demand_supply & pellet_demand_supply if we were never provided with any figures....

2 years ago

Harshil Patel

You will need to optimize the supply-chain and estimate i) biomass_demand_supply: flow of biomass between harvesting sites and pre-processing depots ii) pellet_demand_supply: flow of pellets between depots and refineries. These quantities will be result of where you place the supply-chain elements (depots and refineries) and of couse will be different for each year based on the forecasted biomass of that year.

2 years ago

Shirish Potu

Hi Massimiliano and Kashani, adding on to what Harshil has mentioned above -

The sample submission file does contain values for biomass_demand_supply and pellet_demand_supply. I would suggest taking another look at all the entries in column B i.e. 'data_type'.
Kashani, Harshil's point directly answers your question. The prediction that you're challenged to do here is with estimating the amount of biomass that is available in the 2418 gridpoints for the years 2018 and 2019. You're provided with historical biomass values to help with this, and you may choose to incorporate other factors as you deem fit. Plotting the historical data as a scatter plot of biomass values corresponding to each gridpoint will help you with visualizing them as figures, if needed. The part about 'biomass_demand_supply' and 'pellet_demand_supply' is not prediction, but the flow of biomass that your supply chain estimates when following the constraints, as Harshil rightly describes it

Hope this helps!

2 years ago