Building predictive models from banking and financial data

Societe Generale generates massive amounts of banking and financial data every day. They wanted to put this data to better use by leveraging the power of crowd-sourcing for data analysis and building predictive models. Having conducted physical hackathons for the past couple of years, Societe Generale wanted to scale its flagship event—Brainwaves—and reach a wider audience with a Machine Learning theme.


What’s their story?

Societe Generale Global Solutions Centre (SG GSC) is a subsidiary of Societe Generale—the French multinational banking and financial services company. SG GSC focuses on long-term vision and developing global best practices to promote the strategic objectives of the SG group. They provide services in the areas of Application Development, Infrastructure Management, Business Process Management, and Knowledge Process Management, to Societe Generale's business lines around the world.


Machine Learning Hackathon as a solution

The hackathon was conducted on HackerEarth’s Data Science platform. Our platform allowed Societe Generale to:

  • Create a customized Machine Learning (ML) challenge using their own data
  • Manage and validate user submissions efficiently

Our ML platform comes with a customized auto-evaluation mechanism. It lets you release datasets to the public. The data is divided into two sets: a training dataset and a test dataset.

The training dataset is the data on which users train their models. After the models are trained, users predict on the test dataset and submit their predictions.

After users submit their prediction files, the models are evaluated on 50% of the test data and scores are awarded to the participants in real-time.

Once the contest is over, models are also evaluated on the remaining 50% of the test dataset to award the final score to each submission.

Note: Only 50% of the test dataset is evaluated during the online phase to discourage participants from overfitting their models to the live scores.
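The two-phase evaluation described above can be sketched roughly as follows. This is a minimal illustration and not HackerEarth's actual implementation: the function names (`split_test_ids`, `accuracy`), the random split, the use of accuracy as the metric, and the toy labels are all assumptions made for the example. It also assumes the final score is computed over the full test dataset, that is, both halves together.

```python
import random


def split_test_ids(test_ids, public_fraction=0.5, seed=0):
    """Randomly partition test row ids into a public and a private subset."""
    ids = sorted(test_ids)
    rng = random.Random(seed)  # fixed seed so every submission sees the same split
    rng.shuffle(ids)
    cut = int(len(ids) * public_fraction)
    return set(ids[:cut]), set(ids[cut:])


def accuracy(predictions, truth, ids):
    """Fraction of rows in `ids` whose predicted label matches the true label."""
    if not ids:
        raise ValueError("no rows to score")
    correct = sum(1 for i in ids if predictions.get(i) == truth[i])
    return correct / len(ids)


# Hypothetical hidden labels for a 10-row test dataset.
truth = {i: i % 2 for i in range(10)}
public_ids, private_ids = split_test_ids(truth.keys())

# Hypothetical submission that predicts label 0 for every row.
submission = {i: 0 for i in range(10)}

# Shown to participants in real time while the contest is running.
public_score = accuracy(submission, truth, public_ids)

# Computed once the contest is over, on the held-back half and on all rows.
private_score = accuracy(submission, truth, private_ids)
final_score = accuracy(submission, truth, public_ids | private_ids)
```

Because participants only ever see `public_score` during the contest, tuning a model to that number does not guarantee a good result on the held-back half.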

Read more about our evaluation mechanism for Machine Learning platforms, here.

The event received an overwhelming response.

  • More than 50% of the participants were experienced professionals
  • Students from the top 10 premier engineering institutes of India participated in the event


Innovate and build a better business using HackerEarth Sprint
