Deep Learning & Parameter Tuning with MXnet, H2o Package in R

from the UC Irivine ML repository. Let's start with H2O. This data set isn't the most ideal one to work with in neural networks. However, the motive of this hands-on section is to make you familiar with model-building processes.

H2O Package

H2O package provides h2o.deeplearning function for model building. It is built on Java. Primarily, this function is useful to build multilayer feedforward neural networks. It is enabled with several features such as the following:

Multi-threaded distributed parallel computation
Adaptive learning rate (or step size) for faster convergence
Regularization options such as L1 and L2 which help prevent overfitting
Automatic missing value imputation
Hyperparameter optimization using grid/random search

There are many more!For optimization, this package uses the hogwild method instead of stochastic gradient descent. Hogwild is just parallelized version of SGD.Let's understand the parameters involved in model building with h2o. Both the packages have different nomenclatures, so make sure you don't get confused. Since most of the parameters are easy to understand by their names, I'll mention the important ones:

hidden - It specifies the number of hidden layers and number of neurons in each layer in the architechture.
epochs - It specifies the number of iterations to be done on the data set.
rate - It specifies the learning rate.
activation - It specifies the type of activation function to use. In h2o, the major activation functions are Tanh, Rectifier, and Maxout.

Let's quickly load the data and get over with sanitary data pre-processing steps:

	path = "~/mydata/deeplearning"
	setwd(path)

	#load libraries
	library(data.table)
	library(mlr)
	#set variable names
	setcol <- c("age",
	"workclass",
	"fnlwgt",
	"education",
	"education-num",
	"marital-status",
	"occupation",
	"relationship",
	"race",
	"sex",
	"capital-gain",
	"capital-loss",
	"hours-per-week",
	"native-country",
	"target")

	#load data
	train <- read.table("adultdata.txt",header = F,sep = ",",col.names = setcol,na.strings = c(" ?"),stringsAsFactors = F)
	test <- read.table("adulttest.txt",header = F,sep = ",",col.names = setcol,skip = 1, na.strings = c(" ?"),stringsAsFactors = F)

	setDT(train)
	setDT(test)

	#Data Sanity
	dim(train) #32561 X 15
	dim(test) #16281 X 15
	str(train)
	str(test)

	#check missing values
	table(is.na(train))
	sapply(train, function(x) sum(is.na(x))/length(x))*100
	table(is.na(test))
	sapply(test, function(x) sum(is.na(x))/length(x))*100

	#check target variable
	#binary in nature check if data is imbalanced
	train[,.N/nrow(train),target]
	test[,.N/nrow(test),target]

	#remove extra characters
	test[,target := substr(target,start = 1,stop = nchar(target)-1)]

	#remove leading whitespace
	library(stringr)
	char_col <- colnames(train)[sapply(test,is.character)]

	for(i in char_col)
	set(train,j=i,value = str_trim(train[[i]],side = "left"))

	#set all character variables as factor
	fact_col <- colnames(train)[sapply(train,is.character)]

	for(i in fact_col)
	set(train,j=i,value = factor(train[[i]]))

	for(i in fact_col)
	set(test,j=i,value = factor(test[[i]]))
	#impute missing values
	imp1 <- impute(data = train,target = "target",classes = list(integer = imputeMedian(), factor = imputeMode()))
	imp2 <- impute(data = test,target = "target",classes = list(integer = imputeMedian(), factor = imputeMode()))

	train <- setDT(imp1$data)
	test <- setDT(imp2$data)

view raw DL1_H2o.R hosted with ❤ by GitHub

Now, let's build a simple deep learning model. Generally, computing variable importance from a trained deep learning model is quite pain staking. But, h2o package provides an effortless function to compute variable importance from a deep learning model.

	#load the package
	require(h2o)

	#start h2o
	localH2o <- h2o.init(nthreads = -1, max_mem_size = "20G")

	#load data on H2o
	trainh2o <- as.h2o(train)
	testh2o <- as.h2o(test)

	#set variables
	y <- "target"
	x <- setdiff(colnames(trainh2o),y)

	#train the model - without hidden layer
	deepmodel <- h2o.deeplearning(x = x
	,y = y
	,training_frame = trainh2o
	,standardize = T
	,model_id = "deep_model"
	,activation = "Rectifier"
	,epochs = 100
	,seed = 1
	,nfolds = 5
	,variable_importances = T)

	#compute variable importance and performance
	h2o.varimp_plot(deepmodel,num_of_features = 20)
	h2o.performance(deepmodel,xval = T) #84.5 % CV accuracy

view raw DL2.H2o.R hosted with ❤ by GitHub

Now, let's train a deep learning model with one hidden layer comprising five neurons. This time instead of checking the cross-validation accuracy, we'll validate the model on test data.

	deepmodel <- h2o.deeplearning(x = x
	,y = y
	,training_frame = trainh2o
	,validation_frame = testh2o
	,standardize = T
	,model_id = "deep_model"
	,activation = "Rectifier"
	,epochs = 100
	,seed = 1
	,hidden = 5
	,variable_importances = T)
	h2o.performance(deepmodel,valid = T) #85.6%

view raw DL3_H2o.R hosted with ❤ by GitHub

For hyperparameter tuning, we'll perform a random grid search over all parameters and choose the model which returns highest accuracy.

	#set parameter space
	activation_opt <- c("Rectifier","RectifierWithDropout", "Maxout","MaxoutWithDropout")
	hidden_opt <- list(c(10,10),c(20,15),c(50,50,50))
	l1_opt <- c(0,1e-3,1e-5)
	l2_opt <- c(0,1e-3,1e-5)

	hyper_params <- list( activation=activation_opt,
	hidden=hidden_opt,
	l1=l1_opt,
	l2=l2_opt )

	#set search criteria
	search_criteria <- list(strategy = "RandomDiscrete", max_models=10)

	#train model
	dl_grid <- h2o.grid("deeplearning"
	,grid_id = "deep_learn"
	,hyper_params = hyper_params
	,search_criteria = search_criteria
	,training_frame = trainh2o
	,x=x
	,y=y
	,nfolds = 5
	,epochs = 100)

	#get best model
	d_grid <- h2o.getGrid("deep_learn",sort_by = "accuracy")
	best_dl_model <- h2o.getModel(d_grid@model_ids[[1]])
	h2o.performance (best_dl_model,xval = T) #CV Accuracy - 84.7%

view raw DL4.H2o.R hosted with ❤ by GitHub

MXNetR Package

The mxnet package provides an incredible interface to build feedforward NN, recurrent NN and convolutional neural networks (CNNs). CNNs are being widely used in detecting objects from images. The team that created xgboost also created this package. Currently, mxnet is being popularly used in kaggle competitions for image classification problems.

This package can be easily connected with GPUs as well. The process of building model architecture is quite intuitive. It gives greater control to configure the neural network manually.

Let's get some hands-on experience using this package.Follow the commands below to install this package in your respective OS. For Windows and Linux users, installation commands are given below. For Mac users, here's the installation procedure.

	# # Installation - Windows
	install.packages("drat", repos="https://cran.rstudio.com")
	drat:::addRepo("dmlc")
	install.packages("mxnet")
	library(mxnet)

	#Installation - Linux
	#Press Ctrl + Alt + T and run the following command
	sudo apt-get update
	sudo apt-get -y install git
	git clone https://github.com/dmlc/mxnet.git ~/mxnet --recursive
	cd ~/mxnet/setup-utils
	bash install-mxnet-ubuntu-r.sh

view raw mxnet1.R hosted with ❤ by GitHub

In R, mxnet accepts target variables as numeric classes and not factors. Also, it accepts data frame as a matrix. Now, we'll make the required changes:

	#load package
	require(mxnet)

	#convert target variables into numeric
	train[,target := as.numeric(target)-1]
	test[,target := as.numeric(target)-1]

	#convert train data to matrix
	train.x <- data.matrix(train[,-c("target"),with=F])
	train.y <- train$target

	#convert test data to matrix
	test.x <- data.matrix(test[,-c("target"),with=F])
	test.y <- test$target

view raw mxnet2.R hosted with ❤ by GitHub

Now, we'll train the multilayered perceptron model using the mx.mlp function.

	#set seed to reproduce results
	mx.set.seed(1)

	mlpmodel <- mx.mlp(data = train.x
	,label = train.y
	,hidden_node = 3 #one layer with 10 nodes
	,out_node = 2
	,out_activation = "softmax" #softmax return probability
	,num.round = 100 #number of iterations over training data
	,array.batch.size = 20 #after every batch weights will get updated
	,learning.rate = 0.03 #same as step size
	,eval.metric= mx.metric.accuracy
	,eval.data = list(data = test.x, label = test.y))

view raw mxnet3.R hosted with ❤ by GitHub

Softmax function is used for binary and multi-classification problems. Alternatively, you can also manually craft the model structure.

	#create NN structure
	data <- mx.symbol.Variable("data")
	fc1 <- mx.symbol.FullyConnected(data, num_hidden=3) #3 neuron in one layer
	lrm <- mx.symbol.SoftmaxOutput(fc1)

view raw mxnet4.R hosted with ❤ by GitHub

We have configured the network above with one hidden layer carrying three neurons. We have chosen softmax as the output function. The network optimizes for squared loss for regression, and the network optimizes for classification accuracy for classification. Now, we'll train the network:

	nnmodel <- mx.model.FeedForward.create(symbol = lrm
	,X = train.x
	,y = train.y
	,ctx = mx.cpu()
	,num.round = 100
	,eval.metric = mx.metric.accuracy
	,array.batch.size = 50
	,learning.rate = 0.01)

view raw mxnet5.R hosted with ❤ by GitHub

Similarly, we can configure a more complexed network fed with hidden layers.

	#configure another network
	data <- mx.symbol.Variable("data")
	fc1 <- mx.symbol.FullyConnected(data, name = "fc1", num_hidden=10) #1st hidden layer
	act1 <- mx.symbol.Activation(fc1, name = "sig", act_type="relu")
	fc2 <- mx.symbol.FullyConnected(act1, name = "fc2", num_hidden=2) #2nd hidden layer
	out <- mx.symbol.SoftmaxOutput(fc2, name = "soft")

view raw mxnet6.R hosted with ❤ by GitHub

Understand it carefully: After feeding the input through data, the first hidden layer consists of 10 neurons. The output of each neuron passes through a relu (rectified linear) activation function. We have used it in place of sigmoid. relu converges faster than a sigmoid function. You can read more about relu here.

Then, the output is fed into the second layer which is the output layer. Since our target variable has two classes, we've chosen num_hidden as 2 in the second layer. Finally, the output from second layer is made to pass though softmax output function.

	#train the network
	dp_model <- mx.model.FeedForward.create(symbol = out
	,X = train.x
	,y = train.y
	,ctx = mx.cpu()
	,num.round = 100
	,eval.metric = mx.metric.accuracy
	,array.batch.size = 50
	,learning.rate = 0.005)

view raw mxnet7.R hosted with ❤ by GitHub

As mentioned above, this trained model predicts output probability, which can be easily transformed into a label using a threshold value (say, 0.5). To make predictions on the test set, we do this:

	#predict on test
	pred_dp <- predict(dp_model,test.x)
	str(pred_dp) #contains 2 rows and 16281 columns

	#transpose the pred matrix
	pred.val <- max.col(t(pred_dp))-1

view raw mxnet8.R hosted with ❤ by GitHub

The predicted matrix returns two rows and 16281 columns, each column carrying probability. Using the max.col function, we can extract the maximum value from each row. If you check the model's accuracy, you'll find that this network performs terribly on this data. In fact, it gives no better result than the train accuracy! On this data set, xgboost tuning gave 87% accuracy!

If you are familiar with the model building process, I'd suggest you to try working on the popular MNIST data set. You can find tons of tutorials on this data to get you going!

Summary

Deep Learning is getting increasingly popular in solving most complex problems such as image recognition, natural language processing, etc. If you are aspiring for a career in machine learning, this is the best time for you to get into this subject. The motive of this article was to introduce you to the fundamental concepts of deep learning.In this article, we learned about the basics of deep learning (perceptrons, neural networks, and multilayered neural networks). We learned deep learning as a technique is composed of several algorithms such as backpropagration and gradient descent to optimize the networks. In the end, we gained some hands-on experience in developing deep learning models.Do let me know if you have any feedback, suggestions, or thoughts on this article in the comments below!

Revolutionizing Mobile Talent Hiring: The HackerEarth Advantage

The demand for mobile applications is exploding, but finding and verifying developers with proven, real-world skills is more difficult than ever. Traditional assessment methods often fall short, failing to replicate the complexities of modern mobile development.

Introducing a New Era in Mobile Assessment

At HackerEarth, we're closing this critical gap with two groundbreaking features, seamlessly integrated into our Full Stack IDE:

Now, assess mobile developers in their true native environment. Our enhanced Full Stack questions now offer full support for both Java and Kotlin, the core languages powering the Android ecosystem. This allows you to evaluate candidates on authentic, real-world app development skills, moving beyond theoretical knowledge to practical application.

Say goodbye to setup drama and tool-switching. Candidates can now build, test, and debug Android and React Native applications directly within the browser-based IDE. This seamless, in-browser experience provides a true-to-life evaluation, saving valuable time for both candidates and your hiring team.

Assess the Skills That Truly Matter

With native Android support, your assessments can now delve into a candidate's ability to write clean, efficient, and functional code in the languages professional developers use daily. Kotlin's rapid adoption makes proficiency in it a key indicator of a forward-thinking candidate ready for modern mobile development.

Breakup of Mobile development skills ~95% of mobile app dev happens through Java and Kotlin — This chart illustrates the importance of assessing proficiency in both modern (Kotlin) and established (Java) codebases.

Streamlining Your Assessment Workflow

The integrated mobile emulator fundamentally transforms the assessment process. By eliminating the friction of fragmented toolchains and complex local setups, we enable a faster, more effective evaluation and a superior candidate experience.

Old Fragmented Way vs. The New, Integrated Way — Visualize the stark difference: Our streamlined workflow removes technical hurdles, allowing candidates to focus purely on demonstrating their coding and problem-solving abilities.

Quantifiable Impact on Hiring Success

A seamless and authentic assessment environment isn't just a convenience, it's a powerful catalyst for efficiency and better hiring outcomes. By removing technical barriers, candidates can focus entirely on demonstrating their skills, leading to faster submissions and higher-quality signals for your recruiters and hiring managers.

A Better Experience for Everyone

Our new features are meticulously designed to benefit the entire hiring ecosystem:

For Recruiters & Hiring Managers:

Accurately assess real-world development skills.
Gain deeper insights into candidate proficiency.
Hire with greater confidence and speed.
Reduce candidate drop-off from technical friction.

For Candidates:

Enjoy a seamless, efficient assessment experience.
No need to switch between different tools or manage complex setups.
Focus purely on showcasing skills, not environment configurations.
Work in a powerful, professional-grade IDE.

Unlock a New Era of Mobile Talent Assessment

Stop guessing and start hiring the best mobile developers with confidence. Explore how HackerEarth can transform your tech recruiting.

A New Era of Code

Vibe coding is a new method of using natural language prompts and AI tools to generate code. I have seen firsthand that this change makes software more accessible to everyone. In the past, being able to produce functional code was a strong advantage for developers. Today, when code is produced quickly through AI, the true value lies in designing, refining, and optimizing systems. Our role now goes beyond writing code; we must also ensure that our systems remain efficient and reliable.

From Machine Language to Natural Language

I recall the early days when every line of code was written manually. We progressed from machine language to high-level programming, and now we are beginning to interact with our tools using natural language. This development does not only increase speed but also changes how we approach problem solving. Product managers can now create working demos in hours instead of weeks, and founders have a clearer way of pitching their ideas with functional prototypes. It is important for us to rethink our role as developers and focus on architecture and system design rather than simply on typing c

The Promise and the Pitfalls

I have experienced both sides of vibe coding. In cases where the goal was to build a quick prototype or a simple internal tool, AI-generated code provided impressive results. Teams have been able to test new ideas and validate concepts much faster. However, when it comes to more complex systems that require careful planning and attention to detail, the output from AI can be problematic. I have seen situations where AI produces large volumes of code that become difficult to manage without significant human intervention.

AI-powered coding tools like GitHub Copilot and AWS’s Q Developer have demonstrated significant productivity gains. For instance, at the National Australia Bank, it’s reported that half of the production code is generated by Q Developer, allowing developers to focus on higher-level problem-solving . Similarly, platforms like Lovable enable non-coders to build viable tech businesses using natural language prompts, contributing to a shift where AI-generated code reduces the need for large engineering teams. However, there are challenges. AI-generated code can sometimes be verbose or lack the architectural discipline required for complex systems. While AI can rapidly produce prototypes or simple utilities, building large-scale systems still necessitates experienced engineers to refine and optimize the code.

The Economic Impact

The democratization of code generation is altering the economic landscape of software development. As AI tools become more prevalent, the value of average coding skills may diminish, potentially affecting salaries for entry-level positions. Conversely, developers who excel in system design, architecture, and optimization are likely to see increased demand and compensation.
Seizing the Opportunity

Vibe coding is most beneficial in areas such as rapid prototyping and building simple applications or internal tools. It frees up valuable time that we can then invest in higher-level tasks such as system architecture, security, and user experience. When used in the right context, AI becomes a helpful partner that accelerates the development process without replacing the need for skilled engineers.

This is revolutionizing our craft, much like the shift from machine language to assembly to high-level languages did in the past. AI can churn out code at lightning speed, but remember, “Any fool can write code that a computer can understand. Good programmers write code that humans can understand.” Use AI for rapid prototyping, but it’s your expertise that transforms raw output into robust, scalable software. By honing our skills in design and architecture, we ensure our work remains impactful and enduring. Let’s continue to learn, adapt, and build software that stands the test of time.

Ready to streamline your recruitment process? Get a free demo to explore cutting-edge solutions and resources for your hiring needs.

What is Systems Design?

Systems Design is an all encompassing term which encapsulates both frontend and backend components harmonized to define the overall architecture of a product.

Designing robust and scalable systems requires a deep understanding of application, architecture and their underlying components like networks, data, interfaces and modules.

Systems Design, in its essence, is a blueprint of how software and applications should work to meet specific goals. The multi-dimensional nature of this discipline makes it open-ended – as there is no single one-size-fits-all solution to a system design problem.

What is a System Design Interview?

Conducting a System Design interview requires recruiters to take an unconventional approach and look beyond right or wrong answers. Recruiters should aim for evaluating a candidate’s ‘systemic thinking’ skills across three key aspects:

How they navigate technical complexity and navigate uncertainty
How they meet expectations of scale, security and speed
How they focus on the bigger picture without losing sight of details

This assessment of the end-to-end thought process and a holistic approach to problem-solving is what the interview should focus on.

What are some common topics for a System Design Interview

System design interview questions are free-form and exploratory in nature where there is no right or best answer to a specific problem statement. Here are some common questions:

How would you approach the design of a social media app or video app?

What are some ways to design a search engine or a ticketing system?

How would you design an API for a payment gateway?

What are some trade-offs and constraints you will consider while designing systems?

What is your rationale for taking a particular approach to problem solving?

Usually, interviewers base the questions depending on the organization, its goals, key competitors and a candidate’s experience level.

For senior roles, the questions tend to focus on assessing the computational thinking, decision making and reasoning ability of a candidate. For entry level job interviews, the questions are designed to test the hard skills required for building a system architecture.

The Difference between a System Design Interview and a Coding Interview

If a coding interview is like a map that takes you from point A to Z – a systems design interview is like a compass which gives you a sense of the right direction.

Here are three key difference between the two:

Coding challenges follow a linear interviewing experience i.e. candidates are given a problem and interaction with recruiters is limited. System design interviews are more lateral and conversational, requiring active participation from interviewers.

Coding interviews or challenges focus on evaluating the technical acumen of a candidate whereas systems design interviews are oriented to assess problem solving and interpersonal skills.

Coding interviews are based on a right/wrong approach with ideal answers to problem statements while a systems design interview focuses on assessing the thought process and the ability to reason from first principles.

How to Conduct an Effective System Design Interview

One common mistake recruiters make is that they approach a system design interview with the expectations and preparation of a typical coding interview.
Here is a four step framework technical recruiters can follow to ensure a seamless and productive interview experience:

Step 1: Understand the subject at hand

Develop an understanding of basics of system design and architecture
Familiarize yourself with commonly asked systems design interview questions
Read about system design case studies for popular applications
Structure the questions and problems by increasing magnitude of difficulty

Step 2: Prepare for the interview

Plan the extent of the topics and scope of discussion in advance
Clearly define the evaluation criteria and communicate expectations
Quantify constraints, inputs, boundaries and assumptions
Establish the broader context and a detailed scope of the exercise

Step 3: Stay actively involved

Ask follow-up questions to challenge a solution
Probe candidates to gauge real-time logical reasoning skills
Make it a conversation and take notes of important pointers and outcomes
Guide candidates with hints and suggestions to steer them in the right direction

Step 4: Be a collaborator

Encourage candidates to explore and consider alternative solutions
Work with the candidate to drill the problem into smaller tasks
Provide context and supporting details to help candidates stay on track
Ask follow-up questions to learn about the candidate’s experience

Technical recruiters and hiring managers should aim for providing an environment of positive reinforcement, actionable feedback and encouragement to candidates.

Evaluation Rubric for Candidates

Facilitate Successful System Design Interview Experiences with FaceCode

FaceCode, HackerEarth’s intuitive and secure platform, empowers recruiters to conduct system design interviews in a live coding environment with HD video chat.

FaceCode comes with an interactive diagram board which makes it easier for interviewers to assess the design thinking skills and conduct communication assessments using a built-in library of diagram based questions.

With FaceCode, you can combine your feedback points with AI-powered insights to generate accurate, data-driven assessment reports in a breeze. Plus, you can access interview recordings and transcripts anytime to recall and trace back the interview experience.

Learn how FaceCode can help you conduct system design interviews and boost your hiring efficiency.

Deep Learning & Parameter Tuning with MXnet, H2o Package in R

H2O Package

MXNetR Package

Summary

Thank you for subscribing!

Hire top tech talent with our recruitment platform

Discover more articles

The Mobile Dev Hiring Landscape Just Changed

Revolutionizing Mobile Talent Hiring: The HackerEarth Advantage

Introducing a New Era in Mobile Assessment

Assess the Skills That Truly Matter

Streamlining Your Assessment Workflow

Quantifiable Impact on Hiring Success

A Better Experience for Everyone

Unlock a New Era of Mobile Talent Assessment

Vibe Coding: Shaping the Future of Software

A New Era of Code

From Machine Language to Natural Language

The Promise and the Pitfalls

The Economic Impact

Guide to Conducting Successful System Design Interviews in 2025

What is Systems Design?

What is a System Design Interview?

What are some common topics for a System Design Interview

The Difference between a System Design Interview and a Coding Interview

How to Conduct an Effective System Design Interview

Step 1: Understand the subject at hand

Step 2: Prepare for the interview

Step 3: Stay actively involved

Step 4: Be a collaborator

Evaluation Rubric for Candidates

Facilitate Successful System Design Interview Experiences with FaceCode

Explore HackerEarth’s top products for Hiring & Innovation

Deep Learning & Parameter Tuning with MXnet, H2o Package in R

H2O Package

MXNetR Package

Summary

Subscribe to The HackerEarth Blog

Thank you for subscribing!

Hire top tech talent with our recruitment platform

Discover more articles

The Mobile Dev Hiring Landscape Just Changed

Revolutionizing Mobile Talent Hiring: The HackerEarth Advantage

Introducing a New Era in Mobile Assessment

Assess the Skills That Truly Matter

Streamlining Your Assessment Workflow

Quantifiable Impact on Hiring Success

A Better Experience for Everyone

Unlock a New Era of Mobile Talent Assessment

Vibe Coding: Shaping the Future of Software

A New Era of Code

From Machine Language to Natural Language

The Promise and the Pitfalls

The Economic Impact

Guide to Conducting Successful System Design Interviews in 2025

What is Systems Design?

What is a System Design Interview?

What are some common topics for a System Design Interview

The Difference between a System Design Interview and a Coding Interview

How to Conduct an Effective System Design Interview

Step 1: Understand the subject at hand

Step 2: Prepare for the interview

Step 3: Stay actively involved

Step 4: Be a collaborator

Evaluation Rubric for Candidates

Facilitate Successful System Design Interview Experiences with FaceCode

Explore HackerEarth’s top products for Hiring & Innovation