Scala Programming for Data Science
in collaboration with IBM


Universally Recognized Certificates

From IBM and DataTrained

Capstone and Real-Life Projects

Access to 15 real-life projects and a capstone project

Analytics Jobs Placement Assistance

Access to analyticsjobs.in curated jobs

Access to in-demand Tools

IBM Watson labs and $1200 equivalent Cloud Credits

₹19,999 ($320)

for self-paced

About the Program

  • Scala Fundamentals

  • Spark Overview for Scala Analytics

  • Data Science for Scala


Prerequisite

Data Scientists are likely to favor one of 3 programming languages:

. Scala
. R

IBM is an American multinational information technology company headquartered in Armonk, New York, with operations in over 170 countries. IBM is one of the world's largest employers, with over 350,000 employees, known as "IBMers". At least 70% of IBMers are based outside the United States, and the country with the largest number of IBMers is India. IBM employees have been awarded five Nobel Prizes, six Turing Awards, ten National Medals of Technology (USA) and five National Medals of Science (USA).

This collaboration between IBM and DataTrained provides our students with hands-on experience in predictive analytics and advanced computing.

Expectations for this program co-developed with IBM:

1. Industry-recognized certificate from IBM and DataTrained.
2. IBM Cloud Credits for 6 months equivalent to $1200.
3. Access to IBM Cloud platforms such as IBM Watson for hands-on practice.

You will learn to use the R language to access databases and to clean, analyze, and visualize data. Through our guided lectures and access to labs, you will get hands-on experience tackling fascinating data problems. This is an action-packed learning path for data science enthusiasts who wish to work on real-life problems with R.

Programming Languages and Tools Covered

Instructors

Learn from India’s leading software engineering faculty and industry leaders

Learning Path

Course 1: Scala Fundamentals

Learn the foundations of the language for developers and data scientists interested in using Scala for data analysis. Tackle data analysis problems involving Big Data, Scala and Spark. Get a solid understanding of the fundamentals of the language, the tooling, and the development process. Develop a good appreciation of more advanced features.

Module 1 - Introduction

1. Introduction to Scala
2. Creating a Scala Doc
3. Creating a Scala Project
4. The Scala REPL
5. Scala Documentation

Module 2 - Basic Object Oriented Programming

1. Classes
2. Immutable and Mutable Fields
3. Methods
4. Default and Named Arguments
5. Objects

Module 3 - Case Objects and Classes

1. Companion Objects
2. Case Classes and Case Objects
3. Apply and Unapply
4. Synthetic Methods
5. Immutability and Thread Safety

Module 4 - Collections

1. Collections overview
2. Sequences and Sets
3. Options
4. Tuples and Maps
5. Higher Order Functions

Module 5 - Idiomatic Scala

1. For expressions
2. Pattern Matching
3. Handling Options
4. Handling Failures
5. Handling Futures

Course 2: Spark Overview for Scala Analytics

Learn the history of Apache Spark™ and how it came to be, how to build applications with Spark, and develop an understanding of RDDs, DataFrames, and other advanced Spark topics.

. Be prepared to leverage the core RDD and DataFrame APIs to perform analytics on datasets with Scala.

. Get an overview of Spark and its associated ecosystem.

. Gain enough skills to leverage the Map-Reduce framework with the Scala language.

Module 1 - What is Spark?
Module 2 - Introduction to RDDs
Module 3 - Introduction to DataFrames
Module 4 - Advanced Spark Topics
Module 5 - Introduction to Spark MLlib

Course 3: Data Science for Scala

In this course you will learn about basic statistics and data types, preparing data, feature engineering, fitting a model, and pipelines and grid search. Apache Spark™ is a fast and general engine for large-scale data processing, with built-in modules for streaming, machine learning, and graph processing. This course shows you how to use Spark’s machine learning pipelines to fit models and search for optimal hyperparameters using a Spark cluster.

Module 1 - Basic Statistics and Data Types

. Vectors and Labelled Points
. Local and Distributed Matrices
. Summary Statistics, Correlations, and Random Data
. Sampling
. Hypothesis Testing

Module 2 - Preparing Data

. Statistics, Random data and Sampling on Data Frames
. Handling Missing Data and Imputing Values
. Transformers and Estimators
. Data Normalization
. Identifying Outliers

Module 3 - Feature Engineering

. Feature Vectors
. Categorical Features
. Using Explode, User Defined Functions, and Pivot
. Principal Component Analysis (PCA) in Feature Engineering
. RFormulas

Module 4 - Fitting a Model

. Decision Trees
. Random Forests
. Gradient-Boosting Trees
. Linear Methods
. Evaluation

Module 5 - Pipeline and Grid Search

. Predicting Grant Applications: Introduction
. Predicting Grant Applications: Creating Features
. Predicting Grant Applications: Building a Pipeline
. Predicting Grant Applications: Cross Validation and Model Tuning
. Predicting Grant Applications: Wrapping up

Scala Programming for Data Science in collaboration with IBM

Become eligible for 3 world-class certifications, adding an extra edge to your resume.
  • Alumni Status
  • Learning paths and certification from IBM
  • Course completion certificate from DataTrained Education
  • Project completion certificate from DataTrained Education

Admission Process

The Admission Process has 3 simple steps, detailed below.
Step 1: Fill in a Query Form
Fill in the Query Form, and one of our counselors will call you to assess your eligibility.
Step 2: Get Shortlisted & Receive a Call
Our Admissions Committee will review your profile. Upon qualifying, an email will be sent to you confirming your admission to the Program.
Step 3: Block your Seat & Begin the Prep Course
Block your seat with a payment of INR 10,000 to enroll in the program. Begin with your Prep course and start your Data Science journey!

Program Fee

No-cost EMI options are also available. *

What's Included in the Price

Features/Benefits
  • Industry recognized certificate from IBM
  • Access to 15 real-life projects and a capstone project
  • IBM Watson labs and $1200 equivalent Cloud Credits


Career Impact

Over 500 Careers Transformed
Average Salary Hike
Highest Salary
Jobs Sourced
Hiring Partners

Frequently Asked Questions

Yes, you will receive both a course completion certificate and a project completion certificate from DataTrained.
There are two types of projects:

a. Practice projects: Your mentor will first work through 2-3 projects for you; you will then do the next 3-4 projects yourself, with help from your mentor and through tickets.

b. Evaluation projects: Once you’re done with the practice projects, you get access to the evaluation projects.
No, the program is designed so that you can continue with your job alongside it. It is a mix of pre-recorded videos, live classes, and printed study material. Every topic is project-based and taught according to current market scenarios. The course modules are covered under the guidance of industry experts.
There are two training modes:

a. Self-paced: You will get access to the joint DataTrained and IBM LMS, where you will be assigned courses and projects. You will need to go through these courses and complete the projects at your own pace. Mentor support will be provided.

b. Blended: You will get access to the joint DataTrained and IBM LMS, where you will be assigned courses and projects. You will need to go through these courses and complete the projects at your own pace. In addition to these courses, live online classes are conducted for you on Saturdays and Sundays. Mentor support will be provided.
If you miss a class, you need not worry. Recordings of all live classes will be available on your LMS, so you can watch them and practice the concepts in your own time.
We have partnered with analyticsjobs.in to provide placement assistance for learners who successfully complete our programs. Analytics Jobs is a leading media and job portal company focused on jobs in Data Science, Analytics, Automation, RPA, Cloud, Blockchain, and computer science.
The program fee is Rs. 19,999 for the self-paced mode and Rs. 14,999 for the blended mode (self-paced plus live sessions). For international students, the program fee is $350 plus taxes for the self-paced mode.
For Queries and Suggestions


Email us for Enrolment Queries at admissions@datatrained.com
Email us for Payment and Other Queries at support@datatrained.com