Course Code

BD03

Duration

2 Days

Prerequisites: Basic data processing concepts.
This course introduces Big Data and its current technologies, including Hadoop, MapReduce, and NoSQL data stores such as HBase, Cassandra and MongoDB. Through lecture and demonstrations, we explore current Big Data methods for capturing data, analyzing it with the R language and integrating it with existing database systems.
This course is designed for individuals who want to understand what Big Data is about and the current technologies for capturing, analyzing and reporting on Big Data.

Upon completion of this course, participants will be able to:

  • Define and Characterize Big Data 
  • Investigate Big Data Architecture 
  • Understand Hadoop Components 
  • Explore MapReduce 
  • Know when to use Pig, Pig Latin and Hive
  • Understand basics of HBase, Cassandra and MongoDB 
  • Define Data Science 
  • Investigate Techniques for Analyzing Big Data 
  • Observe the R Language 
  • Explore Leveraging Existing Technology

Introduction to Big Data
Big Data is Big Business
Definitions
Characteristics
Data Realms
2013 Big Data Trends
Big Data Best Practices
Standards
Initial Steps
Economic Value

Big Data Architectures
Choices
Data structure
Structured
Google File System
Unstructured, key-value pair
Column Store
Compression
Hardware
Networks
Access
Cloud Services
Software-Assisted Storage

Hadoop
The Hadoop Approach
Components
History
Apache
Hadoop Distributed File System (HDFS)
Hadoop Clusters
Name Node
Hadoop Operations
Hadoop Commands
Closed source solutions
Using HDFS in MapReduce

MapReduce
MapReduce Basics
Processing/Data Nodes
Job Tracker
The Map
The Reduce
The MapReduce Process
A MapReduce Program
WordCounter
Word Count Driver
Listing and Killing Jobs

Pig and Hive
Pig Overview
Pig Latin
Pig Latin Data Types
Pig Latin Statements
Pig vs SQL
"Word Count" in Pig Latin
Hive Overview
HiveQL
Hive vs Pig
Primitive Data Types
Create Table
Browsing Tables and Partitions

HBase, Cassandra, MongoDB
Hadoop
NoSQL
NoSQL Consistency
CAP theorem
HBase
HBase Features
HBase Data Model
HBase Regions
HBase Architecture
HBase Session
Cassandra
Cassandra Features
Cassandra Data Model
Cassandra Cluster
Cassandra Architecture
Cassandra Query Language
MongoDB
MongoDB Features
MongoDB Data Model
MongoDB Architecture
MongoDB Session

Data Science and Analytics
Data science
Data scientist
Data Science Course
Analytics
K-Means Clustering
Publications

R Language
R Overview
R Capabilities
Language Examples
Mandelbrot Set
R GUI Window
RExcel

Leveraging Your Existing Technology
Big Data in the Oracle World
Big Data in the IBM World
Big Data in the SAP World
Big Data in the Microsoft World