This course gives Java programmers a deep dive into Hadoop application development. Participants will learn how to design and develop MapReduce applications for Hadoop and how to manipulate, analyze and perform computations on their Big Data.
The Cassandra NoSQL database is one of the most powerful and widely used non-relational databases available today. It is a fault-tolerant, highly scalable database with tunable consistency that meets the demanding requirements of the "can't fail, must scale" systems driving growth for many of today's most successful enterprises. However, that capability comes with a new data and programming model that many organizations lack the expertise to use optimally. This course provides a technical introduction to all the conceptual and practical areas needed to use Cassandra successfully. It is written expressly for the new capabilities in version 2.0, including CQL 3.1. The course provides a solid foundation in the architecture and data model of Cassandra and how to work with it. It covers CQL3 in detail, as well as important data modeling techniques to optimize your usage of the database. After taking this course, you will be ready to work with Cassandra in an informed and productive manner, including using CQL3.
This intensive training course covers the theoretical and technical aspects of Data Science and Business Analytics. It covers the fundamental and advanced concepts and methods of deriving business insights from raw data using cost-effective data processing solutions. The course is supplemented by hands-on labs that help attendees reinforce their theoretical knowledge of the material.
This one-day primer introduces Cloud Computing, Big Data, and the emerging discipline of Data Analytics. Attention will be given to the three V’s of Big Data (Volume, Velocity, and Variety) as well as the fourth V, Value. You’ll learn about these critical elements and the powerful value proposition that these capabilities provide. What processes, tools, and personnel will be needed to take advantage of this sea change in information management? This essential course will equip you to understand your customers better and to deliver more value today.
This intensive training course covers the theoretical and technical aspects of Data Science and Business Analytics. It covers the fundamental and advanced concepts and methods of deriving business insights from Big Data. The course is supplemented by hands-on labs that help attendees reinforce their theoretical knowledge of the material.
The NoSQL (Not Only SQL) persistence systems space offers such a great variety of solutions that it can be overwhelming. This class aims to help attendees understand the challenges of the emerging world of Big Data and identify suitable use cases for a variety of Big Data and NoSQL systems such as Pig, Hive, HBase, Cassandra and MongoDB. Attendees will also be given some underlying architecture details of those systems to enable them to make informed decisions about using NoSQL systems when they return to work.
This course covers Big Data and current technologies, including Hadoop, MapReduce, and NoSQL data stores such as HBase, Cassandra and MongoDB. Through lecture and demonstrations, we explore current Big Data methods for capturing data, analyzing it with the R language, and integrating it with existing database systems.
This course is for Database Developers who want a fast-paced, intense introduction to Teradata programming. We will cover the parallel architecture of Teradata and its virtual processing environment. Participants will create various table types and learn when to use each. They will also create procedural code in macros, user-defined functions and stored procedures. They will understand and create BEFORE and AFTER triggers. We will investigate the various transaction control techniques and learn how to use them. Finally, participants will export and import data.
This course will help you master MongoDB, one of the most popular NoSQL databases, and is designed to provide the knowledge and skills needed to become a successful MongoDB expert. The course covers a range of NoSQL and MongoDB topics such as CRUD Operations, Schema Design, Data Modelling, Scalability and Administration. We also cover Indexes and performance optimizations.
This course will help you master the Apache Spark platform, with added emphasis on Analytics with Machine Learning. Spark enables participants to build complete, unified big data applications combining batch, streaming, and interactive analytics on all their data. With Spark, developers can write sophisticated parallel applications that support faster decisions, better decisions, and real-time actions across a wide variety of use cases, architectures, and industries. The course covers the core APIs for using Spark, as well as the fundamental mechanisms and basic internals of the platform. Participants will use the Spark Shell, Python and SQL to access and transform data. We also cover Jupyter Notebooks, Anaconda and Apache Parquet.