Introduction to noSQL

Beginner

Duration: 3 days

Certification: Certificate of attendance

Who is this course for?
  • Beginners in noSQL.
Prerequisite knowledge and skills
  • General knowledge of IT systems architecture.
Course overview

This course introduces the distributed data storage layer in Big Data architectures and the main noSQL concepts, then details three of the best-known noSQL solutions: the wide-column stores Apache Cassandra and Apache HBase, and the document store MongoDB. For each solution we cover the concepts and run hands-on sessions to see which kinds of queries work well and which do not. If you come from relational databases and are interested in noSQL, the course will help you understand the possible use cases for noSQL solutions and the requirements of natively distributed systems, which drive most of the concepts behind noSQL systems.

Topics covered
  1. Big Data architecture review and introduction to potential storage components (file and database storage) (0.5 days)
    • Role of distributed storage layer in a big data architecture
    • Components of the storage layer:
      • Distributed file storage
        • HDFS
        • Other file storage
      • Distributed databases
    • noSQL Overview
      • Distributed databases: common concepts
      • Consistency & the CAP Theorem
      • noSQL solutions overview
    • Distributed File storage: HDFS
      • Hadoop architecture components
      • HDFS Overview
      • HDFS and the computing and analysis layer of Big Data architectures
    • Analysis layer overview
      • Apache Hive, Impala, Spark SQL overview
      • HDFS & Hive
      • Hive overview
      • Creating tables (internal/external), SerDes, partitions, buckets, file formats, …
  2. Wide-column store noSQL: Apache Cassandra (0.75 days)
    • The column-family data model: why it is better suited to specific use cases
    • Cassandra use cases
    • Main concepts: data partitioning, distribution, replication and consistency, write and read paths (cluster and node level)
    • Architecture: gossip, snitch, multi-datacenter deployments
    • Data model: keyspace, table, partition key(s), clustering key(s), indexes
    • Cassandra Query Language (creating keyspaces and tables, inserting and updating data, deleting and querying data)
  3. Wide-column store noSQL: Apache HBase (0.75 days)
    • Best-known HBase use cases and differences from Apache Cassandra
    • Architecture: master, meta, zookeeper, region servers, HDFS
    • Basic concepts: row-key, tables, regions, persistence of data, write and read paths
    • Region scaling and auto-scaling
    • Data Model
    • Row Key Design
    • Operations: write, read and delete data
  4. Document store noSQL: MongoDB (0.5 days)
    • The Document Data Model (documents, collections)
    • MongoDB Use Cases
    • Architecture:
      • Replication via Replica Sets
      • MongoDB and the CAP Theorem
      • Auto-Sharding, Shard Keys, and Horizontal Scalability
    • DML
    • Indexing
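
As a small taste of the replication and consistency topics listed above, here is a minimal Python sketch of the quorum arithmetic commonly used to reason about replicated stores. The function names are hypothetical and are not part of any database API; the sketch only shows that a read set and a write set are guaranteed to share at least one replica when their sizes add up to more than the replication factor.

```python
def quorum(replication_factor: int) -> int:
    """Smallest majority of replicas: floor(N / 2) + 1."""
    if replication_factor < 1:
        raise ValueError("replication_factor must be at least 1")
    return replication_factor // 2 + 1


def read_and_write_sets_overlap(n: int, r: int, w: int) -> bool:
    """True when any r replicas read and any w replicas written must share a replica.

    n is the replication factor, r the number of replicas contacted on a read,
    w the number of replicas that acknowledge a write (illustrative names).
    """
    if not (1 <= r <= n and 1 <= w <= n):
        raise ValueError("r and w must be between 1 and n")
    return r + w > n
```

With a replication factor of 3, `quorum(3)` is 2, and reading and writing at quorum gives `read_and_write_sets_overlap(3, 2, 2) == True`, whereas reading and writing a single replica does not guarantee overlap.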
Skills acquired after the course
  • Understand the role of the distributed data storage layer in a Big Data architecture
  • Understand the main noSQL concepts
  • Understand the details of the best-known noSQL solutions: the wide-column stores Apache Cassandra and Apache HBase, and the document store MongoDB

Course Requirements:

  • An open Internet connection is needed throughout the course. Before the course, please check that an open Wi-Fi connection is available at the training location;
  • Each participant needs their own computer to run the exercises. Before the course, please make sure its settings allow access to Google Docs and GitHub, which are used to share the presenters' slides, documents and exercises;
  • All solutions run in the public cloud, so please also check before the course that:
    • Port 22 is open for outbound connections;
    • An SSH client is installed;
    • Google Chrome is installed.
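
The outbound port 22 requirement can be checked ahead of time from the training location. The following is a minimal sketch, not an official course tool: the helper name `can_connect` is hypothetical, and the host you test against is your own choice (any SSH server you know to be reachable on port 22).

```python
import socket


def can_connect(host: str, port: int, timeout: float = 5.0) -> bool:
    """Return True if a TCP connection to host:port can be opened within the timeout."""
    try:
        # create_connection resolves the host name and attempts a TCP handshake
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and DNS resolution failures
        return False
```

For example, calling `can_connect("github.com", 22)` from the training network and getting `True` suggests that outbound SSH traffic is allowed; `False` means the connection was blocked, timed out, or the host could not be resolved.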
