Azure Data Architecture Guide

User Manual: Pdf

Open the PDF directly: View PDF PDF.
Page Count: 62

DownloadAzure Data Architecture Guide
Open PDF In BrowserView PDF
Azure Data
Architecture Guide

Agenda

Subway Map

Types of Data

Real-Time Streaming Architecture

Lambda Architecture

Hot Path

• View Actionable Data Quickly
• Likely Looking at an Aggregate
• Requires Professional Presentation
• Better if Ad-Hoc Querying is Supported

Cold Path

• Long-term Storage
• Must be cheap to query and store
• May be processed multiple times
• Analysis
• Prediction
• Learning

Hot & Cold Path Analysis

How can we
make it
happen?
What will
happen?

Value

Why did it
happen?

What
happened?

Difficulty

Kappa Architecture

IoT Message Processing

Data Warehousing

Data Warehousing

Data Warehouse Pattern

Data Warehouse Technology Choices

• SMP (small/medium data)
• Azure SQL Database
• SQL Server in a virtual machine
• Azure SQL Database managed
instance

Data Warehouse Technology Choices

• MPP (big data)
• Azure Data Warehouse
• Apache Hive on HDInsight
• Interactive Query (Hive LLAP) on
HDInsight

SQL Data Warehouse

SQL Data Warehouse

Azure Storage

Data Warehousing

No-SQL

Document Data Stores

Key-Value Data Stores

Graph Data Stores

And Then There’s More…

• Columnar
• Object
• Time Series
• External Index

Processing JSON

No-SQL Technology Choices

• Azure Data Factory
• Azure Logic Apps
• Azure Functions
• App Service
• Azure Data Lake Analytics
• Azure HDInsight

No-SQL Technology Choices

• Spark SQL
• HBase
• Hive
• SQL Data Warehouse
• Azure Machine Learning Workbench
• SQL SSIS

Azure Cosmos DB
Column-family

Key-Value

Graph

Documents

SQL

Clickstream Analysis

Storing Relational and No-SQL Data

Processing JSON in Real-Time

Data Lake

Analytics
U-SQL
YARN

WebHDFS

Unstructured

Semi-Structured

Storage

Structured

HDInsight
Authoring Jobs

App Integration

End User Tooling (IDE’s, Analyst tools, Command lines)

Lightweight
Low cost to extend
Scenario oriented

Breadth of Clients (Java, JS, .NET, etc)

Consistent REST API’s

Innovation flows upward
New compute models
Perf enhancements

Extend breadth & depth
Enable new scenarios
Integrate with current tool
chains

Authoring frameworks and languages
Core Hadoop

Connectivity
Programmability
Security
Loosely coupled

HDInsight & Data Lake

Azure HDInsight
Hadoop WebHDFS Client

WebHDFS-compatible REST API

Azure Data Lake Store

On-Demand Big Data Analytics

Natural Language Processing

Processing Free-Form Text

Natural Language Processing (NLP)

“Great to meet you! I
need to extend my
booking next week by one
day. Can you also book
me a car?”

Processing Free-Form Text using NLP

Language Understanding (LUIS)
Train by providing
examples

Deploy to an HTTP
endpoint and
activate on any
device

Create your
own LU model

Maintain model
with ease

Natural Language Processing Technology Choices

• Azure HDInsight
• with Spark and Spark MLlib
• Microsoft Cognitive Services
• LUIS
• Bing Search APIs

Speech-to-Text Translation
TrueText

can can you
here me

Automatic
Speech
Recognition

Can you hear
me?

Machine
Translation

Speech-to-Speech Translation
TrueText

can can you
here me

Automatic
Speech
Recognition

Can you hear
me?
Text to
Speech

Machine
Translation

Intelligent Applications

Advanced Analytics & Deep Learning

Data Pipeline

Extract, transform, and load

Extract, load, and transform

Extract, Transform, Load (ETL)

Semantic Modeling

Semantic Modeling

Online Analytical Processing Pattern (OLAP)

OLAP Technology Choices

• SQL Server with Columnstore indexes
• Azure Analysis Services
• SQL Server Analysis Services (SSAS)

Azure Analysis Services

SQL Database

Security
In-Memory
Cache

SQL Data Warehouse
Data Lake

Power BI

Data
Modeling

HDInsight/Spark

Other

Azure Analysis Services

Third-Party

Power BI Desktop

SQL Server/Oracle
Other

Lifecycle
Management

Business Logic
& Metrics

Excel

Data Mart

Understand
• Search
• Browse
• Filter

Discover

• Metadata
• Experts
• Context

Contribute
• Data Assets
• Familiar Tools
• Existing
Processes

Consume

• Tag
• Document
• Publish

Business Intelligence

Relational Data

Relational Data

Transactional Data

Online Transaction Processing (OLTP) Pattern

OLTP Technology Choices

• Azure SQL Database
• Azure SQL Database Managed Instance
• SQL Server in an Azure Virtual Machine
• Azure Database for MySQL
• Azure Database for PostgreSQL

Thank You!
E-mail:

sidney@seesharprun.net

Twitter:

@sidney_andrews



Source Exif Data:
File Type                       : PDF
File Type Extension             : pdf
MIME Type                       : application/pdf
PDF Version                     : 1.7
Linearized                      : No
Page Count                      : 62
Language                        : en-US
Tagged PDF                      : Yes
XMP Toolkit                     : 3.1-701
Producer                        : Microsoft® PowerPoint® 2016
Title                           : 
Creator                         : 
Creator Tool                    : Microsoft® PowerPoint® 2016
Create Date                     : 2018:01:17 23:34:08-05:00
Modify Date                     : 2018:01:17 23:34:08-05:00
Document ID                     : uuid:85740D8C-452C-4D5B-B731-390E8D717080
Instance ID                     : uuid:85740D8C-452C-4D5B-B731-390E8D717080
Author                          : 
EXIF Metadata provided by EXIF.tools

Navigation menu