Skip to main content

What is the architecture of Azure Data Lake?


What is the architecture of Azure Data Lake?
Azure Data Lake is designed with 2 major components, data lake store and analytics. And majorly there are below structure:

1.) Internal system - YARN & WebHDFS. Yarn - Analytics  & WebHDFS - Hadoop hdfs storage.
2.) Analytics - USQL   
3.) Compute Engine - HdInsight (Big Data batch processing).
3 Azure Data Lake Store (ADLS) serving as the hyper-scale storage layer.

What can I do with Azure Data Lake Analytics?
·         Right now, ADLA is focused on batch processing, which is great for many Big Data workloads.
·         Prepping large amounts of data for insertion into a Data Warehouse
·         Processing scraped web data for science and analysis
·         Churning through text, and quickly tokenizing to enable context and sentiment analysis
·         Using image processing intelligence to quickly process unstructured image data
·         Replacing long-running monthly batch processing with shorter running distributed processes
ADLA is well equipped to handle many of the types of processing we do in the T portion of ETL; that is, transforming data. If you've found that your data volumes have increased, changed shape, or you are generally not happy with your ETL performance, ADLA might serve as a good replacement for your traditional approach to prepping data for analysis.


Thanks for reading
Plz dont forget to like Facebook Page..
https://www.facebook.com/pages/Sql-DBAcoin/523110684456757

Comments

Popular posts from this blog

How to encrypt and decrypt Table data in postgres

For encrypting and decrypting , we must use the bytea data type on the column which we implement. Bcoz bytea will use the pgcrypto method by default. However, you will need to create the pgcrypto extension to enable these functions as they are not pre-defined in PostgreSQL/PPAS. Example CREATE EXTENSION pgcrypto; CREATE TABLE userinfo (username varchar(20), password bytea); >>    Inserting the data in an encrypted format INSERT INTO userinfo VALUES(' suman ',encrypt('111222','password','aes')); select * from userinfo ; >>    Retrieving the data as decrypted format SELECT decrypt(password,decode('password','escape'::text),'aes'::text) FROM userinfo; Thanks for reading Plz dont forget to like Facebook Page.. https://www.facebook.com/pages/Sql-DBAcoin/523110684456757

How to recover msdb database from suspect mode

 It was Monday 9 th Jun 47 degr. temperature of Delhi-NCR. Temperature was like boiling me and database. When I reached my office( @ 8.45 am) got an alert from one of Server. “MSDB is in suspected mode” At the same time comes in my mind, this issue will boil me today.. I just tried to cool my self through cold drink then connected server from my local system using windows authentication mode..

SQL71562: external references are not supported when creating a package from this platform

Last week I got this error from one of developer who was trying to deploy his project from Testing server to SQL Azure QA server. He was using “Deploy Database to SQL Azure” option from SSMS Tool-Task option. After connecting to SQL Azure portal when operation started to deployment below errors occurs. Validation of the schema model for data package failed. Error SQL71562: Error validating element xx.xxx.xx:function .dbo.xxx has an unresolved refrence to object xx.dbo.xxxx external refrences are not supported when creating a package from this platform . Reason: The reason of the this error was; some functions of project was dependent on master database and only single database was being deploy to SQL Azure. DACFx must block Export when object definitions (views, procedures, etc.) contain external references, as Azure SQL Database does not allow cross-database external references So, this error was coming. Solution : I suggested him to create those function to locally