Log in
  • Home
  • A Machine Learning-Based Data Quality Framework for Cloud Data Lake-A. Dutta CTO & Seth Rao, Ph.D CEO of FirstEigen

A Machine Learning-Based Data Quality Framework for Cloud Data Lake-A. Dutta CTO & Seth Rao, Ph.D CEO of FirstEigen

  • 11 Sep 2019
  • 11:00 AM - 12:00 PM

Registration is closed

On behalf of 
DATA MANAGEMENT ASSOCIATION
SAN FRANCISCO CHAPTER

 Welcoming you all for the

A Machine Learning-Based Data Quality Framework for Cloud Data Lake

 

Presenter:

A. Dutta 

CTO  & Co Founder of FirstEigen

Seth Rao, Ph.D

CEO and Founder of FirstEigen

Date:

Sep 11th , 2019

11:00 AM - 12:00 PM PDT

 


Webinar details:-

Title:

A Machine Learning-Based Data Quality Framework for Cloud Data Lake

 

Abstract:

The speaker hopes to address some of these during this webinar.

Summary: 

Operational and transnational data are collected in large volumes, in different formats, from multiple sources and flow through multiple platforms. Even to validate a mere 1,000 tables, organizations typically have to write close to 100,000 Data Quality (DQ) rules. DQ validation using conventional approaches such as rules writing is costly, error prone and not scalable. Organizations are left to constantly firefight and keep writing new DQ validation rules when data errors are discovered, leaving the management very nervous and un-trusting of the data. Even when data has just 1-3% errors, the resulting analytical models are inaccurate and predictions have significant errors. Thus, it is paramount to detect and correct poor data before it propagates throughout the organization.

The speakers have developed and extensively used Machine Learning-based framework to discover DQ issues within cloud data lake at many organizations. This framework helps organizations to validate data assets through the lens of five DQ dimensions: Completeness, Conformity, Consistency, Reason-ability, and Validity. This ML-based approach has discovered complex and unexpected DQ errors in several leading organizations. The talk will also outline the different strategies for Lake-level and Application-level DQ validation using AI/ML.





Join us on Sep 11th @11:00 AM to learn more 

 

Details for Registration:

View the agenda and register at:

Please register using the below link

https://attendee.gotowebinar.com/register/6529235798093578763

Webinar ID

781-339-667

After registering, you will receive a confirmation email containing information about joining the webinar.

Please feel free to reach out to any of the board members of SF DAMA, should you have any questions regarding this.

SF DAMA, www.sfdama.org
Serving San Francisco Bay Area, Silicon Valley, and Sacramento Regions

----------------------------------------------------------------------------------------------------------

Data Management Association, Inc., San Francisco Bay Area Chapter
268 Bush Street, Suite 2523, San Francisco, CA 94104
http://www.sfdama.org/


SHARE ON:




Copyright © 2019-20 SF DAMA. All rights reserved.

Powered by Wild Apricot Membership Software