Interested in speaking at the next DataOps Unleashed? We've opened an early CFP for our February 9th meeting. Submit your talk proposal here.

let's

unleash

dataops,

together

we'll see

you again

february

9th, 2022

Let's Unleash DataOps Together on February 9th, 2022

Jeff Lambert

Vice President of Data Solutions

8451-kroger.png

Priya Vijayarajendran

Vice President Data & AI

6

Maxime Beauchemin

CEO & Founder

preset logo (1)

Srinivasa Gajula

Lead Engineer

mastercard

Kevin Davis

Application Engineering Manager

8

Abe Gong

Co-Founder & CEO

2

Shirshanka Das

Co-Founder & CTO

acryl data

Angelo Carvalho

Principal Solutions Architect

7

Kunal Agarwal

Co-Founder & CEO

3

Matthew Carroll

CEO

immuta-logo-copy

Kumar Menon

SVP Data Fabric & Decision Science Technology

equifax

Sarah Gadd

Global Head of Data and Artificial Intelligence Solutions

credit-suisse.png

Sanjeev Mohan

Research Vice President, Big Data & Advanced Analytics

gartner

Shivnath Babu

Co-Founder & Chief Technology Offficer

3

Patrick Druley

Senior Solution Engineer

4

Suresh Devarakonda

Lead Database Engineer

8451-kroger

Chinmay Sagade

Principal Engineer

mastercard

James Fielder

Senior Data Engineer

cox.png

Paloma González Martínez

Chief Data Officer

alphacredit.png

Wayne Eckerson

President

Eckerson logo

Christopher Bergh

CEO, Founder, Head Chef

data-kitchen.png

David Lloyd

Chief Data Officer

ceridian.png

Sandeep Uttamchandani

CDO & VP of Eng.

3

Justin Borgman

Chairman & CEO

starburst-logo.png

David Bath

Vice President of Platforms

oneweb-logo.png

Justin Mullen

CEO

Nick Acosta

Developer Advocate

fivetran-logo.png

Guy Adams

CTO

Stephen Bailey

Director of Applied Data Science

immuta-logo-copy

Ry Walker

Founder & CTO

astronomer-logo.png

Vijay Kiran

Head of Data Engineering

soda logo

Mary Flynn

Senior Director of Product Marketing

okera-logo-data-ops-unleashed.png

Tristan Spaulding

Senior Director of Product Management

5

Watch All of the DataOps Unleashed 2021 Sessions On-Demand

Welcome | Andrew Gelinas, Co-Founder @ Solution Monday

Opening Keynote: Unleashing DataOps | Kunal Agarwal, Co-Founder & CEO @ Unravel

DataOps empowers data teams to cost-effectively deliver high-quality data products, with the increasing use of AI and machine learning. With DataOps, teams go deep on specific technologies, such as Hadoop, Hive, Spark, Presto, Kafka, Databricks, and Snowflake. But they also maintain and manage across technologies, from data ingest through data pipelines to on-time delivery of analytics and results. Join DataOps innovator Kunal Agarwal, CEO of Unravel Data, as he describes how companies large and small are using DataOps to make their technology stacks hum, get more done at a lower cost, and improve both customer experience and the bottom line.

A journey to the cloud for Adobe’s corporate data platform | Kevin Davis, Application Engineering Manager @ Adobe

Adobe has just embarked on a multi-year journey to transition their on-premise Hadoop data platform to the cloud.  With thousands of users, petabytes of data, and millions of monthly job executions, transitioning to the cloud will be a tremendously challenging task.  Join Kevin Davis as he shares the catalysts that started Adobe on this journey, the processes being employed to ensure key customer challenges are addressed in the new environment, and other tools and strategies that are helping along the way.  If your organization is contemplating a move to the cloud, this session will provide key insights into the early stages of Adobe’s transition that will help you plan your initiative.

Data Quality in DataOps | Abe Gong, Co-Founder & CEO @ Superconductive

As the world’s leading tool for data quality, Great Expectations occupies a unique position in the DataOps ecosystem. Over the last year, thousands of data scientists, engineers, and analysts have joined the Great Expectations community, making it one of the fastest-growing data communities in the world. In addition, Great Expectations integrates with many other DataOps tools, giving our developers a unique perspective on how the ecosystem is developing.
This presentation will share examples, patterns, and emerging best practices for data quality from the Great Expectations community. The first half of the talk will focus on nuts-and-bolts engineering, including common use cases and deployment patterns for data quality.  The second half of the talk will share learnings for how data quality and DataOps are reshaping data workflows and collaboration. Together this presentation will give you a clear view into how to get started with data quality, and where the field is going as a whole.

DataOps automation and orchestration with Fivetran, dbt, and the modern data stack | Nick Acosta, Developer Advocate @ Fivetran

Many organizations struggle with creating repeatable and standardized processes for their data pipeline. Fivetran reduces pipeline complexity by fully managing the extraction and loading of data from a source to a destination and orchestrating transformations in the warehouse.

This talk will explain and evaluate the various benefits currently available using a DataOps approach with Fivetran and the rest of the modern data stack.

DataOps principles and practices | Vijay Kiran, Head of Data Engineering @ Soda

DataOps has grown because of the need to support execution at scale in the data management space. During this session Vijay Kiran, Head of Data Engineering at Soda Data, will present how the practice of DataOps is fundamental to how data moving across the stack, from source to data product, is monitored and managed to provide trusted data to the business to transform how data analytics works. Attendees will walk away with DataOps principles and practices that deliver value from data.

Observability of Apache Airflow | Ry Walker, Founder & CTO @ Astronomer

Apache Airflow has become an important tool in the modern data stack. We will explore the current state of observability of Airflow, common pitfalls if you haven't planned for observability, and chart a course for where we can take it going forward.

The evolution of a data platform | James Fielder, Senior Data Engineer @ Cox Automotive

Designing a data platform is no easy task, particularly when there are new technologies, techniques, and approaches appearing every week. At Cox Auto UK we have been on a journey from manually deployed Hadoop clusters to a full platform as a service setup using Azure Databricks. This journey hasn’t always been smooth however and we’ve learned some things along the way! In this talk, we will examine how we have made design choices while evolving our platform, our decision to open source some of our work, and what our past, present, and future look like.

A modern data estate is the key to data-led digital transformation | Priya Vijayarajendran, VP Data & AI @ Microsoft

At Microsoft, we believe that a modern, cloud-based data strategy is the foundation of successful digital transformation. Microsoft has helped innumerable organizations with that journey and we know that making the shift to the cloud or embracing a hybrid approach can be challenging. In this talk, Microsoft VP of Data and AI PriyaVijayarajendran will describe the benefits of data-led digital transformation and share her insights on what it takes to build a data-first culture. She’ll share how a modern data estate, powered by the right operational and analytics databases and employing AI and ML can create cloud-native user experiences that give your enterprise a competitive edge. If your organization is ready to embrace the cloud but unsure of how to take the first steps, this session will provide the inspiration, insights, and strategies to help you succeed.

How to Find a Misbehaving Model | Tristan Spaulding, Senior Director of Product Management @ DataRobot

Monitoring machine learning models once they are deployed can make the difference between a creating competitive advantage with ML and suffering setbacks that erode trust with your users and customers. But measuring ML model quality in production environments requires a different perspective and toolbox than monitoring normal software applications. In this talk, I share some practical techniques for identifying decaying models, along with strategies for providing this protection at scale in large organizations.

DataOps for the new data stack | Shivnath Babu, Co-Founder & CTO @ Unravel

This talk demystifies the new data stack that thousands of companies are deploying to convert data into insights continuously and with high agility. This stack continues to evolve with the emergence of new data roles like analytics engineers and ML engineers as well as new data technologies like lake houses and data validation. A new wave of operational challenges has emerged with this stack that, unless addressed from day one, will derail its success. Shivnath will discuss these DataOps challenges and the best practices to address them. The talk will be accompanied by a brief demonstration.

Driving DataOps Culture with LinkedIn DataHub | Shirshanka Das, Co-Founder & CTO @ Acryl Data

Your data is not changing slowly, so why should your metadata?

LinkedIn DataHub was open-sourced to enable other organizations to harness the power of metadata and unleash excellent DataOps practices. Doing DataOps well requires bringing together multiple disciplines of data science, data analytics, and data engineering into a cohesive unit. However, this is complicated, because there are a wide variety of data tools that are in use by these different tribes. Shirshanka, who founded and architected DataHub at LinkedIn, will describe its journey in enabling DataOps use-cases on top of the metadata platform. He will also showcase the latest integrations and features in the tool and share the roadmap for the project.

ELT-G: Locating governance in the modern data stack | Stephen Bailey, Director of Applied Data Science @ Immuta

ELT is a data ingestion pattern that promotes an "extract first, model later" approach to building data workflows. While it saved time for data teams and enabled more agile development, the buzz about ELT has not given proper credit to its silent G: governance. Unlike transformations, proper governance, and in particular, securing access to data, cannot be deferred until later, and requires clear, consistent principles to be implemented by data teams.

During his talk, Stephen will provide a framework for thinking about data governance in an ELT landscape, introduce policy-based access controls, and provide some suggestions for data teams to get started with better governance today.

Best practices for optimizing your big data costs with Amazon EMR | Angelo Carvalho, Principal Solutions Architect @ AWS

As data volumes increase, so do the costs of processing it. We’ll review several best practices and new features that enable you to cut operating costs and create efficiencies when processing vast amounts of data using Amazon EMR. Session attendees will be able to walk away with a solid understanding of Managed Scaling, improving Apache Spark performance to help lower their Amazon EMR costs, and monitoring, tuning, and troubleshooting solutions for big data workloads on Amazon EMR.

Keynote | DataOps.live and Snowflake going stellar with OneWeb (the Global Satellite Network Provider)

David Bath, Global VP of Platforms, OneWeb
Justin Mullen CEO, DataOps.live
Guy Adams CTO, DataOps.live

OneWeb is a global connectivity network powered from space. OneWeb is implementing a constellation of Low Earth Orbiting satellites with global gateway stations, a range of user terminals and a suite of digital products to provide an affordable, fast, low-latency communications service. Offering enterprise-grade, managed connectivity services in partnership with its solution providers, OneWeb enables communities, governments, businesses and mobility industries such as aviation and maritime to easily connect to the cloud, applications, customers, people and devices.

The global pandemic has accelerated the need for digital transformation, underscoring the necessity to be online for education, work, access to health care and to enable the IoT future and a pathway to 5G. OneWeb is driven by data: how the business operates, the partners it works with, the customers it serves, and IoT telemetry data on the status and health of millions of devices 24/7. OneWeb has chosen DataOps.live and Snowflake to securely store and govern every item of data, to automate every data pipeline from source to destination, and create a culture of self-service analytics across every part of the business.

Join this session to find out how you can apply this ground-breaking approach to your business.

The state of DataOps | Wayne Eckerson, President @ Eckerson Group and Kunal Agarwal, Co-Founder & CEO @ Unravel

There is a lot of interest in DataOps, but many people are confused about what it is and isn't. Is DataOps a methodology for building pipelines? A set of development and execution tools? Or a process for continuous improvement? This session will clear up the confusion and help align our understanding before we dive into details during breakout sessions.

To start this fireside chat, veteran data and analytics thought leader Wayne Eckerson will deliver a short presentation based on three-years of research that describes the core principles and components of DataOps. He will then sit down with Unravel CEO, Kunal Agarwal, to discuss the trends, challenges, and best practices required to succeed with DataOps. The goal is to give attendees a clear, concise, and unbiased understanding of DataOps with guidance about where to start and how to implement it.

Panel: Creating a data-driven culture | Moderated by Sanjeev Mohan, Research Vice President, Big Data & Advanced Analytics @ Gartner

Panelists: 

Sarah Gadd, Global Head of Data and Artificial Intelligence Solutions @ Credit Suisse

Kumar Menon, SVP Data Fabric & Decision Science Technology @ Equifax

David Lloyd, Chief Data Officer @ Ceridian

Paloma González Martínez, Chief Data Officer @ AlphaCredit

More and more companies are adding a Chief Data Officer (CDO) and other leadership roles to the executive-suite, often leading an organization-wide transformation to a data-driven culture. CDOs, and other senior technologists, face a wide range of challenges. At the same time, the progress of data technology - especially cloud services, AI, and machine learning - is opening up new opportunities. 

Our panelists will share their insights and describe the strategies they’re using to move data-driven decision-making to the core of organizational processes, products, and services.

Things you may not know about Apache Kafka but should | Patrick Druley, Senior Solution Engineer @ Confluent

In this session, you will learn about some of the common misconceptions, best practices, and little-known facts about Apache Kafka. Event Streaming has changed the way businesses think about data movement and integration. If you are new to Kafka or having been creating topics and developing clients for years, there's something for everyone in this fun and informative session.

Apache Superset for Data Engineers | Maxime Beauchemin, CEO & Founder @ Preset

Superset is the leading open source data exploration and visualization platform. In this talk, we’ll be presenting Superset with a focus on advanced topics that are most relevant to Data Engineers. The presentation will include a live demo of the product, and dive into advanced topics including the alert & report framework, the REST API, and building custom visualization plugins.

Building Checkpoints in your DataOps | Sandeep Uttamchandani, CDO & VP of Engineering @ Unravel

Behind every successful insight (BI analytics or ML model) is a reliable data pipeline! These pipelines are planned, implemented, deployed, and monitored in an ongoing fashion referred to as the DataOps infinity loop (similar to CI/CD for traditional software). This talk covers battle scars in managing DataOps at scale, and how building checkpoints in the DataOps loop can reduce missed SLAs, cost outages, escalation from data users, and most importantly avoid data pipeline surprises!

Universal Data Authorization for Your Data Platform: What is It and Why Now? | Mary Flynn, Senior Director of Product Marketing @ Okera

With all the advances in DataOps, many data-driven initiatives still fail. Why? Because organizations still struggle to resolve two problems as old as data itself: people can retrieve and use data they should not have access to, and other people cannot access data for legitimate business purposes.
In this session, you’ll learn what Universal Data Authorization is and how adding it to your modern technology stack brings clarity and appropriate control across your entire data platform. You’ll learn why fine-grained access control and de-identification techniques are the new table stakes, and why success at enterprise scale is only achieved through an API-first platform approach with delegated stewardship and full visibility for audit and reporting. Join this session to learn how your company can accelerate business agility, minimize data security risks, and demonstrate regulatory compliance.

How 84.51° Slashed Operational Costs & Improved DataOps Efficiency by Solving Problems with Small Files | Jeff Lambert, Vice President of Data Solutions @ Kroger/84.51˚ and Suresh Devarakonda, Lead Database Engineer @ Kroger/84.51˚

Hear from 84.51° as they give a 30,000 ft view into their management of Yarn and Impala. They will share how they solved challenges associated with small files and used a centralized DataOps approach to troubleshoot issues with their big data pipelines. 84.51° will also take from their executive dashboards and share key learnings in helping your business improve efficiency and reduce operational costs.

Improving platform resiliency by detecting harmful workloads | Chinmay Sagade, Principal BizOps Engineer @ Mastercard and Srinivasa Gajula, Lead Engineer @ Mastercard

Big Data unlocks tremendous opportunities. The distributed platforms which enable this capability are dependent on optimized workloads to make efficient use of available resources. In this talk, we will be presenting an application monitoring system created at MasterCard,  which detects harmful workloads and helps maintains the business goals on resiliency and latency.

Founder's Roundtable | Panel Discussion

Moderator: Wayne Eckerson, President @ Eckerson Group 

Kunal Agarwal, Co-Founder & CEO @ Unravel

Matthew Carrol, CEO @ Immuta

Christopher Bergh, CEO, Founder, and Head Chef @ DataKitchen

Ry Walker, Founder & CTO @ Astronomer

Justin Borgman, Chairman & CEO @ Starburst

Join Wayne Eckerson and Unravel’s Kunal Agarwal for a Founders' Roundtable with Astronomer's Ry Walker, DataKitchen's Christopher Bergh, and Immuta's Matthew Carrol, and Starburst's Justin Borgman as we look at the evolution of data and the rapid adoption of DataOps.

Our participants join us to share their perspectives on their place in the fast-changing DataOps world. They will share their thoughts on why this community and these conversations are so important today and give real-life examples of how they are working to help organizations realize value from their data.

This roundtable is sure to be lively. Each of our panelists is an innovator in their respective realm; we invite you to attend and help us tap their expertise for innovative DataOps strategies.

Closing Remarks | Kunal Agarwal, Co-Founder & CEO @ Unravel

What is the cost to attend the virtual sessions?

DataOps Unleashed is always free and open for all to attend.

What is DataOps Unleashed?

DataOps Unleashed is the official DataOps community.

We'll be back on February 9th, 2022 with DataOps, CloudOps, AIOps, MLOps, and other professionals, to gathered virtually to share the latest trends and best practices for running, managing, and monitoring data pipelines and data-intensive analytics workloads.

Sessions will include talks by DataOps professionals at leading organizations, detailing how they’re establishing data predictability, increasing reliability, and reducing costs.

New to DataOps?

DataOps is a holistic approach to the creation, deployment, monitoring, management, and optimization of data-driven applications. It describes the culture and rules of engagement that allow data teams to deliver and maintain high-quality, on-time data products, often powered by AI and machine learning, in an agile and cost-effective way.

DataOps defines how data teams work and also affects data consumers and those whose work causes new data to be created and used within the organization. Their work enables the entire organization to access data efficiently for data-driven decision-making and for the creation and delivery of data-driven applications.

Organizations with well-developed DataOps strategies, governance, and processes can expedite the delivery of data-driven workflows and results faster and better than others.

Who comes to DataOps Unleashed?

DataOps professionals and experts including data administrators, data architects, data engineers, data analysts, AI/ML professionals, and data technology leadership.

Join us for sessions on:

  • Data pipelines
  • Data orchestration
  • Metadata
  • Data quality
  • Data governance
  • Data science platforms
  • AIOps and MLOps
  • CloudOps
  • Migrations
  • Observability
  • Optimization
  • Operations

Sign up to join us at the next DataOps Unleashed on February 9th, 2022!

Interested in participating as a partner sponsor? For more information please contact mike@solutionmonday.com