In this series we look at aspects of energy generation, energy storage & smart grid technology and we explore how data is used to inform both up-front investment decisions and ongoing investment return. Produced by iamthehow.
Welcome to the Building the Backend Podcast! We’re a data podcast focused on uncovering the data technologies, processes, and patterns that are driving today’s most successful companies. You will hear from data leaders sharing their knowledge and insights into what’s working and what’s not working for them. Our goal is to bring you valuable insights that will save you and your team time when building a modern data architecture in the cloud. Topics will span from big data, AI, ML, governance, ...
Zero Carbon 2030 : Views from Ollie Boyacigiller
18:34
Join Fraser and Ollie as they chat about the future of energy from the perspective of young voters. By @IAmTheHow
Zero Carbon 2030 : From What If To What Next
25:51
Join Fraser Durham and Rob Hopkins (cofounder of Transition Town Totnes) as they discuss how the power of imagination creates the future that we want. Produced by @IAmTheHow.
The Analytics Engine for All Your Data with Justin Borgman @ Starburst
36:12
In this episode we speak with Justin Borgman, Chairman & CEO at Starburst, which is based on open source Trino (formerly PrestoSQL) and was recently valued at $3.35 billion after securing its Series D funding. We discuss the convergence of DWs and DLs, why data lakes fail, and much more. Top 3 takeaways: The data mesh architecture …
Transform Your Object Storage Into a Git-like Repository With Paul Singman @ LakeFS
27:23
In this episode we speak with Paul Singman, Developer Advocate at Treeverse / LakeFS. LakeFS is an open source project that allows you to transform your object storage into a Git-like repository. Top 3 takeaways: LakeFS enables use cases like debugging to quickly view historical versions of your data at a specific point in time and running ML experim…
Enable Faster Data Processing and Access with Apache Arrow with Matt Topol @ Factset
49:15
In this episode we speak with Matt Topol, Vice President, Principal Software Architect @ FactSet and dive deep into how they are taking advantage of Apache Arrow for faster processing and data access. Below are the top 3 value bombs: Apache Arrow is an open-source in-memory columnar format that creates a standard way to share and process data struc…
Implementing Amundsen @ Convoy with Chad Sanderson
35:52
In this episode we speak with Chad Sanderson, head of data and early-stage startup advisor focused on data innovation @ Convoy, and uncover their journey to implementing Amundsen, an open source data catalog. Below are the top 3 value bombs: Data scientists should not be spending the majority of their time trying to find the data they are interested…
Zero Carbon 2030 : What's in store during 2022?
32:32
Join Fraser Durham and Johnny Gowdy (Director at Regen) as they reveal their insights on energy prices, new generation projects and evolving policy in a world where the supply chain is calling the shots. Produced by @IAmTheHow.
The Importance of Treating Your Data Initiatives as Products with Murali Bhogavalli
26:33
Your data team should not just be keeping the lights on, but should be building and creating data products to support the business. In this episode we speak with Murali Bhogavalli, a data product manager, and explore what a data product manager is and how the role differs from that of a traditional product manager. Below are the top 3 value bombs: Data should be …
Open-Source Data Catalog Amundsen with Mark Grover @ Stemma
41:11
In this episode of Building The Backend we hear from Mark Grover, founder @ Stemma and co-creator of Amundsen. Stemma is a fully managed data catalog, powered by the leading open-source data catalog, Amundsen. Below are the top 3 value bombs: Automated data catalogs are critical to help wrangle the growing data across organizations. (i.e. Being able to ide…
Zero Carbon 2030 : Imagining a New Economy
22:02
Are our pensions based on broken finance models? Is our obsession with growth an outdated experiment that has gone wrong? Is now the time for us to imagine and build a new 'asset based' economy? Fraser gives us all reason to pause for thought. Produced by @IAmTheHow.
Architecting a Modern Data Lake with Dipti Borkar from Ahana
39:32
In this episode of Building The Backend we hear from Dipti Borkar, cofounder @ Ahana, a managed service for Presto on AWS. We talk all about the data lake, how it should be structured and where the industry is going. Below are the top 3 value bombs: Presto is an open source distributed SQL query engine originally created by Facebook, mainly used to…
Open Source BI with Apache Superset
29:15
What tools are you using for data viz? Are they low cost? One option is Apache Superset. In this episode we speak with Robert Stolz to learn more about Superset and other open source data tools. Top 3 Value Bombs: One popular use case with Apache Superset is embedding it within applications; because it’s open source, there is a wide range of flexibi…
Edge Computing and Continuous Intelligence with Swim
34:17
In this episode of Building The Backend we hear from Simon Crosby, CTO @ Swim, an open source edge computing operating system. We talk all about edge computing, event streaming and much more. Below are the top 3 value bombs: Edge means more than just being physically located somewhere; it could also mean in the cloud. It really is the closest poin…
12 Modern Data Architecture Principles That Should Be Implemented in 2022
20:24
This episode is a little different than the usual format. Instead of interviewing a data leader, I share what I consider to be the 12 most important principles when designing a modern data architecture. Please message me on LinkedIn with your thoughts on this show. By Travis Lawrence
Zero Carbon 2030 : The Road from Paris to Glasgow
28:57
Tom's insights from his first-hand negotiations at Paris 2015 tee up our expectations for Glasgow COP26 with a message of respectful balance between the "blah blah blah" and the turning wheels of politics, as we all travel along the highway to change. Produced by @IAmTheHow.
The Keys to Good Data Quality With Prukalpa Sankar from Atlan
37:21
In this episode of Building The Backend we hear from Prukalpa Sankar, Co-founder of Atlan. We talk all about data quality/governance, common issues organizations face when implementing data quality, and much more. Below are the top 3 value bombs: Data Governance has a bad reputation. It should not be a bureaucratic controlling process that i…
Designing a Modern Data Architecture – Teradata
44:29
This is a podcast episode you do not want to miss with Stephen Brobst, CTO @ Teradata. We discuss all things data warehouses, the shift to the distributed cloud, and key principles for implementing successful DWs. Top 3 Value Bombs: Large organizations are shifting more to a distributed / inter-cloud architecture for many reasons, a couple of reaso…
Exploring Open-Source Data Integration With Airbyte
35:42
“The hardest part of ETL is not building the connectors, it is maintaining them.” Truer words were never spoken. I really enjoyed this episode with Michel Tricot, CEO & Co-Founder of Airbyte, where we discuss all things data integration and connectors. Top 3 value bombs: The future of ETL/ELT integration connectors may lie with open source. Many closed sour…
How To Effectively Reduce Data Quality Incidents 10x with Datafold
39:12
This episode features Gleb Mezhanskiy, Co-Founder & CEO @ Datafold. During our discussion we talk all about data observability and how to improve your data quality. Before Datafold, Gleb was a founding member of data teams at Lyft and Autodesk, where he built sophisticated data platforms and developed tooling to improve productivity and data quality…
Applying Transformations to Streaming Data with Materialize
32:55
This episode features Arjun Narayan, Co-Founder & CEO @ Materialize. During our discussion we talk all about transforming streaming data, the do’s and don’ts, and how Materialize is changing the landscape of streaming. Top 3 Value Bombs: When making schema changes, organizations should always strive to make forward-compatible changes only. …
Optimizing Spark in the Cloud - with Jean-Yves Stephan
32:26
This episode features Jean-Yves Stephan, Co-Founder & CEO @ Data Mechanics (recently acquired by Spot by NetApp). During our discussion we talk about optimizing Spark to run in the cloud at a low cost. Top 3 Value Bombs: Running Spark CAN be expensive but there are ways to reduce your current operating costs by 50-75% through smart automation (i.e. tune for…
How To Achieve Better Observability and Control Over Your Data Pipelines with Josh Benamram
37:03
This episode features Josh Benamram, who is the co-founder of Databand. Databand is a company that helps engineering teams achieve better observability and control over their tech stack. Top 3 Value Bombs: When observing our data we should be looking at our data and pipelines. Don’t wait till the board meeting for an incorrect metric to make DQ a pr…
Unify Your Data Operations with Nexla
25:12
Travis welcomes to his podcast Saket Saurabh, who provides a window into the world of data management and the self-service options that are democratizing it. Co-founder and CEO of Nexla, Saket has a passion for data and infrastructure and how to improve its flow among partners, customers and vendors. Nexla automates various data engineering tasks, …
A Powerful Open Source Database That Supports Many Storage Needs (MariaDB)
27:33
In this episode, we speak with Rob Hedgpeth, a director of developer relations at MariaDB. We explore all things MariaDB, the capabilities it has and when you should consider it for your next project. Top 3 value bombs: MariaDB follows a shared nothing architecture and supports distributed SQL for unlimited scaling on demand. MariaDB ca…
Increase the Quality and Reliability of Your Data
31:12
In this episode, we speak with Lior Gavish, the co-founder of Monte Carlo, to explore all things data quality. Monte Carlo is a data lineage and observability tool that lowers your data downtime. Top 3 Value Bombs: Data products should be thought of in their entirety, from the source to the consumer. No one data stakeholder can solve data quality issu…
Zero Carbon 2030 : Energy Networks for the Future
32:05
How future energy scenarios require us to balance investment and consumer demand along with the role of flexibility. A How Production for Argand Solutions, @IAmTheHow.
Build Real-Time Data Pipelines in Minutes Not Months with Meroxa
36:33
In this episode, we speak with DeVaris Brown, the CEO and co-founder of Meroxa, a data platform that enables organizations to build real-time data pipelines in minutes, not months. Prior to founding Meroxa, DeVaris was a product leader at Twitter, Heroku, and Zendesk. In this episode we will be talking about all things data ingestion.…
Launch, Monitor, and Share Data Pipelines In a Matter of Minutes
32:07
In this episode, we speak with Blake Burch, co-founder of Shipyard, a data orchestration tool that allows you to create powerful workflows in a matter of minutes. Top 3 Value Bombs: Data tests are often for the assumptions we already know. There are a lot of unknowns that can crop up and cause issues that tests are not catching. Start analyzing job me…
The Data Warehouse for Distributed Clouds - Yellowbrick
37:57
In this episode, we speak with Mark Cusack, CTO at Yellowbrick. Yellowbrick is a data warehouse platform that was built from the ground up for performance and cost that can be deployed across clouds and on-prem. Top 3 Value Bombs: Yellowbrick DW was recently named a contender in Cloud Data Warehouses by Forrester Research and they are able to achie…
What You Should Know Before Getting Started With Data Science with DATA SCIENCE I N F I N I T Y
43:42
In this episode, we speak with Andrew Jones, who has spent 13 years in Data Science at companies including Amazon and, more recently, Sony PlayStation, where he developed and prototyped machine-learning-based features for the PlayStation 5, several of which have been patented by Sony. Since then he has created the DATA SCIENCE I N F I N I T Y community t…
In this episode, we speak with Tejas Manohar, Co-Founder of Hightouch, a leading Reverse ETL platform that syncs data from your warehouse or lake back into the tools your business teams rely on. Top 3 Value Bombs: Organizations should be sending more holistic customer data back into their marketing solutions. Reverse ETL is the process of creating pip…
Become a Data Driven Organization with Christina Stathopoulos an Analytical Lead at Waze @Google
33:44
In this episode, we speak with Christina Stathopoulos who works at Google as an Analytical Lead with Waze, a crowdsourced mobile navigation app. She is also an adjunct professor at IE Business School and guest lecturer at ISDI where she teaches analytics courses in the MBA programs. In this episode we will discuss the current landscape of data, cha…
Designing Scalable Data Architectures with Dr. Mark Tabladillo from Microsoft
40:46
In this episode, we speak with Dr. Mark Tabladillo. Mark is a thought leader in the AI/ML space at Microsoft where he creates technical architectures for artificial intelligence and data science solutions. Top 3 Value Bombs: Many organizations struggle to understand their current data landscape. Azure Purview can help you manage and govern your dat…
Zero Carbon 2030 : It's All About The Money
23:53
Are green investment funds effective or do they just make us feel better? Understanding primary and secondary markets may seem irrelevant to the average investor, but it's vital that we learn more and take part in the conversation. A How Production for Argand Solutions, @IAmTheHow.
Reduce Data Movement and Decrease Processing Times with a Machine Scale Feature Store (Molecula)
46:14
In this episode, we speak with H.O. Maycotte. H.O. is the CEO/founder of Molecula, an enterprise feature store that simplifies, accelerates, and controls big data access to power machine-scale analytics and AI. Molecula is powered by Pilosa, an open source project created by H.O. and team. Pilosa eliminates the need to copy data between systems in …
Why You Should Be Using (CDC) Change Data Capture for Ingestion with Datacoral
40:47
In this episode, we speak with Raghu Murthy. He is the founder of Datacoral, which provides serverless architectures that support data pipelines and orchestration to facilitate ELT into your Data Warehouse. Prior to founding Datacoral he was at Yahoo and Facebook, and was part of the initial team that developed Hive. In this episode we will explore the…
Integrating Large Scale Microservices Architectures with your Data Platform with Sunu Sasidharan
25:48
In this episode, we speak with Sunu Sasidharan. Sunu is the technology lead at Cuologic where he helps global brands implement large-scale architectures, DevOps automation and data engineering across multiple technology stacks. Top 3 Value Bombs: When creating distributed databases, you can only achieve two of the following three: consistency, avai…
Transportation Modeling and Autonomous Vehicles With Matt Battifarano
35:04
In this episode, we speak with Matt Battifarano. Matt is a data scientist focusing on transportation modeling. He started his career as a data scientist at a startup called Bridj where they created a smart micro-bus platform for urban transit similar to Uber Pool. Currently he’s working towards his PhD at Carnegie Mellon at their Mobility Dat…
Zero Carbon 2030 : Being Clever about Smart Tech
23:37
The pandemic shows us that behavioural change is not enough to meet carbon targets; we have to get clever about being smart. A How Production for Argand Solutions, @IAmTheHow.
The Importance of Self Service BI with 5xData
26:30
In this episode, we speak with Tarush Aggarwal. Tarush is the founder of 5xData, where he helps companies build a strong data foundation with self-service BI to enable the business. Prior to starting 5xData he was one of the first data engineers on the analytics team at Salesforce and helped scale the data team at WeWork from 5 to 100+. Top 3 Value Bombs…
The Next Wave of AI and Creating Intelligent Cognitive Assistants with aigo.ai
32:34
In today’s episode, we will speak with Peter Voss and discuss the current landscape of AI, the next wave of AI called Artificial General Intelligence, and how organizations today can level up their chatbots to create satisfied customers. Peter Voss is a serial entrepreneur and pioneer in artificial intelligence who coined the term ‘AGI’ (Artifici…
Learn How LinkedIn is Future-Proofing Their Data Architecture
41:18
In today’s episode, we will speak with Kapil Surlaker, the vice president of engineering at LinkedIn. Kapil has been with LinkedIn for over 10 years and has played an instrumental role in shaping the data architecture that LinkedIn is built on top of. In this episode, we cover a wide range of topics surrounding data architecture from: How metadata …
DataOps Is Not Just DevOps for Data with DataKitchen
28:14
In today’s episode, we will speak with Chris Bergh, a pioneer in the DataOps landscape and the CEO at DataKitchen, a DataOps platform that simplifies complex data toolchains and environments. Top 3 Value Bombs: DataOps is not just DevOps for data. Any organization can get started today and start implementing DataOps practices. Start small and priorit…
Getting Started with AI While Avoiding R&D Failures
37:38
In today’s episode, we will speak with Manny Bernabe and discuss the current landscape of AI, how to get started implementing AI solutions and what organizations should be doing today to set themselves up for AI success in the future. Manny is the founder of BigPlasma.ai and has 10+ years of experience creating and deploying AI & Machine Learning solutions a…
Cleaning Dirty Data with the Classification Guru
17:00
In today’s episode, we will speak with Susan Walsh and learn why organizations struggle with creating and maintaining high-quality data and the steps she takes to resolve data issues. Susan Walsh has nearly a decade of experience fixing data and founded The Classification Guru. Susan is a specialist in data classification and data cleansing. S…
Zero Carbon 2030 : The Social Cost of Carbon
26:40
'The social cost of carbon' debate between scientists and economists has been running for years, but it's now firmly back on the table; Fraser steers us through some of the complexities. A How Production, @IAmTheHow.
Data Teams: A Unified Management Model for Successful Data-Focused Teams
40:33
In today’s episode, we will speak with Jesse Anderson and learn how to run successful big data projects and how to resource your teams. Jesse is a big data expert at Big Data Institute who has worked with companies ranging from startups to the Fortune 100. He has taught over 30,000 people the skills to become data engineers and is published in prestigious publicatio…
DataSecOps - Increase the Security of Data While Making it Simple to Manage with eXate
41:57
In today’s episode, you will hear from the co-founders of eXate, Peter Lancos and Sonal Rattan. eXate streamlines, automates and simplifies the processes of storing, interpreting, and extracting value from data assets. It democratizes data privacy for organizations by providing a simple, embedded platform that automates the technical enforcement of …
How to Monetize, Manage, and Measure Information as an Asset for Competitive Advantage
31:04
In today's episode you will hear from Doug Laney, a best-selling author and recognized authority on data and analytics strategy. Doug’s book, Infonomics: How to Monetize, Manage, and Measure Information for Competitive Advantage, was selected by CIO Magazine as the “Must-Read Book of the Year” and one of the “Top 5 Books for Business Leaders and Te…
Disrupting Data Governance - Everybody Should be a Data Steward
27:20
In today’s episode, you will hear from Laura Madsen, who has 20+ years in data and analytics, has authored books on data governance and healthcare analytics, and co-founded the Minneapolis-based consulting firm Via Gurus. Top 3 Value Bombs: Data governance should be democratized throughout the organization. Data Governance is a journey, not a destination. Most…