Distributed Data Show show

Distributed Data Show

Summary: The Distributed Data Podcast is your weekly source for the latest news and technical expertise to help you succeed in building large-scale distributed systems. Brought to you by the Developer Advocate team, we go in-depth with DataStax engineers and special guests from the broader data community. New episodes each Tuesday.

Join Now to Subscribe to this Podcast

Podcasts:

 Cassandra in Japan with Yuki Morishita | Ep. 114 Distributed Data Show | File Type: audio/mpeg | Duration: 00:18:41

Patrick talks with longtime Cassandra contributor and committer Yuki about his background with Cassandra, Change Data Capture, and the state of the Cassandra community in Japan. 0:00 - Patrick welcomes longtime Cassandra contributor and committer Yuki to the show and we learn what attracted Yuki to Cassandra 2:55 - Yuki and Patrick talk about what it was like working with early Cassandra releases before CQL 4:03 - Yuki's efforts on translating documentation into Japanese led to his becoming a Cassandra committer and an early employee at Riptano (original name of DataStax) 6:15 - Yuki moved to the USA for about 5 years to work on Cassandra but is now back in Japan as a Solution Architect 7:50 - Using Cassandra as your source of truth means you need a Change Data Capture facility to flow changes to other systems. 9:48 - Yuki gives an overview of the history of CDC in Cassandra and how it works 11:53 - The hard part of processing changes from multiple nodes is de-duplication. Several companies are working on this 13:20 - Comparing and contrasting engineering vs. solution architecture roles 14:58 - Cassandra community in Japan is emerging. There is a lot of awareness of Cassandra but people are working to get their skills up. 16:58 - Looking forward to DataStax Constellation as a great way to reduce the learning curve 17:33 - Cassandra Summit Japan is the oldest of these events and has been running continuously

 Cassandra Use-Cases and V4 best features with Carlos Rolo | Ep. 113 Distributed Data Show | File Type: audio/mpeg | Duration: 00:19:49

It's always interesting to discuss technologies with real experts like Carlos Rolo, so we can't let him go without some questions answered. What are the most awaited version four features, who gets the biggest win from ZCS and why sidecar project is so important? All of these plus best and worst use-cases for Cassandra discussed in our new release of Distributed Data Show!

 What's New in Tinkerpop 3.4 with Stephen Mallette | Ep. 112 Distributed Data Show | File Type: audio/mpeg | Duration: 00:25:25

Listen as Stephen Mallette gives us the drop on the latest with Tinkerpop 3.4 while elevating our Gremlin game with tips on becoming an advanced user and using application based DSL's. Highlights: 00:54 - what's going on with Tinkerpop 3.4 01:49 - what is version 3, what does that mean? 02:22 - lots of new contributions from the Tinkerpop community 03:31 - discussing changes for 3.4 05:21 - 3.4 is out and ready to go 05:51 - developing parity across different language variants 07:19 - A better serialization format for Graph 09:38 - call to action to solicit feedback from the Graph community on 3.4 11:00 - we want people to write Gremlin in their language natively 12:59 - some future talk on what's coming after 3.4 14:20 - performance increases with Graph binary 15:54 - discussing Stephen's Accelerate talk from novice to advanced 21:02 - a composition of novice level traversals inside of transformations 22:06 - lots of people using DSL's (domain specific languages) in their applications 24:26 - using DSL's to extend the Gremlin language

 Timeseries use cases and why Cassandra fits | Ep.111 Distributed Data Show | File Type: audio/mpeg | Duration: 00:08:09

This week the EMEA DataStax crew takes over the DDS to provide feedbacks about the DataStax Conference and announcements made during keynotes. This was also an occasion to highlight the talk Timeseries at scale performed by Alice and Patrick.

 Cassandra at Netflix and Version 4 Wishlist with Vinay Chella | Ep. 110 Distributed Data Show | File Type: audio/mpeg | Duration: 00:19:34

Host Aleks Volochnev sits down with Netflix Cloud Database Architect, Vinay Chella to discuss Full Query Logging, how Sidecar makes ops people happy and why Netflix already plans to migrate to version 4? A lot is discussed, so stay tuned! Timeline: 00:00 Welcome 00:25 Introduction 01:53 Vinay's Talk I @ Accelerate 02:00 Full Query Logging 03:00 Vinay's Talk II @ Accelerate 03:20 What are you working on right now? 03:25 Sidecar 05:40 Performance Monitoring 06:40 Netflix' Technical Blog 07:05 Version Four 08:05 Async Internode Messaging 08:40 Zero Copy Streaming 10:12 Chaos Engineering vs Cassandra 12:15 Working with Apache Community 14:10 An open-source contributions 15:00 How to become a Cassandra Contributor 15:58 Favourite Bug 17:13 Numbers?! 18:17 Data Density 19:20 Thank you!

 Constellation Tech Preview | Ep. 109 Distributed Data Show | File Type: audio/mpeg | Duration: 00:14:42

Starting off this episode Adron and Kat (Kathryn Erickson) kicks off the discussion with a little focused camera angle on the DataStax Accelerate 2019 Conference! Adron and Kat elaborate on DataStax Desktop and also AppStax! The conversation wraps up with details around DataStax Enterprise Graph, and future direction around that technology. Afterwards Amanda joins Mattias Broecheler for more discussion around the Desktop and AppStax technology. Mattias explains the focus, ideas behind, and core features that will change how development is done with AppStax!

 Apache Casandra 4.0 Improvements With TheLastPickle Guys | Ep. 108 Distributed Data Show | File Type: audio/x-m4a | Duration: 00:09:39

TheLastPickle, DataStax Accelerate, and exciting updates coming in Apache Cassandra 4.0. Cedrick talks with John Haddad and Alex Dejanovski from TheLastPickle to discuss their presentations at DataStax Accelerate along with Apache Cassandra tools managed by TLP and new updates coming with Cassandra 4.0.

 Performance Heaven with Intel Optane + DSE with Donnie Roberson | Ep. 107 Distributed Data Show | File Type: audio/mpeg | Duration: 00:10:15

Donnie Roberson of the DataStax Partner team joins the show to talk about the amazing performance results observed in running DataStax Enterprise 6 on Intel's latest generation hardware including the Xeon processors and Optane DCPMM, and when and where you might be able to get your hands on this technology.

 Building CICD Pipelines in the Modern Age with Christopher Bradford | Ep. 106 Distributed Data Show | File Type: audio/mpeg | Duration: 00:13:06

Many DSE users have very long upgrade cycles due to time and complexity concerns. Using the CICD methodology Christopher Bradford has taken up the challenge to make the upgrade path both faster and lower risk. Today we get to dive in and take a look at what he has been up to.

 Growing Your Developer Skills with Valerie Parham-Thompson | Ep. 105 Distributed Data Show | File Type: audio/mpeg | Duration: 00:10:42

In this industry, showing the drive to expand one's technological skills is crucial. Valerie personifies this drive, and then some. We met with Valerie to discuss all things Cassandra documentation, her love of open source contributions, and some interesting projects she's working on at Pythian.

 Apache Cassandra's™ Newest Features with Jake Luciani | Ep. 104 Distributed Data Show | File Type: audio/mpeg | Duration: 00:24:39

There are not so many developers who joined the Cassandra Community in the very beginning and then never quit. Jake is one of them: he works with the Cassandra community as a PMC Member and leads a team at DataStax Enterprise for almost a dozen years already. Of course, he attended Datastax Accelerate conference and we didn't miss a chance to ask him a few questions about the past and future of Cassandra & DSE!

 What's new with Cassandra at Instagram | Ep. 103 Distributed Data Show | File Type: audio/mpeg | Duration: 00:24:53

In this episode, Jeff Carpenter talks with Dikang Gu about the origins of Cassandra at Instagram, an update on how the adoption of RocksDB as a storage engine for Cassandra is progressing, geographic data partitioning, and how his team is providing Cassandra as a Service inside Instagram.

 A Topical Journey Into Bulk Loading with Brian Hess | Ep. 102 Distributed Data Show | File Type: audio/mpeg | Duration: 00:11:51

A Topical Journey Into Bulk Loading with Brian Hess | Ep. 102 Distributed Data Show by DataStax Developers

 Cassandra Data Modeling Tools | Ep. 101 Distributed Data Show | File Type: audio/mpeg | Duration: 00:12:36

In this episode Jeff and Adron have a quick topical discussion of some tools they're using to get work done with CQL and databases in general. Adron discusses using JetBrains DataGrip and what it's been enabling him to do, then Jeff interjects with some additional thoughts and asks the question, is Cassandra not your only database? Where Adron elaborates on how DataGrip works with many other databases, so when one is approached with work across a wide spectrum of sources they can tackle that work with DataGrip. Then both Jeff and Adron get into what they'd like to see in next generation IDE's and what they'd like to have tooling around to get the job done!

 Apache Cassandra TM Trends With Aaron Ploetz | Ep. 100 Distributed Data Show | File Type: audio/mpeg | Duration: 00:12:19

This week, Aaron Ploetz joins Eric Zietlow to discuss trends within Apache Cassandra and gives us a few resources on how to learn C*.

Comments

Login or signup comment.