Speed Up Your Data Processing: Parallel and Asynchronous Programming in Python

A presentation at FOSSASIA Summit 2020 in in Singapore by Ong Chin Hwee

In a data science project, one of the biggest bottlenecks (in terms of time) is the constant wait for the data processing code to finish executing. Slow code, as well as connectivity issues, affect every step of a typical data science workflow — be it for event-driven I/O operations or computation-driven workloads. Through real-life analogies based on my experience in a young data science team getting started with real-world data, I will be exploring the use of parallel and asynchronous programming in Python to speed up your data processing pipelines so that you could focus more on getting value out of your data.

Video

Buzz and feedback

Here’s what was said about this presentation on social media.