Breaking Pandas

Channel:
Subscribers:
725,000
Published on ● Video Link: https://www.youtube.com/watch?v=BFFEBDTMI98



Duration: 25:38
6,121 views
92


pandas is more than 10 years old now. In this time, it became almost a standard for building data pipelines and perform data analysis in Python. As the popularity of the project grows, it also grows the number of projects that depend or interact with pandas.

This talk will cover this ecosystem of projects around pandas, mainly in the prespective of scalability and performance. Discussing for example how projects like Arrow are key for the future of pandas, or how Dask is overcoming pandas limitations.

In a first part, the talk will focus on pandas itself, its components, and its architecture. This will give the required context for a second part, that will explain related projects, how they interact with pandas, and what the whole ecosystem can offer to users.

EVENT:

PyLondinium19

SPEAKER:

Marc Garcia

PUBLICATION PERMISSIONS:

Original video was published with the Creative Commons Attribution license (reuse allowed).

ATTRIBUTION CREDITS:

Original video source: https://www.youtube.com/watch?v=a5EUV6dsCPY







Tags:
pandas
python
data science
data analysis