Attention: You are using an outdated browser, device or you do not have the latest version of JavaScript downloaded and so this website may not work as expected. Please download the latest software or switch device to avoid further issues.
With data volumes growing exponentially every year, we need new skills, new tools and new platforms to allow us to both manage and to extract value from these big datasets. Apache Spark is a powerful platform which gives customers new methods for storing, processing and analysing these huge datasets. In this masterclass we will cover:
* What is Apache Spark and what can it do for you?
* The use cases that are most suited to Apache Spark
* What are the alternatives to Apache Spark?
* Follow along with practical examples of processing data with Python and Apache Spark
While this masterclass will be of interest to many within the data science community, it will be of particular value to those who are considering adopting a Big Data processing system and considering Apache Spark as an option. During the session attendees may follow along with worked examples if they have a standard Python / Jupyter Notebooks environment available.
For any queries regarding this event, please contact Rebecca@analyticsinstitute.org