Solving a real world business problem for a fictitious subscription based music streaming service called “Sparkify” using PySpark

Introduction

This post is a discussion about the capstone project that I carried out as part of the Udacity Data Scientist Nanodegree. This project was a chance to learn about big data and using PySpark to process it.

The dataset for this project was provided by Udacity in partnership with Insight Data Science. It contains event log data generated by users of the service Sparkify.

The main source of revenue for Sparkify, like most subscription based business models, is from user subscription fees…


An analysis of the posts on the subreddit that started it all.

A graph of the GameStop stock price movement
A graph of the GameStop stock price movement

Introduction

The last couple of weeks have been interesting to say the least in the world of stocks.

Prices of stocks like GameStop, AMC Theatres and Nokia have soared and remained volatile.

There are plenty of articles out there covering the intricacies of the events but in this post we will look into the subreddit that started it all.

A Reddit group by the name of WallStreetBets was central to all the plotting which encouraged individuals to buy stocks in revolt.

This is an analysis of the posts on…

Vishnu N Raju

A huge Chelsea fan!

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store