Solving a real-world business problem for a fictitious subscription-based music streaming service called “Sparkify” using PySpark
This post discusses the capstone project that I carried out as part of the Udacity Data Scientist Nanodegree. The project was a chance to learn about big data and how to process it with PySpark.
The dataset for this project was provided by Udacity in partnership with Insight Data Science. It contains event log data generated by users of the service Sparkify.
The main source of revenue for Sparkify, as for most subscription-based businesses, is user subscription fees…
An analysis of the posts on the subreddit that started it all.
The last couple of weeks have been interesting, to say the least, in the world of stocks.
Prices of stocks like GameStop, AMC Theatres and Nokia have soared and remained volatile.
There are plenty of articles out there covering the intricacies of the events, but in this post we will look into the subreddit that started it all.
A Reddit group called WallStreetBets was central to the plotting, which encouraged individuals to buy stocks in revolt.
A huge Chelsea fan!