I’d rather be building

Investigating Global COVID-19 Vaccinations


Introduction

I’m curious to see how vaccination rates compare between countries and within countries (specifically the United States, India, and Israel).

Israel is the country with the fastest vaccination rate, already surpassing 30% of the total population vaccinated. I’m curious to see how this compares to the other local Arab states (I know the UAE is in second in terms of vaccination rate), and how vaccination rates among Palestinians compare, if that data is available.

Read more ⟶

Defining Music Services: Artist-facing vs. Listener-facing


Introduction

When people think of music services they often think of streaming services like Spotify, Apple Music, and the like. While these are the largest players in the music services industry, streaming is far from the only service provided in the music industry. Music services can largely be split into two categories: listener-facing and artist-facing services. Listener-facing services are those which are geared towards helping listeners find new music (music discovery), save and store music they like, and listen to music by streaming remotely (Spotify) or playing from local storage (iTunes). Artist-facing services are defined as follows: “One, it makes money for the artist, and two, it helps them to acquire and engage fans. So anything from ticketing, email marketing, retail, strategic technology, licensing and now the Songkick concert-discovery business” (MusicAlly). Additionally, distribution and discovery services help artists host their music on streaming services and get found by new fans, so they also fall under the definition of artist-facing services. I would add that services which facilitate the creative process are also artist-facing services because although they do not directly make money or draw fans, they play an important part in creating the music which does.

Read more ⟶

Problems in the Music Industry: The Need for Better Recommenders and Improved Music Discovery


Introduction

Since artist compensation is dependent on the number of streams an artist can generate and the barrier of entry into music, in terms of distribution, is much lower, the entire pool of music available for streaming has become oversaturated with subpar songs and, more generally, the amount of readily streamable music is increasing rapidly. Building improved music discovery services and recommendation systems would solve these issues.

Artist Compensation and Counting Streams

Firstly, the current model for compensating artists (adopted by Spotify and Apple Music, the two largest music streaming service providers) is entirely dependent on the number of streams an artist can generate. Therefore, in the interest of making more of their product, artists have started to put out albums with high numbers of individual tracks. “Releases by Migos (Culture II, 24 tracks), Rae Sremmurd (SR3MM, 27 tracks), and Drake (Views, 20 tracks; More Life, 22; Scorpion, 25) don’t encourage being heard in full on one sitting, in fact, they actively discourage repeat listens. They’re intended to be cherry-picked for playlist consumption because it results in bigger streams and therefore higher chart positions.” (Triple-J)

Read more ⟶

Genre Classification and Recommendations Using the FMA Dataset


Introduction

Working with the FMA dataset which contains audio features and free music snippets on 100k+ tracks. This is in the interest of working towards my own MIR using deep learning to extract audio features from audio signals (probably using GTZAN). The data comes from the FMA dataset. Let’s start by training a model to do feature extraction given an MFCC (derived from an audio signal). We can use the Echonest dataset which comes with audio features already extracted and use the FMA set with librosa to generate the MFCC.

Read more ⟶

Recommendations Based on Pitchfork Written Reviews


Introduction

Setup

First, let’s make sure we’re in the correct working directory and have all the necessary packages imported. We’ll also read in the dataset–It’s been cleaned beforehand on my local machine; a simple process to get it into a csv rather than the SQLLite db I had downloaded. Finally, we’ll set a flag to indicate which vectorizer we are using in the interest of testing both and comparing (A/B testing).

Read more ⟶