hiphop_nlp_webscrape

Has Hip-Hop Gotten Worse? - Web Scraping and NLP Project

banner

SOURCE

Links

Table of contents

Background

Hip-hop/rap music was founded in the 1970’s and became widespread during the 80’s, kicking off the decade-long era known as the “golden age” of hip-hop (NPR).

Today, hip-hop/rap is music’s most popular and commercially successful genre. Every region has its own style of rap and numerous musical and cultural trends have defined hip-hop over the years.

Many fans believe that rap music has gotten worse over the years, crediting this decline to the rise of what is called mumble rap, a style characterized by artists rapping inchorently and/or melodically and using a variety of ad-libs as opposed to actual lyrics. But, while mumble rap has certainly risen in popularity, does this mean that “good” rap, music filled with clever and meaningful lyrics, has vanished from the genre?

To answer this question, I will be analyzing the lyrics of the best rap song from every year from 1990 to 2020.

(Back to top)

Song Selection

Songs from 1990-2003 will be pulled from Complex article, “The Best Rap Song, Every Year Since 1979” and the songs from 2004-2020 will be the winners of the Grammy Award for Best Rap Song (first awarded in 2004).

(Back to top)

Lyrics Source

The lyrics for each will be scraped from AZLyrics. Since these lyrics are user submitted, the formatting of lyrics is variable. This variation is largely in how chorus lyrics are posted. For example, one song may have all the lyrics to the entire chorus typed out each time whereas another may just use ‘[Chorus]’ or ‘[2x]’ to avoid repetition. My hope is that this will not affect the metrics too much, as songs with shorter/less repetitive choruses can indicate a song’s greater substance and lyrical quality.

(Back to top)

Metrics:

(Back to top)

Here are some quick definitions of terms used in this project:

(Back to top)

Goal

As stated above, this project has some limitations in terms of the consistency of lyric data, the extent to which the packages/tools used can analyze rap lyrics, and the overall subjectivity of this topic. Ultimately, I started this project as a fun way to both explore one of my interests and utilize the skills and technical knowledge I have learned thus far. But, I certainly believe that the results of this project can give insight into how rap music has evolved over the years.

(Back to top)

Usage

Please refer to the Jupyter Notebook viewer to view all the code and visualizations created during this project.

The source files contain all the functions used to web scrape, process the text, and calcualte the metrics used for the project.

(Back to top)

Findings

While all four metrics seemed to decline over the years, % Unique Rhymes to All Rhymes (number of unique rhymes / number of all rhymes) proved to be the metric with:

scatter

Even though what qualifies as “good music” will always be subjective, quantifying the quality of lyrics in these songs proved to be insightful. The generally weak relationships between each metric and year indicate that any “decline” in hip-hop/rap music may not be as strong as some would assume.

As mentioned above, hip-hop has become the most popular genre of music. With this ever increasing popularity comes more commercial and lucrative opportunities, and such opportunities are not necessarily conducive to lyrically complex and intricately crafted songs. The genre becoming more commerical and marketable does not mean that there are no lyrically interesting songs being made. But, this trend may contribute to an oversaturated market, where mostly catchy, less intricate songs become popular.

Overall, this project gives evidence that hip-hop as a genre has not seen a dramatic decline. Instead, changing trends in the music industry and how the public consumes media may affect what types of songs become most popular, but not necessarily the skills of all rappers.

(Back to top)

Source Files

## Legality

This personal project was made for the sole intent of applying my skills in Python thus far and as a way to learn new ones. It is intended for non-commercial uses only.

Some issues with webscraping from AZLyrics arose as I was developing this project because the website detected an unusual amount of activity. An alternative to AZLyrics is archive.org, a website that regularly stores archives for various webpages. Nonetheless, using the Jupyter Notebook viewer should not present any issues.

(Back to top)