SkillAgentSearch skills...

Datadata

Pulling data using both the Genius API and the Spotify API I've been able to analyze Jorge Drexler's music and get some insights and visualizations on his creative process and his songs in general; both from the lyrics side and the musical theory side. Wordcount, lexical and lyrical density, sentiment analysis and analysis of musical components like tempo, time signature and key are all taken into account. Also, in the end, gloom_index is used combining both lyrics and music.

Install / Use

/learn @alexing/Datadata
About this skill

Quality Score

0/100

Supported Platforms

Universal

README

Data, data

A statistical analysis and exploration on Jorge Drexler's music and lyrics.

by Alex Ingberg

Data, data is a homage to the great Uruguayan musician and songwriter Jorge Drexler.

Pulling data using both the Genius API and the Spotify API I've been able to analyze Jorge's music and get some insights and visualizations on his creative process and his songs in general; both from the lyrics side and the musical theory side.

Wordcount, lexical and lyrical density, sentiment analysis and analysis of musical components like tempo, time signature and key are all taken into account. Also, in the end, gloom_index is used combining both lyrics and music.

To check the analysis, go here.

To see the article published in Towards Data Science, go here

To check its spanish translation, go here

To check how i built the database, go here.

Some cool samples from the visualizations:

NRC emotions through the years Tempo by albums Usage of keys Top 10 songs with more words Wordcloud Lyrical density vs lexical density Correlation in negative NRC emotions

This whole project has been created using Python 3, Jupyter Notebook and a little bit of PyCharm.

I created the databases with pandas, BeautifulSoup, Spotipy (an amazing Python wrapper for the Web Spotify API), and the Genius API and Web Spotify API.

To work on the analysis the tools I used were pandas, NumPy, Matplotlib, Seaborn, scikit-learn, SciPy, Natural Language Toolkit, wordcloud and py-lex.

To check the analysis, go here.

To check how i built the database, go here.

References:

[1] Text analysis in Pandas with some TF-IDF (again) by Jakub Nowacki

[2] Everything in Its Right Place: Visualization and Content Analysis of Radiohead Lyrics by Myles Harrison

[3] Data Visualization and Analysis of Taylor Swift’s Song Lyrics by Preetish Panda

[4] fitteR happieR by Rcharlie

[5] Quantifying Sufjan Stevens with the Genius API and NLTK by Jonathan Evans

View on GitHub
GitHub Stars25
CategoryDevelopment
Updated2mo ago
Forks7

Languages

Jupyter Notebook

Security Score

75/100

Audited on Jan 20, 2026

No findings