Popularity#
We used Data on Songs from Billboard 1999-2019, specifically songAttributes_1999-2019.csv. The graph shows the spread of the popularity of songs per artist. This popularity metric (0-100) is calculated by Spotify and we have no insight on how it is calculated. It only shows artists with a median popularity higher than 45 to keep readability.
Here we can see that the artist Tool has the highest popularity for his least popular song. It is interesting to note that Taylor swift and Ed Sheeran both have the most popular song with a popularity rating of 91.
Show code cell source
import plotly.graph_objs as go
import plotly.express as px
import pandas as pd
df = pd.read_csv('../cleaned/songAttributes_1999-2019.csv', index_col=0)
df2 = df.groupby('Artist', as_index=False).agg({'Popularity': 'median'})
df2 = df2.loc[df2['Popularity'] > 48]
df = df.loc[df['Artist'].isin(df2['Artist'])]
df = df.sort_values('Popularity')
px.box(df, x='Artist', y='Popularity').show()