INDEX
Explanations
mentions of specific celebrities, particularly Taylor Swift
mentions of popular music artists, especially Taylor Swift
New Auto-Interp
Negative Logits
glim
-0.85
rongh
-0.74
oard
-0.71
Manit
-0.71
ranc
-0.69
olulu
-0.68
ruciating
-0.67
buffer
-0.67
sembly
-0.65
std
-0.64
POSITIVE LOGITS
lyrics
1.07
Beyon
1.00
Kardashian
0.96
albums
0.95
Album
0.93
album
0.92
Songs
0.91
songs
0.90
Eminem
0.90
merch
0.89
Activations Density 0.210%