INDEX
Explanations
mentions of the word "Swift" with varying levels of importance assigned to different contexts
mentions of the artist Taylor Swift
New Auto-Interp
Negative Logits
berra
-0.79
abases
-0.77
sembly
-0.72
terior
-0.71
abama
-0.71
chell
-0.69
ABE
-0.66
++++
-0.65
gdala
-0.65
pse
-0.65
POSITIVE LOGITS
heart
0.84
song
0.81
lings
0.77
Swim
0.73
Berry
0.73
Movie
0.73
Swift
0.72
Key
0.71
ies
0.71
lance
0.71
Activations Density 0.047%