INDEX
Explanations
mentions of the word "Swift" with varying activation values indicating different strengths of association
mentions of the name "Swift."
New Auto-Interp
Negative Logits
ulhu
-0.70
irin
-0.69
Downloadha
-0.69
chell
-0.69
terior
-0.68
egal
-0.67
berra
-0.66
abases
-0.66
Ars
-0.65
orate
-0.65
POSITIVE LOGITS
Swift
0.84
heart
0.81
ipeg
0.80
Swim
0.77
lings
0.76
song
0.75
omatic
0.75
ies
0.73
blade
0.72
ness
0.72
Activations Density 0.017%