INDEX
Explanations
mentions of the national anthem
references to national anthems
New Auto-Interp
Negative Logits
Sor
-0.73
erm
-0.70
aido
-0.69
angs
-0.67
Commerce
-0.65
sighted
-0.63
Wraith
-0.62
capitals
-0.62
-0.61
cery
-0.61
POSITIVE LOGITS
anthem
1.07
Anthem
0.96
lyrics
0.81
aepernick
0.80
brance
0.80
isphere
0.76
kneeling
0.75
naire
0.75
Kaepernick
0.73
salute
0.72
Activations Density 0.029%