INDEX
Explanations
phrases related to national anthems
references to the national anthem
New Auto-Interp
Negative Logits
erm
-0.73
aido
-0.71
Sor
-0.68
fram
-0.66
angs
-0.65
rive
-0.64
CI
-0.64
oming
-0.64
err
-0.64
remote
-0.63
POSITIVE LOGITS
anthem
1.13
Anthem
0.97
kneeling
0.87
salute
0.84
lyrics
0.80
kne
0.80
Parade
0.78
brance
0.78
chant
0.75
chants
0.75
Activations Density 0.042%