Explanations
mentions of the word "brass"
references to military or authoritative groups and their associations
Negative Logits
pend          -0.90
mia           -0.83
teenth        -0.66
DonaldTrump   -0.64
Stanton       -0.63
psychiat      -0.63
horizont      -0.63
aundering     -0.61
aving         -0.60
discont       -0.59
Positive Logits
glers         1.27
ilage         0.92
ular          0.87
osal          0.86
uality        0.82
Rouge         0.78
ues           0.78
Champ         0.77
xon           0.77
acular        0.76
Activations Density 0.050%