INDEX
Explanations
phrases related to naming or identifying specific things or individuals
phrases that describe the current state or identification of subjects
New Auto-Interp
Negative Logits
gap
-0.66
azar
-0.61
conn
-0.60
ertodd
-0.58
atana
-0.56
acio
-0.54
gaps
-0.54
ason
-0.53
NAACP
-0.53
Monitoring
-0.53
POSITIVE LOGITS
wont
1.16
nowadays
0.90
today
0.87
now
0.79
lished
0.77
presently
0.75
today
0.73
named
0.71
portrayed
0.71
anyways
0.71
Activations Density 0.141%