INDEX
Explanations
names of political figures
prominent names of political figures and their associated actions or statements
New Auto-Interp
Negative Logits
OAD
-0.63
girls
-0.63
olves
-0.60
antz
-0.60
EStreamFrame
-0.60
Split
-0.59
increments
-0.59
mats
-0.57
outputs
-0.57
atile
-0.57
POSITIVE LOGITS
meanwhile
1.23
who
1.20
who
1.12
whose
1.07
speaking
1.04
whose
0.99
whom
0.97
reacting
0.96
flanked
0.92
citing
0.92
Activations Density 0.131%