INDEX
Explanations
references to political figures and their roles
New Auto-Interp
Negative Logits
Governors
-0.18
governor
-0.17
governors
-0.17
psilon
-0.17
Governor
-0.16
regulators
-0.16
ija
-0.16
è¹
-0.16
enko
-0.16
ynes
-0.16
POSITIVE LOGITS
Speaker
0.44
speaker
0.40
Speaker
0.39
Majority
0.35
Speakers
0.34
speaker
0.32
Leader
0.30
peaker
0.29
speakers
0.29
majority
0.28
Activations Density 0.117%