INDEX
Explanations
proper nouns related to political figures, government, and official titles
Negative Logits
MpServer    -0.80
ratom       -0.74
essen       -0.69
γ           -0.67
alle        -0.64
lot         -0.63
lighting    -0.61
Medium      -0.61
trace       -0.61
Alien       -0.60
Positive Logits
Barack      1.11
clinton     1.07
Obama       0.97
ially       0.97
Lyndon      0.94
Jinping     0.92
Recep       0.84
ial         0.83
hopeful     0.82
candidate   0.80
Activations Density: 2.821%