INDEX
Explanations
references to government officials and their titles
New Auto-Interp
Negative Logits
tures
-0.15
erspective
-0.15
utz
-0.15
ilt
-0.15
358
-0.14
лиÑĨ
-0.14
ics
-0.14
SSIP
-0.14
idas
-0.14
uke
-0.14
POSITIVE LOGITS
-General
0.25
ariat
0.23
-general
0.23
aries
0.20
secretary
0.19
-secret
0.19
ship
0.19
arial
0.18
retry
0.18
Secretary
0.16
Activations Density 0.013%