INDEX
Explanations
references to individuals who previously held notable positions or roles
Negative Logits
kon      -0.15
Ret      -0.15
4        -0.15
261      -0.14
1        -0.14
(        -0.14
broad    -0.14
Bowie    -0.14
-        -0.14
yc       -0.14
Positive Logits
avad     0.16
ediator  0.16
riad     0.15
ARRIER   0.15
ompiler  0.15
utex     0.15
Ngh      0.14
_TAC     0.14
ì±ħ      0.14
eyh      0.14
Activations Density: 0.008%