INDEX
Explanations
specific names and titles related to individuals in professional roles
New Auto-Interp
Negative Logits
eva
-0.16
uzu
-0.15
ivor
-0.13
å®ħ
-0.13
...
-0.13
_SLOT
-0.13
tuto
-0.13
šak
-0.13
âĻª
-0.12
edii
-0.12
POSITIVE LOGITS
covid
0.21
Event
0.19
psy
0.17
EVENT
0.17
Klaus
0.16
Covid
0.16
tranny
0.15
global
0.15
lobal
0.15
ÑģÑĤÑĥп
0.15
Activations Density 0.003%