INDEX
Explanations
references to specific job positions
New Auto-Interp
Negative Logits
apon
-0.17
lore
-0.17
usi
-0.17
lear
-0.16
ends
-0.15
uya
-0.15
kond
-0.15
é¾Ħ
-0.15
opc
-0.15
wo
-0.15
POSITIVE LOGITS
ality
0.35
nement
0.34
ally
0.28
ning
0.24
naire
0.23
als
0.23
naires
0.23
nable
0.21
ned
0.20
:relative
0.20
Activations Density 0.034%