INDEX
Explanations
concepts related to career choices and societal issues
New Auto-Interp
Negative Logits
Ìģt
-0.16
wyn
-0.14
обов
-0.14
="__
-0.13
("'"-0.13
oble
-0.13
ritch
-0.13
ادر
-0.13
odynam
-0.13
Willi
-0.13
POSITIVE LOGITS
omnia
0.17
ogue
0.15
icz
0.14
emek
0.14
äge
0.14
hec
0.14
buz
0.14
ãĥ¼ãĤ¹
0.13
clc
0.13
lass
0.13
Activations Density 0.237%