INDEX
Explanations
mentions of occupations and roles, particularly in professional contexts
New Auto-Interp
Negative Logits
mej
-0.16
132
-0.14
iant
-0.14
uko
-0.14
echa
-0.14
opak
-0.14
iene
-0.14
Multiplicity
-0.13
Cv
-0.13
ycz
-0.13
POSITIVE LOGITS
she
0.19
ê·¸ëĬĶ
0.18
à¤īसन
0.17
νÏī
0.16
saya
0.16
usted
0.15
he
0.15
you
0.15
она
0.15
ogan
0.15
Activations Density 0.175%