INDEX
Explanations
references to skills, roles, and the complexities of job responsibilities
New Auto-Interp
Negative Logits
they
-0.19
we
-0.18
it
-0.18
you
-0.16
lein
-0.15
he
-0.15
’d
-0.15
no
-0.15
'd
-0.15
-0.14
POSITIVE LOGITS
that
0.37
that
0.35
hogy
0.31
rằng
0.31
ÑĩÑĤо
0.29
that
0.29
Ú©Ùĩ
0.28
bahwa
0.27
daÃŁ
0.25
_THAT
0.25
Activations Density 0.150%