INDEX
Explanations
phrases indicating choices and options available to individuals or groups
New Auto-Interp
Negative Logits
ĻĤ
-0.17
ipi
-0.17
WI
-0.16
unte
-0.14
obi
-0.14
acro
-0.14
ég
-0.14
aca
-0.14
mak
-0.14
ecast
-0.14
POSITIVE LOGITS
Skeleton
0.16
orp
0.15
ETS
0.15
ople
0.15
chten
0.15
uyá»ĩt
0.14
emoth
0.14
onavir
0.14
kino
0.14
ustos
0.14
Activations Density 0.079%