INDEX
Explanations
terms related to hierarchy or classification in organizational or developmental contexts
Negative Logits
Token   Logit
ary     -0.18
liness  -0.17
ese     -0.16
mong    -0.15
ings    -0.15
mund    -0.15
ivo     -0.14
iness   -0.14
ible    -0.14
mar     -0.14
Positive Logits
Token       Logit
ity         0.23
/main       0.19
-secondary  0.18
ities       0.17
ologne      0.17
nty         0.16
pha         0.16
ehler       0.15
yyy         0.15
idade       0.15
Activations Density 0.029%