INDEX
Explanations
different types or varieties of items
phrases indicating different types or categories of subjects
Negative Logits
esses    -0.85
ĸļ       -0.74
oult     -0.74
lest     -0.73
ellen    -0.71
lund     -0.70
Clar     -0.70
Below    -0.68
erves    -0.68
eks      -0.68
Positive Logits
equipment       0.82
clothing        0.82
person          0.81
intermediate    0.78
medicine        0.78
activity        0.78
weaponry        0.78
measurement     0.77
crossover       0.76
manif           0.76
Activations Density: 0.076%