INDEX
Explanations
ordinal numbers
references to rankings or placements in comparisons
New Auto-Interp
Negative Logits
uria
-0.64
rina
-0.64
afety
-0.59
uristic
-0.58
teness
-0.58
Machina
-0.58
arah
-0.56
una
-0.55
ogenesis
-0.55
zar
-0.54
POSITIVE LOGITS
respectively
2.39
alike
1.68
together
1.14
respective
1.12
jointly
1.08
together
1.06
both
1.02
depending
0.95
interchange
0.92
combined
0.90
Activations Density 0.821%