INDEX
Explanations
references to singular entities or unique situations
Negative Logits
or        -0.18
such      -0.18
amage     -0.17
either    -0.17
ones      -0.16
various   -0.16
next      -0.16
инок      -0.16
alike     -0.15
°E        -0.15
Positive Logits
each      0.19
Each      0.18
OTHER     0.18
entire    0.18
Each      0.18
instance  0.17
others    0.17
other     0.17
each      0.17
EACH      0.17
Activations Density 0.011%