INDEX
Explanations
negative qualifiers that express limitation or contrast in a narrative
Negative Logits
oba       -0.16
ulum      -0.15
ula       -0.15
overe     -0.14
alue      -0.14
702       -0.14
ieren     -0.14
azon      -0.14
untas     -0.14
unt       -0.14
Positive Logits
far       0.76
far       0.66
Far       0.60
FAR       0.59
Far       0.54
_far      0.49
by        0.46
далеко    0.45
hardly    0.38
_FAR      0.36
Activations Density 0.178%