INDEX
Explanations
phrases indicating degrees of qualification or exceptions in statements
New Auto-Interp
Negative Logits
ingers
-0.17
umer
-0.16
Ñģклад
-0.14
amine
-0.14
oulos
-0.14
å§Ķåĵ¡
-0.14
zsche
-0.14
á»ĵ
-0.14
oÄŁ
-0.13
illisecond
-0.13
POSITIVE LOGITS
means
0.37
Means
0.35
accounts
0.34
stretch
0.32
means
0.31
standards
0.31
measures
0.30
account
0.28
measure
0.28
Stretch
0.25
Activations Density 0.024%