INDEX
Explanations
phrases indicating reasons or excuses
New Auto-Interp
Negative Logits
somewhat
-0.22
incare
-0.20
ÑĮко
-0.16
áo
-0.15
zwar
-0.14
som
-0.14
ivor
-0.14
ê´
-0.14
Kund
-0.14
credited
-0.13
POSITIVE LOGITS
nor
0.31
WHATSOEVER
0.28
whatsoever
0.28
nor
0.27
nÃło
0.24
Nor
0.23
except
0.20
кÑĢоме
0.20
Nor
0.19
except
0.19
Activations Density 0.096%