INDEX
Explanations
instances of the article "a" and phrases indicating quantity or specificity
Negative Logits
acker          -0.15
stå            -0.14
zeit           -0.14
在             -0.14
andas          -0.13
shelves        -0.13
spokeswoman    -0.13
_UNUSED        -0.13
lov            -0.13
possibility    -0.13
Positive Logits
manner         0.53
nutshell       0.39
way            0.39
hurry          0.36
fashion        0.33
effort         0.29
sposób         0.28
bid            0.26
format         0.25
context        0.25
Activations Density 0.149%