Explanations
adverbs that express certainty or frequency
Negative Logits
Token      Logit
nt         -0.14
untime     -0.14
iface      -0.14
_tF        -0.14
'll        -0.14
/w         -0.13
igan       -0.13
entials    -0.13
ä»¶        -0.13
çħ§        -0.13
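The last two negative-logit tokens (`ä»¶` and `çħ§`) look like byte-level BPE vocabulary strings rather than readable text. The sketch below assumes the model uses a GPT-2-style byte-level tokenizer, which this page does not state, and rebuilds that scheme's byte-to-character table to recover the underlying UTF-8 text. The helper names `bytes_to_unicode` and `decode_token` are illustrative and do not come from the dashboard.

```python
def bytes_to_unicode():
    """Byte -> printable character table, as used by GPT-2-style byte-level BPE.

    Assumption: the dashboard's model uses this scheme.
    """
    # Bytes that are already printable map to themselves.
    bs = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(0xA1, 0xAC + 1))
        + list(range(0xAE, 0xFF + 1))
    )
    cs = bs[:]
    n = 0
    # Every remaining byte is shifted to an unused code point at 256 + n.
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return dict(zip(bs, (chr(c) for c in cs)))


# Inverse table: displayed character -> original byte value.
UNICODE_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(token: str) -> str:
    """Convert a displayed vocabulary string back to the text it encodes."""
    raw = bytes(UNICODE_TO_BYTE[ch] for ch in token)
    return raw.decode("utf-8", errors="replace")


for tok in ["ä»¶", "çħ§"]:
    print(repr(tok), "->", decode_token(tok))
```

Under that assumption, the two tokens decode to the CJK characters 件 and 照. If the model uses a different tokenizer, the mapping does not apply and the strings should be read as displayed.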
Positive Logits
Token        Logit
been         0.20
most         0.20
be           0.20
ly           0.18
LY           0.17
yyy          0.16
(?)          0.16
ifi          0.15
wise         0.15
JsonValue    0.15
Activations Density: 0.240%
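The density figure can be read as the share of tokens on which the feature fires at all. The sketch below assumes that definition (activation strictly above zero, divided by total tokens), which this page does not spell out. The function name `activation_density` and the sample values are made up for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of tokens whose activation exceeds `threshold`.

    Assumption: density means (tokens with activation > 0) / (all tokens).
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 2,500 tokens, the feature fires on 6 of them.
sample = [0.0] * 2500
for i in (3, 400, 999, 1500, 2000, 2499):
    sample[i] = 1.7

print(f"{activation_density(sample):.3%}")
```

On this made-up sample the feature fires on 6 of 2,500 tokens, which matches the order of magnitude shown above: a sparse feature that is silent on the vast majority of tokens.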