INDEX
Explanations
adverbs and their various forms in the text
New Auto-Interp
Negative Logits
540
-0.18
pmat
-0.15
tÃŃm
-0.14
407
-0.14
phan
-0.14
idar
-0.14
Äįi
-0.14
/all
-0.14
ÙĪÙĨد
-0.14
aycast
-0.14
POSITIVE LOGITS
ly
0.20
ably
0.18
Tol
0.17
NESS
0.16
ioc
0.16
elly
0.15
ingly
0.15
asil
0.15
Toll
0.14
ely
0.14
Activations Density 0.131%