INDEX
Explanations
adverbs describing manner and frequency
Negative Logits
esser     -0.15
.jet      -0.14
avia      -0.13
inator    -0.13
esk       -0.13
렴        -0.13
ighbor    -0.13
IBE       -0.13
ember     -0.13
emic      -0.13
Positive Logits
ๆ         0.15
erre      0.14
mente     0.14
obra      0.14
повід     0.13
mẽ        0.13
engu      0.13
apon      0.13
alls      0.13
orda      0.13
Activations Density 0.428%