INDEX
Explanations
adverbs that describe manner, intensity, or frequency
New Auto-Interp
Negative Logits
ervo
-0.18
esub
-0.16
IBE
-0.14
illon
-0.14
odont
-0.14
μεÏģ
-0.14
oksen
-0.13
okud
-0.13
OMIC
-0.13
agna
-0.13
POSITIVE LOGITS
-*-č↵
0.16
mente
0.14
erre
0.14
wards
0.14
684
0.14
681
0.14
alls
0.14
Záp
0.14
/at
0.13
zeitig
0.13
Activations Density 0.391%