INDEX
Explanations
occurrences of verbs associated with actions and events
Negative Logits
sowie    -0.21
erman    -0.14
ridge    -0.14
åĩī      -0.14
çıŃ      -0.14
zman     -0.13
rophe    -0.13
angi     -0.13
anvas    -0.13
jam      -0.13
Positive Logits
547      0.17
olas     0.17
llen     0.15
ullan    0.15
agos     0.15
kem      0.14
crest    0.14
allest   0.14
indeb    0.14
tte      0.14
Activations Density: 0.028%