INDEX
Explanations
verbs that indicate existence or occurrence
New Auto-Interp
Negative Logits
çļĦæĺ¯
-0.16
jong
-0.15
uben
-0.14
onga
-0.14
inge
-0.14
keiten
-0.13
æ¯Ľ
-0.13
.PIPE
-0.13
ILLISE
-0.13
iegel
-0.13
POSITIVE LOGITS
.opend
0.14
obar
0.14
Äįást
0.14
imed
0.14
izza
0.14
ched
0.14
ênh
0.14
allon
0.14
ellas
0.13
yles
0.13
Activations Density 0.168%