INDEX
Explanations
constructions expressing states of being or existence
New Auto-Interp
Negative Logits
chet
-0.18
Want
-0.15
fark
-0.14
ابÛĮ
-0.14
elix
-0.14
aukee
-0.14
Spo
-0.14
IPLE
-0.13
ieder
-0.13
IED
-0.13
POSITIVE LOGITS
proceed
0.29
Proceed
0.28
proceeding
0.26
proceeds
0.24
trust
0.23
proceeded
0.23
trusted
0.23
follow
0.23
Proceed
0.22
handled
0.22
Activations Density 0.006%