Explanations
phrases indicating ongoing existence or continuity
Negative Logits
Token    Logit
ia       -0.16
INU      -0.15
ưng      -0.15
ana      -0.15
urat     -0.14
nu       -0.14
æĮģ      -0.14
trs      -0.13
iu       -0.13
rio      -0.13
Positive Logits
Token          Logit
alike          0.16
.createFrom    0.15
gart           0.15
ispens         0.15
idden          0.14
oplan          0.13
sse            0.13
assen          0.13
athers         0.13
astes          0.13
Activations Density: 0.055%