INDEX
Explanations
present tense verbs indicating ongoing activities or positions
New Auto-Interp
Negative Logits
iola
-0.16
nze
-0.15
rello
-0.15
ette
-0.14
aginator
-0.14
ož
-0.14
ado
-0.14
ADO
-0.14
yster
-0.14
eder
-0.14
POSITIVE LOGITS
isas
0.16
EDIA
0.16
èĹ
0.15
dq
0.14
APR
0.14
Theme
0.14
elix
0.14
ãĥ¼ãĤ¿ãĥ¼
0.14
alian
0.13
cla
0.13
Activations Density 0.037%