INDEX
Explanations
gerunds and present participles that indicate actions
New Auto-Interp
Negative Logits
rix
-0.15
(cf
-0.15
olver
-0.15
phin
-0.14
owell
-0.14
оваÑĢ
-0.14
rong
-0.14
obra
-0.13
oltage
-0.13
amac
-0.13
POSITIVE LOGITS
cakes
0.16
Bond
0.14
bond
0.14
staking
0.14
-placeholder
0.14
Demir
0.14
lint
0.13
ado
0.13
алÑĮ
0.13
Orig
0.13
Activations Density 0.137%