INDEX
Explanations
infinitives and gerunds indicating actions or processes
New Auto-Interp
Negative Logits
argar
-0.15
ALA
-0.15
елов
-0.14
ear
-0.14
DataTask
-0.14
Donovan
-0.14
_LOCK
-0.14
alis
-0.14
ecer
-0.14
endor
-0.14
POSITIVE LOGITS
loh
0.19
asu
0.18
aptor
0.16
izzo
0.16
rzy
0.16
edis
0.15
>manual
0.14
iaux
0.14
loor
0.14
ailed
0.14
Activations Density 0.050%