Explanations
verbs related to assistance and support
Negative Logits
ighton      -0.16
burgh       -0.16
ares        -0.15
staking     -0.14
alsy        -0.14
INGS        -0.14
ордин       -0.14
disob       -0.14
crollView   -0.13
aversable   -0.13
Positive Logits
vla         0.17
acht        0.17
owy         0.17
.yy         0.17
vor         0.16
achi        0.16
.cls        0.15
ami         0.15
Tap         0.15
ucht        0.15
Activations Density 0.001%