INDEX
Explanations
actions and adjectives associated with mechanical tasks and conditions
New Auto-Interp
Negative Logits
535
-0.14
569
-0.14
ongyang
-0.14
Carp
-0.13
_CP
-0.13
ict
-0.13
appearances
-0.13
lest
-0.13
trous
-0.13
ERP
-0.13
POSITIVE LOGITS
etc
0.17
etc
0.17
.weixin
0.16
ÑĤоÑīо
0.15
à¹īำ
0.15
ovnÃŃ
0.14
uten
0.14
itty
0.14
undry
0.14
apsed
0.13
Activations Density 0.360%