INDEX
Explanations
active sensory perception and internal states
New Auto-Interp
Negative Logits
廃
0.84
删除
0.78
ধ্বংস
0.75
parado
0.75
propertyName
0.71
supprimer
0.70
demolish
0.70
wiping
0.69
penampilan
0.68
JAK
0.68
POSITIVE LOGITS
tells
1.16
tell
1.04
screamed
1.02
rebel
1.00
telling
0.99
refuses
0.96
flared
0.94
balk
0.94
told
0.93
betray
0.93
Activations Density 0.070%