INDEX
Explanations
describing actions and expressions
New Auto-Interp
Negative Logits
desperate
0.59
desesper
0.50
desperation
0.50
screaming
0.47
懵
0.46
desperately
0.43
cry
0.43
screamed
0.43
scream
0.42
crying
0.42
POSITIVE LOGITS
gesturing
0.67
walked
0.66
gest
0.64
踱
0.64
gestures
0.64
glanced
0.63
拿起
0.61
smiled
0.60
Gest
0.58
glance
0.58
Activations Density 0.038%