INDEX
Explanations
references to symbolic actions or gestures
New Auto-Interp
Negative Logits
ondo
-0.85
zona
-0.78
tics
-0.77
elsen
-0.76
Nightmares
-0.73
raid
-0.72
ãĥ¯ãĥ³
-0.71
gres
-0.71
vae
-0.70
seed
-0.68
POSITIVE LOGITS
gesture
1.44
gestures
1.24
toward
1.03
towards
1.01
greeting
0.90
salute
0.81
recognition
0.78
dexterity
0.77
waving
0.74
indicating
0.72
Activations Density 0.008%