INDEX
Explanations
instances of holding or supporting actions
Negative Logits
tongues     -0.18
erland      -0.16
ancock      -0.15
ometr       -0.15
SOLE        -0.14
chaining    -0.14
haystack    -0.14
elige       -0.14
inia        -0.14
361         -0.13
Positive Logits
held        0.40
held        0.38
-held       0.35
Held        0.33
hold        0.29
Hold        0.28
hold        0.28
holds       0.28
Hold        0.28
holding     0.27
Activations Density: 0.147%