INDEX
Explanations
This neuron detects references to the “tool” selection list or tool-related headings in the instruction text.
New Auto-Interp
Negative Logits
Wars
-0.07
SKU
-0.07
↵ ↵
-0.07
motives
-0.07
Wright
-0.07
Save
-0.06
choir
-0.06
rivals
-0.06
มา
-0.06
hated
-0.06
POSITIVE LOGITS
uppe
0.07
.tabPage
0.07
dialog
0.07
Harmon
0.06
užel
0.06
raising
0.06
@FindBy
0.06
leth
0.06
09
0.06
.toolStripMenuItem
0.06
Activations Density 0.001%