INDEX
Explanations
limits and maximums
The neuron detects mentions of the model’s input‐length or token‐limit capabilities and related guidance on conciseness.
New Auto-Interp
Negative Logits
<y
-0.06
ğim
-0.06
SEE
-0.06
indeed
-0.06
woff
-0.06
Blockly
-0.06
meziná
-0.06
;';↵
-0.06
interpersonal
-0.06
itial
-0.06
POSITIVE LOGITS
Celebr
0.07
]=(
0.07
Weapons
0.06
perí
0.06
建築
0.06
vd
0.06
resemblance
0.06
Angles
0.06
verir
0.06
dí
0.06
Activations Density 0.034%