INDEX
Explanations
short descriptions/information
The neuron detects instruction words in the prompt that tell the model to generate or rewrite text, for example “write,” “short,” “description,” and “headline.”
Negative Logits
pirate      -0.07
LCD         -0.07
-and        -0.06
refs        -0.06
ViewPager   -0.06
gcd         -0.06
Whenever    -0.06
(embed      -0.06
(gc         -0.06
arged       -0.06
Positive Logits
涉                  0.06
فرمان               0.06
JSName             0.06
(no token shown)   0.06
approve            0.06
ΔE                 0.06
dg                 0.06
super              0.06
Loren              0.05
donna              0.05
Activations Density 0.005%
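
The page does not say how the logit lists or the activation density are computed. As a rough sketch only, one common approach is to project a neuron's output weight vector through the model's unembedding matrix and rank the vocabulary by the resulting per-token logit effect. Density is then taken to be the fraction of tokens on which the neuron is active. Every name below (`w_out`, `W_U`, `vocab`, `threshold`) is hypothetical, and the numbers are toy values, not data from this neuron.

```python
import numpy as np


def top_logit_tokens(w_out, W_U, vocab, k=10):
    """Return the k most negative and k most positive (token, logit) pairs.

    w_out : (d_model,) hypothetical output weight vector of the neuron
    W_U   : (d_model, vocab_size) hypothetical unembedding matrix
    vocab : list of token strings, length vocab_size
    """
    effects = np.asarray(w_out) @ np.asarray(W_U)  # one logit effect per token
    order = np.argsort(effects)                    # ascending order
    negative = [(vocab[i], float(effects[i])) for i in order[:k]]
    positive = [(vocab[i], float(effects[i])) for i in order[::-1][:k]]
    return negative, positive


def activation_density(activations, threshold=0.0):
    """Fraction of tokens whose activation exceeds the threshold (assumed definition)."""
    acts = np.asarray(activations, dtype=float)
    return float((acts > threshold).mean())


# Toy usage with made-up values (not taken from the dashboard above).
toy_w_out = np.array([1.0, -1.0])
toy_W_U = np.array([[0.5, -0.2, 0.0],
                    [0.1, 0.3, -0.3]])
toy_vocab = ["a", "b", "c"]

neg, pos = top_logit_tokens(toy_w_out, toy_W_U, toy_vocab, k=1)
density = activation_density([0.0, 0.0, 0.0, 2.0])
print(neg, pos, density)
```

Under this reading, a density of 0.005% would mean the neuron is active on roughly 1 in 20,000 tokens, and logit effects as small as ±0.06 would indicate that the neuron has very little direct influence on any single output token.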