INDEX
Explanations
The neuron is looking for the word “introduction” (i.e. prompts asking for an introduction).
New Auto-Interp
Negative Logits
play
-0.06
slots
-0.06
appl
-0.06
Icon
-0.06
playful
-0.06
area
-0.06
čast
-0.06
WINDOWS
-0.06
pad
-0.06
gran
-0.06
POSITIVE LOGITS
dateString
0.07
Liability
0.07
совсем
0.06
@{$0.06
lerce
0.06
PERF
0.06
kah
0.06
_ADMIN
0.06
achable
0.06
/function
0.06
Activations Density 0.004%