INDEX
Explanations
technical writing
This neuron activates on words indicating repetition (e.g. “repeating”).
New Auto-Interp
Negative Logits
looping
-0.06
Paşa
-0.06
ल
-0.06
maktan
-0.06
equitable
-0.06
ВС
-0.06
,text
-0.05
/NĐ
-0.05
Kas
-0.05
cts
-0.05
POSITIVE LOGITS
condo
0.07
experimented
0.07
yielding
0.07
di
0.07
binds
0.06
fundra
0.06
ictured
0.06
(_.
0.06
Agent
0.06
_seed
0.06
Activations Density 0.006%