Explanations
github pages
The neuron activates on mentions of GitHub Pages hosting (e.g. “github.io,” “Pages,” Jekyll site references).
Negative Logits
Ir          -0.06
duk         -0.06
Depending   -0.06
енью        -0.06
Santa       -0.06
usive       -0.06
gameplay    -0.06
Συν         -0.06
oret        -0.06
hence       -0.06
Positive Logits
Limit       0.07
stood       0.07
-nav        0.07
third       0.07
(($         0.07
Би          0.07
торгів      0.06
//------------------------------------------------   0.06
Oliv        0.06
tipo        0.06
Activations Density 0.005%