Explanations
download/install
The neuron detects words and phrases associated with downloading and installing software (e.g., “download,” “installation,” URLs for download pages).
Negative Logits
Token     Logit
rad       -0.07
универ    -0.07
िस        -0.07
spheres   -0.06
\xc       -0.06
от        -0.06
複        -0.06
س         -0.06
cores     -0.06
runes     -0.06
Positive Logits
Token      Logit
soir       0.07
Pier       0.06
-food      0.06
isOpen     0.06
saddened   0.06
Quyết      0.06
overnight  0.06
формы      0.06
年度       0.06
setup      0.06
Activations Density: 0.012%