INDEX
Explanations
Origins of names/titles
The neuron detects mentions of a work's title or name, i.e. phrases that explain what something is called or where its title comes from.
Negative Logits
b           -0.07
Lv          -0.06
(group      -0.06
prices      -0.06
w           -0.06
Ingredient  -0.06
Tek         -0.06
マ          -0.06
izers       -0.06
Plug        -0.06
Positive Logits
강          0.07
slashes     0.07
режд        0.07
南          0.06
Alternate   0.06
Geneva      0.06
λεύ         0.06
زنی         0.06
NSCoder     0.06
untos       0.06
Activations Density 0.044%
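
For readers unfamiliar with the density figure, here is a minimal sketch of how such a number could be computed. It assumes that activation density means the percentage of sampled token positions on which the neuron's activation exceeds a threshold (zero here); the page itself does not define the metric. The function name `activation_density` and the sample data are hypothetical and chosen only for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of positions whose activation exceeds threshold.

    Assumption: "density" is the share of sampled tokens on which the
    neuron fires at all; the dashboard does not state its definition.
    """
    if not activations:
        return 0.0
    firing = sum(1 for a in activations if a > threshold)
    return 100.0 * firing / len(activations)


# Hypothetical sample: 10,000 token positions, 4 of which activate the neuron.
sample = [0.0] * 9996 + [0.8, 1.3, 0.2, 2.1]
density = activation_density(sample)
print(f"{density:.3f}%")
```

Under this reading, a density of 0.044% would mean the neuron fires on roughly 4 to 5 of every 10,000 sampled tokens, i.e. it is very sparse.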