INDEX
Explanations
The neuron activates on words used to attribute or credit sponsors and presenters (e.g. “by,” “presented,” “sponsored,” “made possible by,” etc.).
New Auto-Interp
Negative Logits
School
-0.07
již
-0.07
Processor
-0.07
fibre
-0.07
şehir
-0.07
rowser
-0.06
exploration
-0.06
innocent
-0.06
.bottom
-0.06
everyone
-0.06
POSITIVE LOGITS
typeparam
0.07
م
0.07
plc
0.06
Ư
0.06
.ibm
0.06
StatusLabel
0.06
PLC
0.06
ALI
0.06
"<?
0.06
iht
0.06
Activations Density 0.008%