Explanations
This neuron primarily detects mentions of pickup trucks and similar truck model names.
Negative Logits
Token	Logit
贝	-0.07
(QL	-0.07
ΙΚ	-0.07
developing	-0.07
เบ	-0.07
підприємства	-0.07
эт	-0.07
랍니다	-0.06
�	-0.06
være	-0.06
Positive Logits
Token	Logit
=time	0.07
failures	0.07
divides	0.06
정확	0.06
-duty	0.06
_SCHEDULE	0.06
.HasPrefix	0.06
	0.06
abuses	0.06
probation	0.06
Activations Density: 0.006%