Explanations
The neuron activates on words denoting a small, approximate quantity—most notably the word “few.”
Negative Logits
Token        Logit
perennial    -0.07
Chronicle    -0.07
weblog       -0.06
Lok          -0.06
092          -0.06
.nombre      -0.06
�            -0.06
gregator     -0.06
QP           -0.06
oxy          -0.06
Positive Logits
Token        Logit
a            0.09
-the         0.08
-a           0.07
very         0.07
an           0.07
les          0.07
the          0.07
an           0.06
IL           0.06
mor          0.06
Activations Density: 0.032%
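
The page does not say how the activation density is computed. A minimal sketch, assuming it is the fraction of sampled token positions on which the neuron's activation is above zero; the function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and not taken from the dashboard:

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of positions where the neuron fires
    at all. The dashboard may use a different sample or cutoff.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 10,000 token positions, 3 of which activate.
acts = [0.0] * 10_000
acts[5] = 1.2
acts[4_200] = 0.4
acts[9_999] = 2.1

density = activation_density(acts)
print(f"{density:.3%}")
```

Under this reading, a density of 0.032% would mean the neuron fires on roughly 3 out of every 10,000 tokens, which fits a neuron that responds mainly to one word such as “few.”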