INDEX
Explanations
Internal and external factors
The neuron fires on words signaling a distinction between quantitative and qualitative criteria or measures.
New Auto-Interp
Negative Logits
Program
-0.07
[from
-0.07
健康
-0.07
Hunting
-0.07
Simone
-0.06
hunts
-0.06
.Abstract
-0.06
їм
-0.06
mdat
-0.06
through
-0.06
POSITIVE LOGITS
νή
0.06
özelliği
0.06
Legend
0.06
eb
0.06
&B
0.06
satire
0.06
heck
0.06
quir
0.06
filler
0.06
Thirty
0.06
Activations Density 0.035%