INDEX
Explanations
geographic locations
This neuron activates on tokens that are part of place names when listing administrative subdivisions (e.g., the names of communes or municipalities).
New Auto-Interp
Negative Logits
ATTER
-0.07
harma
-0.07
แ
-0.06
inve
-0.06
ासन
-0.06
forecast
-0.06
الخامسة
-0.06
businessman
-0.06
tod
-0.06
Smartphone
-0.06
POSITIVE LOGITS
oenix
0.07
Aunt
0.06
şekilde
0.06
らない
0.06
کاهش
0.06
acích
0.06
.et
0.06
ังคม
0.06
_typeof
0.06
298
0.05
Activations Density 0.028%