INDEX
Explanations
The neuron activates on mentions of populations “below the poverty line,” i.e. phrases indicating percentages living in poverty.
New Auto-Interp
Negative Logits
optic
-0.07
리아
-0.07
posters
-0.07
REQUIRE
-0.07
pit
-0.06
ुभ
-0.06
Ι
-0.06
thinkers
-0.06
hosted
-0.06
er
-0.06
POSITIVE LOGITS
lasc
0.06
:black
0.06
excessive
0.06
вав
0.06
ými
0.06
wonderfully
0.06
клас
0.06
contempt
0.06
nonprofit
0.06
bian
0.06
Activations Density 0.000%