Explanations
others have it worse
The neuron activates on comparative mentions of others being worse off: phrases where the text says someone “has it worse,” has “more” severe problems, and so on.
Negative Logits
Token        Logit
masses       -0.07
/User        -0.07
aims         -0.06
cursed       -0.06
Val          -0.06
Args         -0.06
flex         -0.06
compiling    -0.06
धर           -0.06
selection    -0.06
Positive Logits
Token        Logit
INUE         0.07
ZIP          0.07
ivní         0.07
'">'         0.06
Coil         0.06
Mobil        0.06
cpt          0.06
tane         0.06
材           0.06
unrealistic  0.06
Activations Density 0.067%