INDEX
    Explanations

    nationalities

    The neuron activates on nationality descriptors (e.g. “Irish,” “French,” “Australian,” “Dutch”).

    Negative Logits
    Token         Logit
    ".VAL"        -0.07
    "ốn"          -0.07
    " تت"         -0.06
    " GD"         -0.06
    " HDC"        -0.06
    " Под"        -0.06
    " کمتر"       -0.06
    "Nd"          -0.06
                  -0.06
    " ид"         -0.06

    Positive Logits
    Token         Logit
    " />↵↵"       0.07
    "’nın"        0.07
    "(equalTo"    0.07
    "	success"    0.07
    "	"           0.06
    ">)."         0.06
    " _"          0.06
    " стак"       0.06
    " plt"        0.06
    "_coeff"      0.06

    Activation Density: 0.035%

    No Known Activations