    Explanations

    This neuron responds to words and phrases indicating solitary or isolated living situations (e.g., “alone in a shack”).

    Negative Logits
    Token          Logit
    "(task"        -0.07
    " Rh"          -0.06
    "楽し"         -0.06
    " vẫn"         -0.06
    " hostage"     -0.06
    " quienes"     -0.06
    " Joh"         -0.06
    "HashTable"    -0.06
    " userInput"   -0.06
    "Fade"         -0.06

    Positive Logits
    Token          Logit
    " liked"       0.07
    " μέσα"        0.07
    " earn"        0.07
    "مول"          0.07
    "قة"           0.06
    " seventh"     0.06
    " Suppliers"   0.06
    "months"       0.06
    "press"        0.06
    " формування"  0.06

    Activation Density: 0.008%

    No Known Activations