INDEX
    Explanations

    The neuron responds to long, multi-syllable technical or formal nouns (often domain-specific terminology) in the text.

    New Auto-Interp
    Negative Logits
    usra
    -0.07
    arsed
    -0.06
     Faces
    -0.06
     النو
    -0.06
    haus
    -0.06
    815
    -0.06
     kuvvet
    -0.06
    воля
    -0.06
     někdy
    -0.05
     citiz
    -0.05
    POSITIVE LOGITS
     aumento
    0.09
     consistency
    0.08
    人気
    0.07
     outfits
    0.07
     rotation
    0.07
    <b
    0.07
     شدن
    0.07
     lanç
    0.07
     tipos
    0.07
    _message
    0.07
    Act Density 0.314%

    No Known Activations