INDEX
    Explanations

    Descriptions

    This neuron activates on simile constructions—especially comparisons introduced by “like a” (e.g. “like a game of Simon,” “like a universal remote control,” etc.).

    New Auto-Interp
    Negative Logits
    だろう
    -0.07
    (arguments
    -0.07
    Administrator
    -0.06
     duplication
    -0.06
    يم
    -0.06
    -filled
    -0.06
     беременности
    -0.06
    альная
    -0.06
     terrifying
    -0.06
    Jones
    -0.06
    POSITIVE LOGITS
    -aligned
    0.07
    Likes
    0.07
     hedge
    0.07
     gle
    0.06
    World
    0.06
    ़क
    0.06
    Shares
    0.06
    Chapter
    0.06
    yyy
    0.06
    0.06
    Act Density 0.042%

    No Known Activations