INDEX
    Explanations

    This neuron activates on occurrences of the word “ancillary.”

    New Auto-Interp
    Negative Logits
    Ò
    -0.06
    ΑΝΤ
    -0.06
     handc
    -0.06
     stared
    -0.06
    'icon
    -0.06
     authToken
    -0.06
    .println
    -0.06
    мовір
    -0.06
     cosy
    -0.06
    CreatedAt
    -0.06
    POSITIVE LOGITS
    0.07
    0.06
    太郎
    0.06
    acomment
    0.06
    Library
    0.06
     alternate
    0.06
    Support
    0.06
    หาย
    0.06
    Tests
    0.06
     functions
    0.06
    Act Density 0.035%

    No Known Activations