INDEX
    Explanations

    This neuron activates strongly on occurrences of the token “to”.

    Negative Logits
    Token          Logit
    " aerobic"     -0.06
    "leans"        -0.06
    (blank)        -0.06
    "иск"          -0.06
    "queen"        -0.06
    "<\""          -0.06
    " UIKit"       -0.06
    ".ci"          -0.06
    "/'"           -0.06
    ".unpack"      -0.06
    Positive Logits
    Token          Logit
    (blank)        0.07
    " target"      0.07
    "IntArray"     0.07
    " zipfile"     0.07
    "This"         0.07
    "_supported"   0.07
    " flo"         0.06
    " Sicher"      0.06
    (blank)        0.06
    " Algeria"     0.06
    Activation Density: 0.033%

    No Known Activations