INDEX
    Explanations

    Neuron 4 shows no clear activation pattern: it does not activate on any of the provided tokens. With no non-zero activations to analyze, we cannot determine what this neuron responds to.

    Negative Logits
     cou
    -0.77
     Raider
    -0.70
     Hunters
    -0.70
     merc
    -0.64
     Kardash
    -0.64
     recess
    -0.63
    rogen
    -0.63
    obin
    -0.63
    heid
    -0.59
    yrinth
    -0.58
    Positive Logits
    Offline
    0.71
    Best
    0.69
     Cosponsors
    0.69
     RELE
    0.67
    )</
    0.66
    mob
    0.65
    nces
    0.64
    affe
    0.62
    airo
    0.61
    Operation
    0.61
    Act Density 0.000%

    No Known Activations

    This feature has no known activations.