INDEX
    Explanations

    It appears that neuron 4 does not activate for any tokens in the provided dataset, suggesting that the neuron is either not functioning correctly or that its specific search criteria were not present in the text

    New Auto-Interp
    Negative Logits
    ciating
    -0.79
    idan
    -0.74
    zai
    -0.73
    76561
    -0.72
     subscrib
    -0.69
    ":[
    -0.68
    Laughs
    -0.68
    ItemThumbnailImage
    -0.67
    inav
    -0.67
    Ĭ±
    -0.67
    POSITIVE LOGITS
     NEO
    0.74
     CONT
    0.69
     Tacoma
    0.68
     Moz
    0.67
     CHO
    0.65
     Cherokee
    0.63
     FTC
    0.62
    TO
    0.61
    fort
    0.60
     âī
    0.60
    Act Density 0.000%

    No Known Activations

    This feature has no known activations.