INDEX
    Explanations

    Neuron 4 does not activate on any of the provided tokens. It may be looking for something that is not present in the provided text excerpts, or it may be malfunctioning or inactive.

    Negative Logits
    "latest"       -0.72
    "ache"         -0.72
    " awa"         -0.68
    "dylib"        -0.67
    "notations"    -0.67
    "++)"          -0.66
    " residues"    -0.64
    "google"       -0.64
    "soever"       -0.63
    "Ĥİ"           -0.63
    Positive Logits
    " Watt"        0.73
    "rouse"        0.71
    "nect"         0.68
    "uras"         0.68
    "ner"          0.67
    " Buk"         0.62
    "aan"          0.62
    "ners"         0.62
    "slave"        0.60
    "rament"       0.59
    Act Density 0.000%

    No Known Activations

    This feature has no known activations.
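
An activation density of 0.000% is what one would expect when a feature never fires on the sampled tokens. The sketch below illustrates this, assuming that density means the fraction of token positions whose activation exceeds a threshold. That definition, the function name `activation_density`, and the sample data are illustrative assumptions, not the dashboard's actual implementation.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumed definition for illustration only; the dashboard may compute
    its density figure differently.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: a feature that never fires, and one that fires once.
dead = [0.0] * 1000
sparse = [0.0] * 999 + [2.5]

print(f"{activation_density(dead):.3%}")    # 0.000%
print(f"{activation_density(sparse):.3%}")  # 0.100%
```

Under this assumed definition, a feature with no recorded activations reports a density of exactly zero, which matches the "No Known Activations" status shown above.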