INDEX
    Explanations

    Nothing; Neuron 4 does not activate for any part of the given text, indicating it does not find what it's looking for in the provided examples

    New Auto-Interp
    Negative Logits
    Roaming
    -0.72
    oven
    -0.68
    orno
    -0.66
    knit
    -0.64
    ling
    -0.63
    letal
    -0.62
    elson
    -0.61
     sunscreen
    -0.61
    Capture
    -0.61
    shore
    -0.61
    POSITIVE LOGITS
    ngth
    0.73
    ãĥķãĤ¡
    0.70
    ãĥĩãĤ£
    0.69
    terness
    0.68
    ãĥĸ
    0.67
    ãĥķãĤ©
    0.66
     hyster
    0.65
     Fortress
    0.65
    女
    0.65
    ãĥĦ
    0.65
    Act Density 0.000%

    No Known Activations

    This feature has no known activations.