INDEX
    Explanations

    words with special characters and random strings, making it difficult to determine a specific pattern or theme that the neuron is looking for

    special characters or non-standard symbols in the text

    New Auto-Interp
    Negative Logits
    INAL
    -0.62
    ãĤ¨ãĥ«
    -0.60
    allery
    -0.60
     guiActiveUn
    -0.57
    20439
    -0.54
    ICAN
    -0.53
    untled
    -0.51
    aution
    -0.49
    phen
    -0.48
    UF
    -0.47
    POSITIVE LOGITS
    inki
    0.62
    agall
    0.54
     deserts
    0.49
    itent
    0.48
     cereal
    0.48
     orphans
    0.47
     sil
    0.47
     fame
    0.47
    cery
    0.47
     اÙĦ
    0.47
    Act Density 1.392%

    No Known Activations