INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     palp
    -0.06
    /gin
    -0.06
    _cnt
    -0.06
    .chat
    -0.06
     hazard
    -0.06
     Tale
    -0.06
    item
    -0.06
    _leader
    -0.06
    234
    -0.06
     faction
    -0.06
    POSITIVE LOGITS
     software
    0.11
     Software
    0.08
    ###↵↵
    0.07
     SOFTWARE
    0.07
     sofas
    0.07
    SF
    0.07
     Samantha
    0.07
    ]]↵↵
    0.07
    isay
    0.07
    .S
    0.07
    Act Density 0.015%

    No Known Activations