INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    Checkbox
    -0.07
    nThe
    -0.07
     GOOD
    -0.07
     ery
    -0.06
    -0.06
     clean
    -0.06
     Closure
    -0.06
    richTextPanel
    -0.06
    zos
    -0.06
    ละ
    -0.06
    POSITIVE LOGITS
     me
    0.07
     them
    0.07
     něj
    0.07
     it
    0.06
    арамет
    0.06
     him
    0.06
    0.06
     Frankfurt
    0.06
    (昭和
    0.06
    }*/↵
    0.06
    Act Density 0.014%

    No Known Activations