INDEX
    Explanations

    symbols, punctuation, and specific numerals

    New Auto-Interp
    Negative Logits
    igs
    -0.16
    loor
    -0.16
    aginator
    -0.15
    scoped
    -0.14
    aptop
    -0.14
     çĽ
    -0.14
    agers
    -0.14
    Returned
    -0.14
    عÙĪØ¯
    -0.14
    ivic
    -0.14
    POSITIVE LOGITS
     bio
    0.17
    012
    0.14
    bio
    0.14
    371
    0.14
     Uploaded
    0.14
    989
    0.13
    weg
    0.13
    ÙĦÙĬÙĦ
    0.13
     ruk
    0.13
     Bio
    0.13
    Act Density 0.043%

    No Known Activations