INDEX
    Explanations

    uncommon characters or symbols

    negative contractions and various uses of the word "it."

    New Auto-Interp
    Negative Logits
    anwhile
    -0.75
     scatter
    -0.69
     scattering
    -0.68
     gad
    -0.62
     nonexistent
    -0.61
     Saga
    -0.61
    osate
    -0.61
     dangling
    -0.60
     nearest
    -0.59
     muse
    -0.59
    POSITIVE LOGITS
    º
    1.17
    £
    1.12
    ¹
    1.08
    į
    0.95
     âĢº
    0.94
    ı
    0.93
    §
    0.93
    ®
    0.91
    ¬
    0.88
    ¿
    0.88
    Act Density 0.389%

    No Known Activations