INDEX
    Explanations

    abbreviations and titles with periods

    punctuation marks, specifically periods

    New Auto-Interp
    Negative Logits
    enegger
    -0.66
    jah
    -0.64
     Mara
    -0.61
     biome
    -0.59
    oxide
    -0.56
    azeera
    -0.55
    picture
    -0.54
    ãĥķ
    -0.54
    ãĤ´ãĥ³
    -0.54
     emanc
    -0.54
    POSITIVE LOGITS
    ongyang
    0.92
     Lovecraft
    0.80
    ople
    0.76
    ĵĺ
    0.73
    sylvania
    0.67
    ivot
    0.66
    ERSON
    0.66
    terson
    0.65
    cipled
    0.64
    orters
    0.63
    Act Density 0.043%

    No Known Activations