INDEX
    Explanations

    references to historic or geographical entities

    New Auto-Interp
    Negative Logits
     ſte
    -0.81
     iſt
    -0.73
     Monfieur
    -0.73
     itſelf
    -0.72
     faſt
    -0.70
     raiſ
    -0.69
     ſever
    -0.69
     Eſ
    -0.69
     Jefus
    -0.69
    ברס
    -0.69
    POSITIVE LOGITS
    endregion
    0.66
     تانيه
    0.62
     relaj
    0.62
     Dunn
    0.59
     Sleeps
    0.57
     Goodwin
    0.56
    yanto
    0.56
    Aqua
    0.56
     Gruber
    0.56
    cox
    0.55
    Act Density 0.038%

    No Known Activations