    Explanations

    words indicating a level of certainty or change in status

    Negative Logits
    ики            -0.17
    kus            -0.16
    еÑĢк           -0.16
    .gdx           -0.15
    apolis         -0.15
    .StackTrace    -0.15
    rai            -0.14
    atürk          -0.14
    avier          -0.14
    .mongodb       -0.14
    Positive Logits
    847       0.16
    pps       0.15
    etsk      0.15
     Boyle    0.15
    iri       0.14
     Jack     0.14
    eg        0.14
    WARDS     0.14
    aring     0.14
     Sammy    0.14
    Activation Density: 0.003%

    No Known Activations