INDEX
    Explanations

    non-english or symbols

    New Auto-Interp
    Negative Logits
    ಂಗಳ
    0.60
    pest
    0.59
    行星
    0.56
    vaar
    0.55
    kern
    0.54
    льга
    0.54
    rho
    0.54
    coran
    0.54
    gamma
    0.53
    quinox
    0.53
    POSITIVE LOGITS
    付近
    0.66
     nettement
    0.61
     soltanto
    0.61
     Person
    0.59
     élabor
    0.58
     değil
    0.57
    പര
    0.57
     néanmoins
    0.57
     Controlled
    0.56
     liberté
    0.55
    Act Density 0.000%

    No Known Activations