INDEX
    Explanations

    expressions of uncertainty and questions regarding decisions or beliefs

    New Auto-Interp
    Negative Logits
     téléphonique
    -0.58
    extAlignment
    -0.56
    __((
    -0.53
     verdaderas
    -0.50
     himself
    -0.50
    リート
    -0.50
     بيها
    -0.49
     paysage
    -0.49
     scatola
    -0.48
     chiens
    -0.48
    POSITIVE LOGITS
    Portale
    0.71
    ingeki
    0.69
     nahilalakip
    0.66
    ?!?
    0.64
     незавершена
    0.56
    TALL
    0.56
    دانشنامهٔ
    0.55
    ametros
    0.54
     Roskov
    0.54
    ?…
    0.54
    Act Density 0.193%

    No Known Activations