INDEX
    Explanations

    phrases that express requests for feedback or assistance

    NEGATIVE LOGITS
    ENEFITS    -0.51
    ņas    -0.49
    стоин    -0.49
    ņa    -0.49
     häls    -0.48
    neſs    -0.48
    XVI    -0.48
    يلات    -0.47
    junto    -0.46
    USET    -0.46
    POSITIVE LOGITS
    GEBURTSDATUM    0.91
    Diweddarwch    0.85
     betweenstory    0.80
     Gives    0.70
    bewerken    0.68
    Gives    0.66
     चीज़ों    0.62
    Personensuche    0.62
     Giving    0.61
    clusal    0.60
    Activation Density: 0.110%

    No Known Activations