INDEX
    Explanations

    phrases indicating contrast or comparisons

    New Auto-Interp
    Negative Logits
     StatefulWidget
    -0.57
    astă
    -0.56
     Wikimedijinoj
    -0.56
    Népesség
    -0.49
    derabad
    -0.48
    iscope
    -0.47
     Such
    -0.46
     références
    -0.45
     poichè
    -0.45
    grine
    -0.44
    POSITIVE LOGITS
     isso
    1.16
     eso
    1.11
     disso
    0.85
     vậy
    0.85
     itu
    0.84
    นั้น
    0.83
     ello
    0.82
     that
    0.80
     ça
    0.77
     cela
    0.76
    Act Density 0.204%

    No Known Activations