INDEX
    Explanations

    comparative phrases that suggest evaluation or judgment

    New Auto-Interp
    Negative Logits
     enige
    -0.66
    -0.54
    клопе
    -0.53
    存于互联网档案馆
    -0.51
    Diwedd
    -0.50
    ांकि
    -0.50
    esModule
    -0.49
    harusnya
    -0.49
     suivants
    -0.48
    ähkö
    -0.48
    POSITIVE LOGITS
     just
    4.19
    just
    3.59
    Just
    3.06
     Just
    2.97
     JUST
    2.68
     juste
    2.67
    JUST
    2.50
     juſt
    2.10
     juft
    2.07
     simply
    1.92
    Act Density 0.919%

    No Known Activations