INDEX
    Explanations

    almost and essential qualifiers

    New Auto-Interp
    Negative Logits
     важ
    0.44
    0.41
     যেহেতু
    0.41
     చక్క
    0.41
     getWeather
    0.40
     wicht
    0.40
     красиво
    0.40
     lackluster
    0.39
     contrairement
    0.38
     subtly
    0.38
    POSITIVE LOGITS
     almost
    0.79
    almost
    0.75
    extreme
    0.72
     거의
    0.71
     unrecognizable
    0.69
     zelfs
    0.68
     bijna
    0.68
     extreme
    0.67
    ほとんど
    0.66
    几乎
    0.65
    Act Density 0.210%

    No Known Activations