INDEX
    Explanations

    terms related to sufficiency and disability criteria

    Negative Logits
     مشين               -0.90
    IntoConstraints     -0.88
     autorytatywna      -0.88
     becauſe            -0.84
     TestBed            -0.82
    MessageTagHelper    -0.80
     незавершена        -0.79
    UserScript          -0.78
     fevere             -0.78
    ymm                 -0.77
    Positive Logits
     warrant            0.62
     be                 0.52
     jelent             0.50
     de                 0.50
     war                0.49
     enough             0.49
    omos                0.48
     un                 0.48
     need               0.48
     des                0.47
    Activation Density: 0.224%

    No Known Activations