INDEX
    Explanations

    elements related to critique and evaluation

    New Auto-Interp
    Negative Logits
     only
    -0.56
     is
    -0.52
     the
    -0.50
    -0.49
    is
    -0.48
     in
    -0.47
     }}">
    -0.47
     minori
    -0.45
    <strong>
    -0.44
    mer
    -0.44
    POSITIVE LOGITS
    +#+#
    0.82
     مشين
    0.78
    NOPQRST
    0.77
    SharedDtor
    0.77
    BibitemShut
    0.73
     للاسماء
    0.73
     дописавши
    0.73
    tvguidetime
    0.72
    BufferException
    0.71
    RetentionPolicy
    0.71
    Act Density 0.343%

    No Known Activations