    Explanations

    negative values or indications of invalid settings

    Negative Logits
    -    -0.83
    2    -0.80
    1    -0.77
    ,    -0.72
    D    -0.71
    3    -0.70
    5    -0.69
    T    -0.69
    ly    -0.69
    6    -0.68
    Positive Logits
     Мексичка    1.07
     Савезне    1.06
    djangoproject    1.00
     autorytatywna    1.00
     للمعارف    0.94
    uxxxx    0.91
    GEBURTS    0.90
     виправивши    0.88
    LabelTagHelper    0.87
    ########.    0.87
    Activation Density: 0.304%

    No Known Activations