INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    <bos>
    -0.52
     disambiguazione
    -0.51
    GED
    -0.47
    ாம்
    -0.46
     embri
    -0.46
     sure
    -0.45
     Herren
    -0.45
     poste
    -0.45
    Nox
    -0.44
    OneToOne
    -0.43
    POSITIVE LOGITS
    AutoresizingMask
    0.70
    ISupport
    0.66
     CreateTagHelper
    0.59
    PyExc
    0.58
    новниш
    0.54
    tisseur
    0.53
    ScopeManager
    0.53
    NUMX
    0.53
    yl
    0.52
    сшта
    0.52
    Act Density 0.002%

    No Known Activations