    Explanations

    references to numerical values or measurements

    Negative Logits
    Token            Logit
    " sens"          -0.52
    " lin"           -0.47
    " ocup"          -0.45
    " touch"         -0.45
    "alski"          -0.45
    " angin"         -0.44
    " ma"            -0.44
    " menu"          -0.44
    "rescu"          -0.44
    " inflation"     -0.43

    Positive Logits
    Token            Logit
    "uxxxx"           0.94
    " ویکی‌پدیا"       0.93
    "帖最后由"         0.75
    "pexpr"           0.75
    "Obrázky"         0.71
    "TintMode"        0.70
    "انيف"            0.70
    "MockBean"        0.68
    " viață"          0.67
    "Diwedd"          0.67

    Activation Density: 0.050%

    No Known Activations