    Negative Logits
    Token                 Logit
    " disambiguazione"    -0.75
    "脚注の使い方"        -0.72
    "DockStyle"           -0.68
    "TintMode"            -0.63
    "ftagPool"            -0.63
    "يكب"                 -0.62
    " мәкал"              -0.60
    " pulumi"             -0.60
    "govine"              -0.60
    "enumii"              -0.59
    Positive Logits
    Token                 Logit
    " things"             2.09
    "Things"              1.55
    " Things"             1.48
    "things"              1.45
    " THINGS"             1.41
    " cosas"              1.37
    " coisas"             1.25
    " such"               1.20
    " choses"             1.19
    " Dinge"              1.15
    Activation Density: 0.000%

    No Known Activations