INDEX
    Explanations

    numerical values and their associated representations, particularly in mathematical or scientific contexts

    New Auto-Interp
    Negative Logits
    es
    -0.94
    s
    -0.72
    ersch
    -0.65
    ments
    -0.63
    ES
    -0.61
    ات
    -0.60
     Aless
    -0.59
    est
    -0.59
    پرس
    -0.58
    عر
    -0.58
    POSITIVE LOGITS
    ¹
    1.09
    1.02
    0.98
    0.93
     Roskov
    0.92
    StoryboardSegue
    0.91
    ¹,
    0.90
    0.84
    ³
    0.82
     AssemblyCulture
    0.82
    Act Density 0.001%

    No Known Activations