INDEX
    Explanations

    terms related to standards, evaluations, and qualifications in various contexts

    New Auto-Interp
    Negative Logits
    ÙĨدÙĩ
    -0.19
    ibase
    -0.18
    Ùĩار
    -0.15
    aminer
    -0.15
     bằng
    -0.15
    ahat
    -0.15
    ész
    -0.15
     alıp
    -0.15
    avigate
    -0.14
    iale
    -0.14
    POSITIVE LOGITS
     means
    0.33
    means
    0.24
     Means
    0.23
     virtue
    0.22
     alone
    0.21
     analogy
    0.20
     standards
    0.17
     Alone
    0.17
    Means
    0.17
     vote
    0.16
    Act Density 0.180%

    No Known Activations