INDEX
    Explanations

    quantities related to weight measurements

    New Auto-Interp
    Negative Logits
    com
    -0.15
    abra
    -0.14
    965
    -0.14
    veau
    -0.14
     com
    -0.14
    ÏĮÏĤ
    -0.13
    eks
    -0.13
     archae
    -0.13
    undy
    -0.13
     wÅĤ
    -0.13
    POSITIVE LOGITS
    mage
    0.18
    IFE
    0.17
    etter
    0.16
    ife
    0.15
    asa
    0.15
    Disposition
    0.15
    nage
    0.15
    _FRE
    0.14
    oftware
    0.14
    filer
    0.14
    Act Density 0.009%

    No Known Activations