INDEX
    Explanations

    references to academic metrics or sections in scientific publications

    New Auto-Interp
    Negative Logits
    SSF
    -0.16
    ients
    -0.16
     Pers
    -0.15
    AFX
    -0.15
    IFEST
    -0.14
    spor
    -0.14
    LOUD
    -0.14
    ạng
    -0.14
    ibold
    -0.14
    ̣
    -0.14
    POSITIVE LOGITS
    912
    0.16
    lear
    0.15
    enas
    0.14
    sent
    0.14
    ENU
    0.13
     krb
    0.13
    erp
    0.13
    ayi
    0.13
    dish
    0.13
    ign
    0.13
    Act Density 0.063%

    No Known Activations