INDEX
    Explanations

    references to trouble or problematic situations

    New Auto-Interp
    Negative Logits
    nez
    -0.17
    merce
    -0.16
    ropa
    -0.16
    .scalablytyped
    -0.15
    εÏģγ
    -0.15
    cribe
    -0.15
    lify
    -0.15
    nis
    -0.15
    pire
    -0.14
    aras
    -0.14
    POSITIVE LOGITS
     trouble
    0.25
     Trouble
    0.23
    Trou
    0.23
     Trou
    0.22
     spots
    0.21
    makers
    0.20
    /conf
    0.20
     spot
    0.19
    /error
    0.19
    maker
    0.19
    Act Density 0.028%

    No Known Activations