INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     mijne
    -0.36
     disambiguazione
    -0.34
    เป
    -0.33
     tovább
    -0.30
    dword
    -0.30
     kjem
    -0.30
     jagung
    -0.30
    ervlak
    -0.29
     kilometres
    -0.29
     vym
    -0.29
    POSITIVE LOGITS
    Lifting
    1.48
     lifted
    1.47
     lifting
    1.45
     lift
    1.43
     Lifting
    1.41
     Lift
    1.38
    Lift
    1.38
     lifts
    1.36
    lift
    1.32
    lifted
    1.28
    Act Density 0.023%

    No Known Activations