INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    enen
    -0.71
    zeera
    -0.69
     deserts
    -0.67
     bypassed
    -0.66
     Bax
    -0.66
     Coupons
    -0.65
    AntiForgery
    -0.64
    testng
    -0.64
     modernity
    -0.64
     HSE
    -0.63
    POSITIVE LOGITS
     ruling
    5.19
     rulings
    4.44
     Ruling
    4.06
    ruling
    3.80
     ruled
    3.69
     Rul
    3.00
    ruled
    2.70
    rul
    2.58
     rul
    1.95
     ruler
    1.79
    Act Density 0.068%

    No Known Activations