INDEX
    Explanations
    Negative Logits
     football   -0.07
     religion   -0.07
     decides    -0.07
    BALL        -0.07
    Hell        -0.07
     secretary  -0.07
    ibri        -0.06
     Consider   -0.06
    except      -0.06
    Ctr         -0.06
    Positive Logits
     Lisp       0.07
    adaki       0.06
    医院        0.06
     các        0.06
    ']);        0.06
     multif     0.06
     lace       0.06
     itemType   0.06
    /gui        0.06
    BACKGROUND  0.06
    Act Density 0.001%

    No Known Activations