INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     kara
    -0.08
     jud
    -0.08
     extrem
    -0.08
     spelling
    -0.07
    Jud
    -0.07
     grand
    -0.07
     ót
    -0.07
     méd
    -0.07
     toes
    -0.07
    tea
    -0.07
    POSITIVE LOGITS
    0.09
     incomes
    0.08
     ì
    0.08
     Reef
    0.07
    Pear
    0.07
     coincidence
    0.07
     thrill
    0.07
    Charg
    0.07
     taxpayers
    0.07
     MILL
    0.07
    Act Density 0.023%

    No Known Activations