INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    ounced
    -0.08
     merged
    -0.08
     níl
    -0.08
    routes
    -0.08
     iwọ
    -0.08
     번호
    -0.08
     offens
    -0.08
     vur
    -0.07
    afka
    -0.07
     nij
    -0.07
    POSITIVE LOGITS
     chewing
    0.08
    руп
    0.08
    0.07
    0.07
     proliferation
    0.07
    .defer
    0.07
    ប្រ
    0.07
     breakup
    0.07
     Anchorage
    0.07
     rica
    0.07
    Act Density 0.004%

    No Known Activations