INDEX
    Explanations
    NEGATIVE LOGITS
     our
    -1.07
     we
    -0.87
    我们可以
    -0.83
    GARY
    -0.81
    -0.79
    普及
    -0.77
     TCG
    -0.77
     nuestro
    -0.76
     μας
    -0.76
    ׇ
    -0.76
    POSITIVE LOGITS
     federal
    0.98
    ORDER
    0.95
    &(
    0.91
    üman
    0.90
     Adventure
    0.88
    currently
    0.88
     ファミ
    0.87
    phonic
    0.83
    zsef
    0.83
     gebre
    0.82
    Activation Density 0.120%

    No Known Activations