INDEX
    Explanations

    references to governmental or economic reforms

    New Auto-Interp
    Negative Logits
    hood
    -0.16
    amber
    -0.16
    kest
    -0.16
     fines
    -0.15
     Äijảo
    -0.15
    ëŀ
    -0.14
    oub
    -0.14
     bạc
    -0.14
    anka
    -0.14
    ãĤ·ãĤ¢
    -0.14
    POSITIVE LOGITS
    atted
    0.24
    ative
    0.23
    ulate
    0.17
    ulating
    0.16
    /add
    0.16
    oul
    0.16
    ulated
    0.16
    /update
    0.15
    ulates
    0.15
    ül
    0.15
    Act Density 0.024%

    No Known Activations