INDEX
    Explanations

    references to power plays in hockey

    Negative Logits
    .uc           -0.17
    istrovství    -0.16
    aira          -0.15
     lick         -0.14
    ToWorld       -0.14
    یان           -0.13
    義            -0.13
    ジア          -0.13
    瑟            -0.13
    .Interop      -0.13

    Positive Logits
    ři            0.16
    utom          0.15
    gle           0.15
    uder          0.14
    rial          0.14
    jen           0.14
    allon         0.13
    ouz           0.13
    ubar          0.13
    gui           0.13
    Act Density 0.010%

    No Known Activations