INDEX
    Explanations

    phrases indicating risk or caution in sports contexts

    Negative Logits
    Token       Logit
    "entic"     -0.08
    "adiens"    -0.07
    "rance"     -0.07
    ".yahoo"    -0.07
    "нед"       -0.07
    "ention"    -0.06
    "åĬĥ"       -0.06
    "thur"      -0.06
    " fond"     -0.06
    "/cop"      -0.06
    Positive Logits
    Token       Logit
    "OnError"   0.07
    " Mo"       0.07
    "ifo"       0.06
    ".mo"       0.06
    "isol"      0.06
    " McL"      0.06
    " fucks"    0.05
    "idar"      0.05
    " mo"       0.05
    "omics"     0.05
    Act Density 0.000%

    No Known Activations