INDEX
    Explanations

    words related to victories and winning outcomes

    Negative Logits
    ipay      -0.17
    581       -0.16
    sian      -0.15
    æħİ       -0.15
    ova       -0.15
    onde      -0.15
    rott      -0.14
    881       -0.14
    ddb       -0.14
    tan       -0.14
    Positive Logits
    nable      0.16
    -win       0.16
    uyu        0.15
    lessly     0.15
    osu        0.15
     poster    0.14
    fulness    0.14
    vie        0.14
    filtr      0.14
     imm       0.13
    Act Density 0.045%

    No Known Activations