INDEX
    Explanations

    phrases and concepts related to "winning" in various contexts

    New Auto-Interp
    Negative Logits
    tes
    -0.20
    vore
    -0.16
    /or
    -0.16
    als
    -0.15
    uteur
    -0.15
    EB
    -0.15
    zent
    -0.14
    duct
    -0.14
    kov
    -0.14
    ìĤ¬íķŃ
    -0.13
    POSITIVE LOGITS
    nable
    0.20
    -win
    0.16
    now
    0.16
    throp
    0.14
    amaño
    0.14
    riminator
    0.14
    /win
    0.14
    ingly
    0.14
    NF
    0.13
    agli
    0.13
    Act Density 0.067%

    No Known Activations