INDEX
    Explanations

    information related to winning or achievements

    instances of the word "won" related to achievements or victories

    New Auto-Interp
    Negative Logits
    OTOS
    -0.69
    erity
    -0.68
    assies
    -0.66
    angering
    -0.62
    avia
    -0.62
    bian
    -0.62
     footprint
    -0.61
    appa
    -0.61
    repre
    -0.60
    tools
    -0.60
    POSITIVE LOGITS
    't
    1.21
    itive
    0.82
    ners
    0.72
    ALD
    0.71
    kish
    0.68
    æ©
    0.68
    now
    0.68
    rar
    0.68
     Won
    0.66
    geon
    0.64
    Act Density 0.023%

    No Known Activations