INDEX
    Explanations

    the word "won" in various contexts

    New Auto-Interp
    Negative Logits
    tures
    -0.18
    yar
    -0.16
    주ëĬĶ
    -0.16
    /Delete
    -0.15
    inem
    -0.15
    egin
    -0.14
    tin
    -0.14
    ials
    -0.14
    datable
    -0.14
     useForm
    -0.13
    POSITIVE LOGITS
    't
    0.41
    ’t
    0.35
    'T
    0.23
    ;t
    0.23
    ´t
    0.22
    def
    0.18
    `t
    0.18
    ked
    0.17
    DER
    0.17
    k
    0.17
    Act Density 0.037%

    No Known Activations