INDEX
    Explanations

    occurrences of the word "some" in various contexts

    New Auto-Interp
    Negative Logits
    irie
    -0.15
    ped
    -0.15
    umer
    -0.15
    tempt
    -0.14
    uet
    -0.14
    hec
    -0.14
    ned
    -0.14
    ÑĤÑĢи
    -0.14
    çļĦä¸Ģ个
    -0.13
    ed
    -0.13
    POSITIVE LOGITS
    /all
    0.25
    place
    0.24
    许
    0.19
    룬
    0.18
    kind
    0.18
    -times
    0.18
    ones
    0.17
    akin
    0.17
    æł·çļĦ
    0.17
    hw
    0.17
    Act Density 0.097%

    No Known Activations