INDEX
    Explanations

    the use of the verb "to be" in various forms

    New Auto-Interp
    Negative Logits
    å®ĥ
    -0.20
    ä¸Ģ个
    -0.18
    It
    -0.17
    çļĦä¸Ģ个
    -0.17
    osi
    -0.17
    ä¸Ģ个人
    -0.16
    Anything
    -0.16
    ä¸ĢåĢĭ
    -0.15
     Anything
    -0.15
    (it
    -0.15
    POSITIVE LOGITS
     they
    0.28
     we
    0.26
     these
    0.25
    /w
    0.25
     those
    0.23
    tha
    0.22
    nt
    0.21
     они
    0.20
    ady
    0.20
    /is
    0.19
    Act Density 0.043%

    No Known Activations