INDEX
    Explanations

    forms of the verb "to be."

    New Auto-Interp
    Negative Logits
     herself
    -0.15
    ä½ķ
    -0.14
     Certain
    -0.14
    ohl
    -0.14
    å¯Ħ
    -0.14
     Many
    -0.14
    ÃŃda
    -0.14
    anuts
    -0.14
    UNUSED
    -0.13
    optgroup
    -0.13
    POSITIVE LOGITS
     meant
    0.30
     happening
    0.29
     involved
    0.25
     wrong
    0.24
    Wrong
    0.23
    wrong
    0.23
    difference
    0.22
     difference
    0.21
     happen
    0.21
    included
    0.20
    Act Density 0.035%

    No Known Activations