INDEX
    Explanations

    the word "by" indicating action or agency in sentences

    Negative Logits
    voj       -0.06
    allen     -0.06
    ussen     -0.06
    nts       -0.06
    aight     -0.06
    ught      -0.06
    æĺ¯åIJ¦    -0.06
     isize    -0.06
    ughter    -0.06
    izoph     -0.06
    Positive Logits
    edException   0.08
    opic          0.07
    ared          0.07
    alion         0.07
    iba           0.06
    /slick        0.06
    PEC           0.06
    rost          0.06
    olan          0.06
    idi           0.06
    Activation Density: 0.054%

    No Known Activations