INDEX
    Explanations

    prepositions indicating location or direction

    phrases indicating advice or suggestions for action

    New Auto-Interp
    Negative Logits
    ĸļ
    -0.63
    coni
    -0.63
    Ó
    -0.56
    ylan
    -0.56
     Gleaming
    -0.56
    ³³³³
    -0.55
    urable
    -0.55
     conced
    -0.55
    andowski
    -0.54
    agog
    -0.54
    POSITIVE LOGITS
     yourself
    1.12
     oneself
    1.04
     yourselves
    0.96
     your
    0.96
    your
    0.86
     Yourself
    0.84
     somew
    0.82
     YOUR
    0.81
     ourselves
    0.81
     somewhere
    0.80
    Act Density 0.328%

    No Known Activations