    Explanations

    personal pronouns (him, me, us) followed by actions or movements

    Negative Logits
     Redacción    -0.60
     poveznice    -0.58
     junho        -0.55
     Pued         -0.55
     semblait     -0.55
     renova       -0.54
     Lleg         -0.54
     airpods      -0.54
     ajudá        -0.53
    bonsoir       -0.53
    Positive Logits
     ioe          0.59
     Republics    0.56
     Sepp         0.51
     emirates     0.48
    ropshire      0.47
     Thier        0.47
     peasantry    0.46
    altham        0.46
    .             0.45
     Colla        0.45
    Act Density 0.168%

    No Known Activations