INDEX
    Explanations

    expressions related to personal responsibility and consequences

    Negative Logits
    "uilder"    -0.16
    "egen"      -0.15
    "ìĬ¨"       -0.15
    "avou"      -0.15
    "ãĥ³ãĥĪ"    -0.14
    "meni"      -0.14
    "udev"      -0.13
    "ORY"       -0.13
    "pire"      -0.13
    "782"       -0.13
    Positive Logits
    " place"    1.42
    "place"     1.13
    " Place"    1.12
    "Place"     1.05
    " places"   1.05
    "-place"    1.01
    " PLACE"    1.00
    "_place"    0.94
    " lugar"    0.93
    " Places"   0.87
    Act Density 0.293%

    No Known Activations