INDEX
    Explanations

    phrases related to facing consequences or potential trouble, often in a legal context

    phrases referring to legal consequences or penalties

    Negative Logits
    "ucky"           -0.78
    " Paste"         -0.69
    " entit"         -0.66
    "rous"           -0.66
    "aneous"         -0.64
    "rust"           -0.64
    ">>\"            -0.64
    " newsletter"    -0.63
    "ary"            -0.62
    "rap"            -0.60
    Positive Logits
    " face"          0.93
    " faces"         0.89
    "nces"           0.86
    " Faces"         0.79
    "face"           0.77
    "faces"          0.75
    " faced"         0.74
    "Face"           0.73
    " Face"          0.73
    "metics"         0.73
    Act Density 0.026%

    No Known Activations