INDEX
    Explanations

    words related to legal proceedings such as court cases and charges

    New Auto-Interp
    Negative Logits
    -0.88
     in
    -0.85
     no
    -0.81
    .
    -0.81
     a
    -0.81
     to
    -0.81
    ,
    -0.80
     e
    -0.79
     de
    -0.79
     non
    -0.79
    POSITIVE LOGITS
     alkoh
    2.25
     kask
    2.07
     karton
    2.06
     milano
    2.02
     marte
    2.02
     cannes
    2.01
     silikon
    1.99
     kosme
    1.98
     drap
    1.97
     moza
    1.96
    Act Density 0.229%

    No Known Activations