INDEX
    Explanations

    references to legal or criminal justice settings and actions

    New Auto-Interp
    Negative Logits
     plurality
    -0.17
    witz
    -0.16
     vul
    -0.15
     Nex
    -0.15
     puzz
    -0.15
     Phill
    -0.15
     validity
    -0.15
     Ranch
    -0.14
    semble
    -0.14
    thora
    -0.14
    POSITIVE LOGITS
    ictions
    0.17
     indefinitely
    0.17
    avorite
    0.16
    agy
    0.16
    âĢİ
    0.16
    artifacts
    0.16
    Aid
    0.15
    fighting
    0.15
    enary
    0.15
    meyer
    0.15
    Act Density 4.709%

    No Known Activations