INDEX
    Explanations

    phrases related to political and legal issues

    New Auto-Interp
    Negative Logits
     undertaking
    -0.65
     4090
    -0.62
    nesty
    -0.61
    ignt
    -0.61
     Hai
    -0.59
    COMPLE
    -0.57
    cellence
    -0.57
    watching
    -0.57
     issu
    -0.56
    ikhail
    -0.56
    POSITIVE LOGITS
    fully
    1.14
     sparing
    1.02
    FUL
    1.02
     techniques
    0.89
    fulness
    0.89
    ful
    0.87
     aliases
    0.86
     condoms
    0.86
     tools
    0.83
     shortcuts
    0.81
    Act Density 3.795%

    No Known Activations