INDEX
    Explanations

    phrases related to legal proceedings and consequences

    New Auto-Interp
    Head Attr Weights
    0:0.04
    1:0.02
    2:0.06
    3:0.12
    4:0.05
    5:0.07
    6:0.03
    7:0.03
    8:0.05
    9:0.18
    10:0.22
    11:0.08
    Negative Logits
    Favorite
    -1.12
    -1.04
    ゼウス
    -1.04
    Born
    -1.03
     Entered
    -1.03
    NAME
    -1.03
    UES
    -1.00
     Selected
    -0.97
    ointment
    -0.97
    reetings
    -0.96
    POSITIVE LOGITS
     deterrent
    1.24
     mitigation
    1.20
     mitigating
    1.18
     moot
    1.15
     loopholes
    1.14
     deterrence
    1.13
     anecdotal
    1.13
     hazard
    1.10
     situational
    1.03
     policing
    1.02
    Act Density 1.736%

    No Known Activations