INDEX
Explanations
mentions of legal terms and courtroom proceedings
references to defendants in a legal context
New Auto-Interp
Negative Logits
ories
-0.83
yip
-0.80
ür
-0.80
olen
-0.77
atism
-0.77
eful
-0.75
[|
-0.74
efully
-0.72
UNCH
-0.71
unch
-0.70
POSITIVE LOGITS
pled
0.84
defendants
0.78
defendant
0.75
plead
0.74
Defendant
0.72
accused
0.68
sentenced
0.68
convicted
0.68
contracted
0.66
briefs
0.66
Activations Density 0.014%