INDEX
Explanations
mentions of pardons, amnesties, and related legal terms
references to pardons and amnesty
New Auto-Interp
Negative Logits
Bunker
-0.65
opic
-0.65
Connell
-0.64
ritch
-0.63
Fit
-0.63
lass
-0.62
Engineers
-0.62
Princ
-0.61
MAT
-0.60
str
-0.60
POSITIVE LOGITS
pard
1.54
pardon
1.32
amnesty
0.92
20439
0.85
oning
0.83
cies
0.82
oled
0.78
cess
0.77
ardon
0.76
ige
0.76
Activations Density 0.018%