INDEX
Explanations
mentions of legal actions such as pardon, deportation, and freeing individuals from legal consequences
terms associated with legal pardons and deportations
New Auto-Interp
Negative Logits
pants
-0.78
acent
-0.69
oken
-0.67
SEM
-0.66
Dunk
-0.66
mop
-0.63
ideshow
-0.61
bra
-0.59
ibaba
-0.59
opers
-0.59
POSITIVE LOGITS
pard
0.89
pardon
0.86
revocation
0.81
vous
0.77
deported
0.77
chwitz
0.76
20439
0.75
give
0.74
itives
0.73
ilege
0.73
Activations Density 0.090%