INDEX
Explanations
mentions of legal charges or accusations against individuals
instances of the word "charged" in legal contexts
New Auto-Interp
Negative Logits
Wo
-0.78
birth
-0.73
COR
-0.70
Birth
-0.68
ARCH
-0.66
livest
-0.66
>]
-0.66
Orth
-0.65
ophe
-0.64
angular
-0.63
POSITIVE LOGITS
heet
1.14
llah
0.93
charges
0.91
eters
0.90
criminally
0.80
charging
0.78
charged
0.77
hyde
0.76
indict
0.76
Charg
0.73
Activations Density 0.028%