INDEX
Explanations
phrases related to legal and political concepts
references to rights and social justice issues
New Auto-Interp
Negative Logits
anecd
-0.64
earable
-0.62
Sample
-0.60
CV
-0.59
largeDownload
-0.59
OVER
-0.57
ODUCT
-0.57
batch
-0.57
prisingly
-0.56
actic
-0.56
POSITIVE LOGITS
tyranny
0.81
immoral
0.80
â̦"
0.79
dignity
0.79
Sharia
0.78
righteousness
0.78
corrupt
0.77
.'"
0.74
caliphate
0.73
disgrace
0.73
Activations Density 2.440%