INDEX
Explanations
mentions of specific proper nouns or entities
vocabulary related to criminal justice and incarceration
New Auto-Interp
Negative Logits
thereof
-0.79
..."
-0.78
20439
-0.75
thereto
-0.74
â̦"
-0.73
respectively
-0.71
è£ı
-0.68
..."
-0.63
#$
-0.62
ãĤ¼ãĤ¦ãĤ¹
-0.61
POSITIVE LOGITS
osate
0.80
Profile
0.76
Advice
0.71
Oversight
0.67
Overview
0.67
Analysis
0.67
Benefits
0.67
Appeal
0.67
Accountability
0.66
pedia
0.65
Activations Density 0.317%