INDEX
Explanations
mentions of prisoners, particularly in a war or conflict setting
references to prisoners, particularly in the context of war or captivity
New Auto-Interp
Negative Logits
drive
-0.72
lag
-0.71
OPA
-0.71
Boll
-0.68
orie
-0.67
wig
-0.67
Ples
-0.66
Quadro
-0.65
ernaut
-0.64
Beir
-0.64
POSITIVE LOGITS
prisoners
0.92
incarcerated
0.82
inmates
0.82
detainees
0.82
prisoner
0.82
sentenced
0.82
captives
0.77
zees
0.72
freed
0.72
icts
0.70
Activations Density 0.047%