INDEX
Explanations
phrases related to prisoners or wartime situations
references to prisoners in various contexts
Negative Logits
Token    Logit
orp      -0.84
amera    -0.76
orie     -0.71
Boll     -0.68
alore    -0.67
wig      -0.66
ür       -0.65
laus     -0.64
uyomi    -0.63
ories    -0.63
Positive Logits
Token           Logit
prisoners       1.08
captives        0.98
prisoner        0.92
inmates         0.87
detainees       0.83
sentenced       0.80
captive         0.79
hostage         0.79
hostages        0.78
incarcerated    0.77
Activations Density: 0.021%