INDEX
Explanations
references to detention facilities or programs
terms related to detention and the experiences of detainees
New Auto-Interp
Negative Logits
clerosis
-0.77
ãĥ£
-0.73
bold
-0.68
soDeliveryDate
-0.64
ider
-0.62
hl
-0.61
hester
-0.60
iosyncr
-0.59
deceive
-0.59
vironment
-0.58
POSITIVE LOGITS
detention
0.94
detain
0.84
anamo
0.83
detained
0.83
detainees
0.78
rans
0.76
ainers
0.75
detainee
0.74
Guantanamo
0.72
rained
0.71
Activations Density 0.029%