INDEX
Explanations
mentions of the word 'torture'
instances of the word "tort" and its variations, indicating a focus on themes of torture or suffering
New Auto-Interp
Negative Logits
magnification
-0.74
Prospect
-0.73
Darkness
-0.72
Active
-0.67
Berlin
-0.67
Citizens
-0.66
brightest
-0.63
Czech
-0.62
FORE
-0.62
Copenhagen
-0.61
POSITIVE LOGITS
tort
1.33
urous
1.26
oise
1.18
illas
0.98
uous
0.94
isec
0.94
ured
0.94
uring
0.92
Tort
0.92
onga
0.89
Activations Density 0.005%