INDEX
Explanations
references to challenges faced by marginalized or oppressed groups
New Auto-Interp
Negative Logits
الحره
-0.68
Thess
-0.57
LAZY
-0.56
JspWriter
-0.56
africains
-0.55
Griechenland
-0.54
Agamemnon
-0.54
incompetence
-0.54
հղումներ
-0.53
compétence
-0.52
POSITIVE LOGITS
persecuted
0.71
underground
0.64
political
0.61
escape
0.61
escaping
0.61
escaping
0.61
hiding
0.60
survival
0.58
Underground
0.58
political
0.58
Activations Density 0.402%