INDEX
Explanations
phrases related to evaluation or judgment, especially in relation to individuals or their actions
references to opinions and assessments about individuals, particularly in the context of leadership and public perception
New Auto-Interp
Negative Logits
zbollah
-0.81
'/
-0.68
iae
-0.61
pora
-0.60
anguage
-0.58
izo
-0.56
oola
-0.54
imum
-0.54
ather
-0.51
(/
-0.51
POSITIVE LOGITS
him
2.49
him
1.80
his
1.65
HIM
1.63
his
1.54
Him
1.52
His
1.49
he
1.40
He
1.38
His
1.24
Activations Density 1.191%