INDEX
Explanations
opinions or speculations mentioned about a person's actions or qualities
sentiments related to public opinion and political approval
New Auto-Interp
Negative Logits
kaya
-0.61
'/
-0.56
zbollah
-0.51
\":
-0.49
/(
-0.49
clitor
-0.49
byss
-0.49
Variant
-0.48
omorphic
-0.48
esi
-0.48
POSITIVE LOGITS
his
1.82
he
1.65
his
1.58
His
1.58
He
1.40
His
1.38
himself
1.30
He
1.30
HIS
1.26
him
1.24
Activations Density 1.640%