INDEX
Explanations
phrases related to social interactions where others are kind or supportive towards the subject
references to the pronoun "you."
New Auto-Interp
Negative Logits
ĨĴ
-0.69
Commerce
-0.68
Agriculture
-0.68
tein
-0.64
Course
-0.64
majority
-0.63
Majority
-0.63
ħ
-0.61
Politico
-0.61
WW
-0.61
POSITIVE LOGITS
're
1.18
tub
1.18
guys
1.14
've
0.97
RS
0.93
hei
0.91
'll
0.89
selves
0.78
andering
0.75
kai
0.75
Activations Density 0.112%