Explanations
expressions of arrogance and inflated self-image in individuals
Negative Logits
œurs             -0.52
useContext       -0.43
دانشنامهٔ        -0.39
TableBody        -0.39
tortuga          -0.38
muualla          -0.38
CommonModule     -0.38
utafitiHapana    -0.37
conexao          -0.37
разобра          -0.36
Positive Logits
pride            0.86
proudly          0.82
bragging         0.80
brag             0.80
proud            0.79
boast            0.77
pride            0.77
claim            0.76
arrog            0.75
proud            0.73
Activations Density: 0.315%