INDEX
Explanations
names of individuals or entities
references to power dynamics and personal relationships within a competitive or adversarial context
New Auto-Interp
Negative Logits
specified
-0.87
Regarding
-0.85
ilater
-0.82
artments
-0.81
available
-0.80
GOODMAN
-0.80
Guest
-0.79
AE
-0.76
Update
-0.76
translation
-0.74
POSITIVE LOGITS
mentor
1.14
reputation
1.12
charisma
1.11
charismatic
1.10
legacy
1.09
arrogance
1.07
visionary
1.06
ideals
1.06
ego
1.05
fortunes
1.05
Activations Density 0.470%