INDEX
Explanations
possessive pronouns followed by a verb
pronouns and their correlation with authority or roles in various contexts
New Auto-Interp
Negative Logits
yrinth
-0.73
ablishment
-0.66
externalToEVAOnly
-0.66
nesty
-0.65
BALL
-0.64
pering
-0.63
ixon
-0.63
uph
-0.62
tenance
-0.62
vier
-0.62
POSITIVE LOGITS
newfound
1.20
clout
1.14
own
1.00
imagination
1.00
discretion
0.99
leverage
0.99
expertise
0.96
considerable
0.94
fists
0.90
powers
0.89
Activations Density 0.078%