INDEX
Explanations
phrases related to actions or processes, in particular, those associated with causing some form of change
words related to the concept of 'entitlement'
Negative Logits
姫              -0.88
STAR            -0.74
Accountability  -0.72
Jenner          -0.72
士              -0.70
pmwiki          -0.69
Bleach          -0.68
Responsibility  -0.65
Negative        -0.65
龍喚士          -0.63
Positive Logits
ent         1.07
rave        0.99
eering      0.87
inence      0.86
cyclopedia  0.86
igent       0.85
ropy        0.85
ourage      0.84
renched     0.83
urous       0.81
Activations Density 0.004%