INDEX
Explanations
words related to titles or positions within a hierarchy
phrases indicating positions, titles, or roles someone holds
New Auto-Interp
Negative Logits
raints
-0.75
venants
-0.73
reperto
-0.73
ieties
-0.73
imon
-0.72
ities
-0.72
ento
-0.72
igun
-0.71
isons
-0.70
formulations
-0.69
POSITIVE LOGITS
medi
0.77
owning
0.69
defending
0.68
facilitating
0.67
educating
0.67
course
0.66
assisting
0.66
pretending
0.65
confirming
0.65
usher
0.65
Activations Density 0.167%