INDEX
Explanations
phrases related to authority or positions of power
references to authority figures or concepts of authority
New Auto-Interp
Negative Logits
Lens
-0.76
kies
-0.76
auder
-0.74
lla
-0.70
irling
-0.69
esta
-0.67
akeru
-0.66
-0.65
Taste
-0.65
bart
-0.65
POSITIVE LOGITS
vested
1.11
delegated
1.05
authority
1.04
conferred
0.89
granted
0.86
exercised
0.86
overseeing
0.84
Reviewer
0.84
figures
0.82
nomine
0.80
Activations Density 0.031%