INDEX
Explanations
titles, ranks, positions, and roles referring to people or entities
phrases that refer to titles and positions of authority
New Auto-Interp
Negative Logits
umerable
-0.83
venants
-0.79
iland
-0.79
jad
-0.75
raints
-0.73
forms
-0.71
lements
-0.71
onga
-0.71
ples
-0.69
itions
-0.69
POSITIVE LOGITS
medi
0.76
holder
0.74
champion
0.73
¥µ
0.71
outsider
0.70
champ
0.69
cellar
0.68
savior
0.68
martyr
0.68
assador
0.66
Activations Density 0.123%