INDEX
Explanations
phrases indicating social hierarchy or status
phrases related to governance and social dynamics
New Auto-Interp
Negative Logits
arton
-0.67
ancies
-0.59
mentioned
-0.57
cies
-0.54
ispers
-0.54
ousands
-0.53
otin
-0.53
anwhile
-0.53
Vengeance
-0.50
oqu
-0.50
POSITIVE LOGITS
fodder
0.71
unto
0.69
affair
0.62
incarn
0.60
discipl
0.58
worthy
0.57
stew
0.57
conduit
0.57
asset
0.55
sleeper
0.55
Activations Density 1.028%