INDEX
Explanations
mentions of concepts related to order and structure, such as "order in nature" or "social order"
references to concepts of order and organization
New Auto-Interp
Negative Logits
ãĤ©
-0.75
rolet
-0.73
Nadu
-0.70
vae
-0.70
Miliband
-0.69
cit
-0.68
burgh
-0.67
peria
-0.67
sonian
-0.67
lehem
-0.64
POSITIVE LOGITS
lies
1.41
liness
1.10
eous
0.88
ylum
0.84
etary
0.84
books
0.80
eering
0.75
uria
0.75
able
0.75
ality
0.74
Activations Density 0.025%