INDEX
Explanations
phrases related to specific forms of disasters or intense situations, often metaphorical
phrases indicating significant themes or concepts linked to existential or societal challenges
Negative Logits
alty        -0.75
ynes        -0.75
idas        -0.75
erion       -0.74
ername      -0.72
untarily    -0.71
arily       -0.70
fter        -0.70
fur         -0.68
quarter     -0.66
Positive Logits
sorts           1.08
proportions     0.86
liberalism      0.84
civilization    0.78
bureaucracy     0.77
contradictions  0.76
capitalism      0.75
emotions        0.74
civilisation    0.73
lies            0.72
Activations Density: 0.176%