INDEX
Explanations
references to political and philosophical ideas that contrast or prioritize certain concepts such as democracy, ideology, security, and profit
concepts related to societal governance and ethics
New Auto-Interp
Negative Logits
soon
-0.61
oso
-0.59
Catch
-0.58
WATCHED
-0.57
almost
-0.57
hot
-0.56
mma
-0.56
repre
-0.55
ASA
-0.55
+++
-0.53
POSITIVE LOGITS
nor
0.99
anymore
0.96
altogether
0.89
ones
0.81
guiActiveUn
0.79
versa
0.72
necessarily
0.71
itself
0.69
outright
0.69
anything
0.67
Activations Density 0.494%