Explanations
words related to power dynamics and control
concepts related to the impact of attention and the effects of societal issues
Negative Logits
Token          Logit
(missing)      -0.65
odox           -0.62
ãĤ´ãĥ³         -0.61
ste            -0.60
PHOTO          -0.60
srfAttach      -0.59
REE            -0.59
Strike         -0.59
dry            -0.58
guiActiveUn    -0.58
Positive Logits
Token          Logit
imaginable     1.05
afforded       0.92
amassed        0.82
accrued        0.79
available      0.77
expended       0.77
accumulated    0.76
conceivable    0.76
necessary      0.75
spawned        0.75
Activations Density 0.681%
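
A density figure like the one above is commonly read as the fraction of sampled tokens on which the feature fires. The sketch below illustrates that reading only. The function name `activation_density`, the zero threshold, and the sample values are all hypothetical, since the source does not state how the figure was computed.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: a token counts as "active" when its activation is strictly
    greater than the threshold (0.0 by default). The dashboard's actual
    definition may differ.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical activations over 8 tokens; two of them are nonzero.
sample = [0.0, 0.0, 1.3, 0.0, 0.0, 0.0, 0.4, 0.0]
density = activation_density(sample)
print(f"Activations Density {density:.3%}")
```

Under this reading, a density of 0.681% would mean the feature is active on roughly 7 of every 1,000 sampled tokens.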