Explanations
concepts related to capitalism and its effects
Negative Logits
alta        -0.17
inaire      -0.16
electric    -0.15
udder       -0.15
.dds        -0.14
owe         -0.14
bih         -0.14
Winston     -0.14
Lit         -0.14
indo        -0.14
Positive Logits
.production  0.16
elyn         0.15
labour       0.15
PURE         0.14
Alec         0.14
-worker      0.14
THC          0.14
edia         0.14
treadmill    0.14
ilyn         0.14
Activations Density: 0.042%