INDEX
Explanations
terms related to wealth and economic status
New Auto-Interp
Negative Logits
externalToEVAOnly
-0.76
ãģ¦
-0.67
£ı
-0.66
Blaz
-0.65
apple
-0.64
WER
-0.64
cers
-0.64
Yel
-0.61
thia
-0.61
dq
-0.61
POSITIVE LOGITS
inequality
1.07
redistribution
1.06
disparity
1.03
wealth
1.01
accumulation
1.01
holdings
0.95
disparities
0.94
amassed
0.92
accumulated
0.90
inequalities
0.88
Activations Density 0.010%