INDEX
Explanations
references to systemic social and economic change efforts
Negative Logits
finity    -0.14
untime    -0.14
fusc      -0.13
ETIME     -0.13
ptal      -0.13
enary     -0.12
ñas       -0.12
النو      -0.12
дом       -0.12
かって    -0.12
Positive Logits
change            0.56
change            0.44
Change            0.42
CHANGE            0.40
-change           0.40
changes           0.39
transformation    0.38
Change            0.38
.change           0.35
_change           0.34
Activations Density 0.234%