INDEX
Explanations
terms related to social and economic dynamics
New Auto-Interp
Negative Logits
isation
-0.18
riot
-0.17
ipation
-0.17
ensation
-0.17
IZATION
-0.17
ÃŃrk
-0.17
ulations
-0.17
actionDate
-0.17
endon
-0.17
atisation
-0.17
POSITIVE LOGITS
ise
0.45
ize
0.43
ze
0.36
ate
0.33
ify
0.30
IZE
0.25
itize
0.24
ISE
0.23
strate
0.23
en
0.23
Activations Density 0.039%