INDEX
Explanations
terms related to societal and institutional reform
New Auto-Interp
Negative Logits
Enlarge
-0.06
drastic
-0.06
амеÑĤ
-0.06
opoulos
-0.06
lep
-0.06
879
-0.06
stras
-0.06
âĢĥ
-0.05
tired
-0.05
 
-0.05
POSITIVE LOGITS
BITTE
0.08
Erotische
0.07
cript
0.07
tractive
0.07
indow
0.07
autiful
0.07
ÅĽmy
0.07
/Peak
0.07
ofday
0.07
auth
0.07
Activations Density 0.000%