INDEX
Explanations
phrases and concepts related to political ideology and ethical discourse
New Auto-Interp
Negative Logits
enumi
-0.56
WebVitals
-0.55
RectangleBorder
-0.52
orianCalendar
-0.50
écnicas
-0.50
réception
-0.50
gynhyrchwyd
-0.49
hülle
-0.47
+#+#
-0.46
xico
-0.46
POSITIVE LOGITS
fairness
0.95
equality
0.94
justice
0.88
tolerance
0.85
democracy
0.85
transparency
0.82
equity
0.81
honesty
0.80
liberty
0.76
liberté
0.75
Activations Density 0.458%