INDEX
Explanations
phrases and terms related to bias and political discourse
Negative Logits
muß          -1.10
läßt         -1.09
idéia        -1.05
متعلقه       -1.02
müßte        -0.96
mußte        -0.94
daß          -0.92
Moslem       -0.91
mußten       -0.91
especiais    -0.87
Positive Logits
impactful    1.05
microbiome   1.04
curated      0.99
leveraging   0.98
timelines    0.98
TLDR         0.98
incentiv     0.96
nuanced      0.96
sourced      0.95
overarching  0.95
Activations Density: 6.160%