INDEX
Explanations
statements discussing impartiality or bias
terms related to neutrality and bias in discussions or reports
New Auto-Interp
Negative Logits
Phones
-0.78
clamation
-0.76
âĹ¼
-0.76
Pause
-0.75
RAM
-0.75
trap
-0.75
Requires
-0.74
Extended
-0.72
KER
-0.70
FFER
-0.69
POSITIVE LOGITS
impartial
1.30
unbiased
1.16
observer
1.06
observers
0.92
opinions
0.91
arbit
0.89
iate
0.86
opinion
0.82
ity
0.80
viewpoints
0.79
Activations Density 0.027%