INDEX
Explanations
opinions and critiques regarding representation and diversity in media
Negative Logits
总之            -0.46
Espèce          -0.41
nôtre           -0.40
ainfi           -0.39
behulp          -0.38
🔕              -0.38
commandement    -0.38
enumii          -0.37
czł             -0.37
Verwendung      -0.36
Positive Logits
Sure            2.06
Sure            2.02
sure            2.02
yes             1.60
sure            1.59
Granted         1.55
Granted         1.53
Yes             1.41
Yes             1.37
yes             1.25
Activations Density: 0.371%