INDEX
Explanations
instances of text relating to political and social issues, including controversies, discrimination, scientific debates, and economic challenges
New Auto-Interp
Negative Logits
Bris
-0.71
Sapphire
-0.69
guiActiveUnfocused
-0.68
indo
-0.65
Bengal
-0.64
iewicz
-0.64
creen
-0.63
confines
-0.63
detached
-0.63
Opera
-0.63
POSITIVE LOGITS
IJ
1.13
ª
1.12
¹
1.12
ł
1.09
Ĵ
1.08
ı
1.07
ij
1.03
£
0.94
³
0.93
certain
0.91
Activations Density 0.144%