INDEX
Explanations
references to societal beliefs and values related to race and inequality
Negative Logits
يتيمه    -0.89
BufferException    -0.71
Бележки    -0.69
ویکیپدی    -0.67
исленность    -0.60
Rüyada    -0.59
Jeografia    -0.59
CURIAM    -0.58
MigrationBuilder    -0.57
нгред    -0.57
Positive Logits
!!    0.43
!!!    0.41
?!    0.40
?!?    0.39
folks    0.38
!!!    0.38
UnusedPrivate    0.37
!?!    0.37
!!    0.36
&    0.36
Activations Density: 1.496%