Explanations
phrases relating to Western perspectives and influences
Negative Logits
ArgsConstructor  -0.61
Inſ              -0.60
complish         -0.60
للمعارف          -0.59
Italijanski      -0.59
NewUrlParser     -0.59
__);             -0.58
__))             -0.57
DoubleQuotes     -0.57
fea              -0.57
Positive Logits
متحده         0.57
white         0.49
occidentale   0.48
BuildContext  0.48
white         0.47
              0.47
western       0.47
west          0.47
становника    0.47
Whites        0.47
Activations Density 0.520%