Explanations
references to Christianity and related terms
Negative Logits
china         -0.19
reet          -0.17
chart         -0.16
ular          -0.16
Christianity  -0.15
gaard         -0.15
sert          -0.15
ustr          -0.15
sole          -0.15
IES           -0.15
Positive Logits
ized          0.25
ity           0.21
izing         0.20
-Muslim       0.20
徒            0.19
like          0.18
etz           0.17
ization       0.17
ize           0.17
Broadcasting  0.16
Activations Density 0.013%