INDEX
Explanations
references to religious institutions and figures
New Auto-Interp
Negative Logits
öy
-0.18
etros
-0.15
oload
-0.15
reb
-0.14
ileo
-0.14
Minerals
-0.14
orts
-0.14
ÃŃl
-0.14
Walt
-0.13
Robbie
-0.13
POSITIVE LOGITS
Carm
0.35
Franc
0.33
fri
0.33
Domin
0.27
Sisters
0.27
Dominican
0.27
n
0.27
Fri
0.26
Sales
0.25
Cap
0.25
Activations Density 0.041%