INDEX
Explanations
references to religious figures, particularly pastors
references to individuals with the title "pastor."
New Auto-Interp
Negative Logits
urat
-0.83
plets
-0.69
axy
-0.65
EMS
-0.65
thin
-0.64
Carbuncle
-0.62
اÙĦ
-0.62
xp
-0.60
ibaba
-0.60
Petroleum
-0.60
POSITIVE LOGITS
pastor
0.99
Pastor
0.93
esses
0.88
preached
0.87
iffe
0.81
preach
0.80
angel
0.79
ess
0.78
evangel
0.76
angelo
0.76
Activations Density 0.021%