INDEX
Explanations
references to specific individuals or entities related to the clergy or religious organizations
New Auto-Interp
Negative Logits
ãĤ®
-0.74
κ
-0.70
caution
-0.67
hered
-0.65
fal
-0.64
ki
-0.64
tale
-0.64
ï¸
-0.64
sov
-0.63
gers
-0.63
POSITIVE LOGITS
oser
1.04
osing
0.98
INTON
0.96
avier
0.96
OSED
0.95
ueless
0.93
uster
0.93
amps
0.92
opez
0.92
aughlin
0.90
Activations Density 0.056%