INDEX
Explanations
phrases related to religious worship
references to worship and religious practices
New Auto-Interp
Negative Logits
Ale
-0.72
Lans
-0.64
Bagg
-0.64
aunder
-0.63
Stra
-0.62
20439
-0.62
Fif
-0.60
女
-0.60
RW
-0.60
COL
-0.59
POSITIVE LOGITS
worship
1.12
edIn
1.04
eful
0.94
efully
0.92
lication
0.91
eering
0.89
ful
0.89
sylvania
0.88
fulness
0.88
worsh
0.87
Activations Density 0.010%