INDEX
Explanations
references to religious practices and the concept of prayer
Negative Logits
prayer    -0.26
prizes    -0.24
prayers   -0.22
prize     -0.21
pray      -0.21
ombine    -0.18
Prayer    -0.17
prison    -0.17
Prize     -0.16
prar      -0.16
Positive Logits
ful        0.23
fully      0.18
istine     0.17
fulness    0.17
ordial     0.16
onto       0.16
fighter    0.16
atically   0.16
ess        0.15
val        0.15
Activations Density 0.069%