INDEX
Explanations
religious references and terminology
references to a religious figure or deity
New Auto-Interp
Negative Logits
*/(
-0.78
ickr
-0.71
Accessory
-0.70
issors
-0.70
OPLE
-0.70
VB
-0.69
ãĥ³ãĤ¸
-0.68
misc
-0.65
obs
-0.65
uries
-0.64
POSITIVE LOGITS
Almighty
1.11
Jesus
1.03
Ruler
0.89
Himself
0.88
bless
0.81
lord
0.81
frey
0.80
Jehovah
0.78
forbid
0.76
Krishna
0.76
Activations Density 0.014%