INDEX
Explanations
references to religious or spiritual figures and themes
New Auto-Interp
Negative Logits
llx
-0.18
MBER
-0.14
PEAT
-0.14
.bio
-0.14
.pkg
-0.14
vod
-0.14
etail
-0.14
ıcı
-0.14
Madonna
-0.14
bum
-0.14
POSITIVE LOGITS
Awake
0.22
Watch
0.20
Witnesses
0.19
Jehovah
0.18
ocratic
0.18
WT
0.18
Watch
0.16
_RENDER
0.15
Witness
0.15
Insight
0.15
Activations Density 0.005%