INDEX
Explanations
mentions of the name "Christ" or related terms
Negative Logits
umi          -0.17
sob          -0.17
urious       -0.16
OUS          -0.16
hattan       -0.16
vanished     -0.15
icios        -0.15
KNOWN        -0.15
clamation    -0.14
ecal         -0.14
Positive Logits
ensen        0.32
opher        0.29
ophe         0.26
church       0.24
enson        0.24
oper         0.23
iane         0.21
offer        0.21
ansen        0.20
MAS          0.19
Activations Density: 0.009%