INDEX
Explanations
phrases that express religious belief and divine influence
New Auto-Interp
Negative Logits
itter
-0.18
yer
-0.14
zh
-0.14
onsense
-0.13
Lance
-0.13
("(%-0.13
Epic
-0.13
βα
-0.13
bob
-0.13
luck
-0.13
POSITIVE LOGITS
.nih
0.14
ANTE
0.14
otate
0.14
addons
0.14
entin
0.14
indeb
0.13
ī
0.13
ever
0.13
ableObject
0.13
Camden
0.13
Activations Density 0.158%