Explanations
references to religious concepts and teachings across various faiths
Negative Logits
illard        -0.16
stellen       -0.14
resolver      -0.14
perty         -0.14
Reserved      -0.14
agal          -0.14
ovich         -0.13
ergus         -0.13
oogle         -0.13
akespeare     -0.13
Positive Logits
religion      0.42
religions     0.42
rel           0.40
Rel           0.38
belief        0.35
Religion      0.35
mono          0.35
faith         0.34
REL           0.34
-rel          0.33
Activations Density: 0.242%