INDEX
Explanations
references to the Bible and its interpretation
New Auto-Interp
Negative Logits
uts
-0.15
eldon
-0.14
arty
-0.14
inal
-0.14
orm
-0.13
odore
-0.13
IPAddress
-0.13
agner
-0.13
BuzzFeed
-0.13
Imam
-0.13
POSITIVE LOGITS
Bib
0.43
bib
0.42
bib
0.40
Bible
0.37
Old
0.37
bible
0.34
Script
0.34
biblical
0.33
script
0.32
passages
0.32
Activations Density 0.236%