INDEX
Explanations
references or mentions of religious texts, particularly the Bible
references to the Bible
New Auto-Interp
Negative Logits
nesota
-0.82
ickr
-0.78
ials
-0.74
*/(
-0.69
llular
-0.68
starter
-0.68
ority
-0.68
details
-0.66
auder
-0.65
Flavoring
-0.65
POSITIVE LOGITS
Bible
0.91
scriptures
0.88
tracts
0.82
hovah
0.82
prophecy
0.82
Scriptures
0.81
bible
0.79
iblical
0.78
Bale
0.78
tract
0.78
Activations Density 0.011%