INDEX
Explanations
references to the Bible
references to religious texts, specifically the Bible
New Auto-Interp
Negative Logits
nesota
-0.88
ickr
-0.83
ority
-0.77
ials
-0.73
details
-0.71
*/(
-0.70
Flavoring
-0.70
ivation
-0.69
ancers
-0.67
ifles
-0.67
POSITIVE LOGITS
scriptures
0.89
patriarch
0.85
Bible
0.85
prophecy
0.83
tracts
0.82
bible
0.80
Bale
0.79
anan
0.79
translator
0.79
tract
0.78
Activations Density 0.008%