Explanations
references to religious contexts, particularly involving the term "Holy"
Negative Logits
Token            Logit
AccessorTable    -0.86
EconPapers       -0.85
صوتيه            -0.79
الحره            -0.74
مشين             -0.71
httphttps        -0.66
setopt           -0.63
adaptiveStyles   -0.62
dAtA             -0.60
ViewFeatures     -0.56
Positive Logits
Token            Logit
grail            0.67
oke              0.67
Ghost            0.63
rood             0.61
Grail            0.60
Moly             0.60
Ghost            0.59
scriptcase       0.58
moly             0.58
ghost            0.54
Activations Density: 0.177%