INDEX
Explanations
references to religious concepts, particularly those associated with holiness
Text following "Holy"
"Holy" followed by common phrases
Negative Logits
незавершена      -1.04
RectangleBorder  -0.94
AccessorTable    -0.94
שוליים           -0.89
ViewFeatures     -0.89
صوتيه            -0.87
المعيارى         -0.83
itſelf           -0.83
dAtA             -0.82
setopt           -0.82
Positive Logits
Holy    0.70
Holy    0.67
holy    0.64
HOLY    0.62
shit    0.59
HOLY    0.56
Sh      0.54
mol     0.54
Ground  0.53
j       0.52
Activations Density 0.066%