INDEX
Explanations
terms related to a specific religious figure or text
references to religious leaders and texts
New Auto-Interp
Negative Logits
mson
-0.84
Hurricanes
-0.83
auga
-0.79
xit
-0.77
umbnail
-0.76
swick
-0.73
orph
-0.71
oples
-0.71
Gators
-0.71
urable
-0.68
POSITIVE LOGITS
rabb
1.00
rabbi
0.97
ש
0.92
׾
0.92
anyahu
0.91
Rabbi
0.91
Netanyahu
0.86
Torah
0.86
×
0.85
×ķ
0.85
Activations Density 0.047%