INDEX
Explanations
references to religious figures, especially prophets
references to religious figures, particularly prophets
Negative Logits
Token      Logit
egal       -0.76
emetery    -0.70
rink       -0.69
ensing     -0.69
ucha       -0.69
tten       -0.68
fork       -0.67
anke       -0.65
YC         -0.63
Torn       -0.63
Positive Logits
Token      Logit
Muhammad   0.98
Prophet    0.92
zee        0.82
Mohammed   0.82
Mohammad   0.80
Mohamed    0.79
hammad     0.78
Isaiah     0.78
Quran      0.76
imum       0.76
Activations Density: 0.011%