INDEX
Explanations
references to Islamic law and its cultural implications
New Auto-Interp
Negative Logits
Krish
-0.16
Krishna
-0.15
Sender
-0.14
bev
-0.14
orch
-0.13
unte
-0.13
neh
-0.13
quil
-0.13
837
-0.13
gal
-0.13
POSITIVE LOGITS
suk
0.32
Hal
0.28
Suk
0.28
hal
0.27
Hal
0.24
Islamic
0.24
Mud
0.23
ij
0.23
Mur
0.23
Wak
0.22
Activations Density 0.012%