INDEX
Explanations
references to Sharia law
New Auto-Interp
Negative Logits
REDACTED
-0.72
ername
-0.68
Upton
-0.65
GROUND
-0.64
asus
-0.64
Uran
-0.64
insula
-0.63
CRIP
-0.63
é¾įå¥ij士
-0.63
ortium
-0.62
POSITIVE LOGITS
ples
1.24
pton
1.22
pling
1.15
ãĥ£
1.13
ply
1.01
jah
1.01
riors
0.99
pless
0.99
ning
0.96
pering
0.94
Activations Density 0.011%