INDEX
Explanations
names or references related to Islamic culture and figures
New Auto-Interp
Negative Logits
ensch
-0.19
azzi
-0.19
@js
-0.18
Ŀi
-0.16
.scalablytyped
-0.16
PFN
-0.16
λογία
-0.15
jong
-0.15
REFERRED
-0.15
oden
-0.15
POSITIVE LOGITS
гÑĥ
0.16
(s
0.14
de
0.14
buz
0.14
Voc
0.14
.liferay
0.14
et
0.14
battle
0.13
telling
0.13
cat
0.13
Activations Density 0.073%