INDEX
Explanations
references to Iranian political figures and institutions
Negative Logits
hq              -0.17
.scalablytyped  -0.15
ây              -0.15
Ø·ÙĦ            -0.15
inke            -0.14
llib            -0.14
Tuy             -0.14
hir             -0.14
Fork            -0.14
wend            -0.14
Positive Logits
Binder          0.15
į               0.15
Ber             0.14
vac             0.14
grade           0.14
ارÙĩ            0.14
MainFrame       0.14
Grade           0.14
inas            0.13
otle            0.13
Activations Density: 0.015%