INDEX
Explanations
ideas and phrases related to social and political issues
New Auto-Interp
Negative Logits
tiv
-0.16
olt
-0.16
dlg
-0.15
ارا
-0.14
æŀ
-0.14
rts
-0.14
tük
-0.13
ë¡Ŀ
-0.13
icut
-0.13
anik
-0.13
POSITIVE LOGITS
everywhere
0.16
bash
0.16
so
0.15
oh
0.14
amak
0.14
characteristic
0.14
Bash
0.14
hall
0.14
ohn
0.14
acer
0.14
Activations Density 0.220%