INDEX
Explanations
attends to tokens related to peace and subjects associated with religious or philosophical values from tokens referring to gendered pronouns or entities
New Auto-Interp
Head Attr Weights
0:0.11
1:0.13
2:0.14
3:0.07
4:0.06
5:0.06
6:0.06
7:0.33
Negative Logits
StoryboardSegue
-0.48
كويكب
-0.40
AsUp
-0.40
MigrationBuilder
-0.38
WithIOException
-0.36
"..\..\..\
-0.36
actionMode
-0.35
Meksiku
-0.34
aarrggbb
-0.34
VizieR
-0.33
POSITIVE LOGITS
<_>
0.29
</h6>
0.25
</blockquote>
0.24
éras
0.24
Loh
0.23
getRole
0.23
Shiv
0.22
olta
0.22
隍
0.21
関する
0.21
Activations Density 0.957%