INDEX
Explanations
attends to instances of emphasized or important statements from subsequent tokens
New Auto-Interp
Head Attr Weights
0:0.08
1:0.20
2:0.18
3:0.09
4:0.10
5:0.07
6:0.09
7:0.15
Negative Logits
onPostExecute
-0.39
defStyleAttr
-0.37
fromnode
-0.34
PostExecute
-0.34
hoeddwyd
-0.34
Doherty
-0.34
]',
-0.33
>');
-0.32
DockStyle
-0.32
FetchType
-0.32
POSITIVE LOGITS
Waray
0.29
africains
0.26
umani
0.25
jiga
0.24
μένες
0.24
isak
0.24
PYTHON
0.23
дописавши
0.23
humains
0.23
اریخ
0.23
Activations Density 0.050%