INDEX
Explanations
attends to the relations expressed through specific prepositions or phrases linking concepts and categories from tokens that appear later in the sequence
New Auto-Interp
Head Attr Weights
0:0.10
1:0.11
2:0.43
3:0.05
4:0.04
5:0.04
6:0.04
7:0.13
Negative Logits
✨:
-0.46
AttributeSet
-0.41
insuffisamment
-0.40
Roskov
-0.40
InstanceState
-0.39
########.
-0.37
.*")]
-0.36
فريبيس
-0.35
doInBackground
-0.35
Waray
-0.33
POSITIVE LOGITS
medesimo
0.34
AddTagHelper
0.32
SequentialGroup
0.30
grunn
0.30
awtextra
0.29
Cubit
0.29
plate
0.26
opdracht
0.25
incentive
0.25
WriteTagHelper
0.25
Activations Density 1.093%