INDEX
Explanations
attends to auxiliary verbs indicating states or actions from corresponding pronouns or subjects
New Auto-Interp
Head Attr Weights
0:0.25
1:0.37
2:0.11
3:0.03
4:0.04
5:0.04
6:0.04
7:0.08
Negative Logits
}">
-0.35
'}>
-0.34
EconPapers
-0.32
AutoScaleMode
-0.32
كومونز
-0.32
ׁ
-0.31
WebDriver
-0.31
already
-0.30
VersionUID
-0.30
contentLoaded
-0.29
POSITIVE LOGITS
Vidite
0.35
'\\;'
0.33
rzej
0.31
endpush
0.31
@[+][
0.31
IntoConstraints
0.31
bitan
0.30
ască
0.29
nextProps
0.29
+#+
0.29
Activations Density 0.390%