INDEX
Explanations
attends to opinion-related tokens from development-related tokens
New Auto-Interp
Head Attr Weights
0:0.11
1:0.14
2:0.12
3:0.12
4:0.13
5:0.05
6:0.12
7:0.17
Negative Logits
TestingModule
-0.38
дописавши
-0.28
féd
-0.27
theid
-0.24
eloku
-0.24
Външни
-0.23
IUrlHelper
-0.23
protoimpl
-0.23
ولة
-0.23
øv
-0.23
POSITIVE LOGITS
0.28
!';
0.28
تضيفلها
0.26
cillors
0.26
);*/
0.26
مشين
0.26
})`
0.26
CloseOperation
0.25
();*/
0.25
!».
0.25
Activations Density 0.168%