INDEX
Explanations
attends to constructs related to distinction or contrast from similar constructs appearing later in the sequence
New Auto-Interp
Head Attr Weights
0:0.09
1:0.10
2:0.10
3:0.21
4:0.26
5:0.03
6:0.09
7:0.09
Negative Logits
car
-0.22
istle
-0.21
Car
-0.21
sem
-0.21
ling
-0.21
ycle
-0.21
لاء
-0.21
rotto
-0.21
nergy
-0.21
无
-0.20
POSITIVE LOGITS
WriteBarrier
0.57
AccessorTable
0.43
tonode
0.40
parsedMessage
0.40
ErrIntOverflow
0.40
Diwedd
0.40
autorytatywna
0.39
propOrder
0.39
verifyException
0.39
UnusedPrivate
0.39
Activations Density 0.597%