INDEX
Explanations
attends to positive sentiment expressions from conjunction tokens that connect or contrast statements
New Auto-Interp
Head Attr Weights
0:0.47
1:0.16
2:0.11
3:0.05
4:0.04
5:0.02
6:0.03
7:0.08
Negative Logits
fallu
-0.32
ujednoznacz
-0.31
最快更新
-0.31
modore
-0.30
Programmer
-0.30
FontWeight
-0.30
photolibrary
-0.30
condamné
-0.30
насељу
-0.30
IBRARY
-0.30
POSITIVE LOGITS
nakalista
0.24
breakers
0.23
asteroide
0.22
Referències
0.22
ngOn
0.22
خارجية
0.22
elemField
0.21
Reis
0.21
ithin
0.21
widerrufen
0.21
Activations Density 0.554%