INDEX
Explanations
phrases indicating comparison or contrast
Head Attr Weights
0:0.05
1:0.03
2:0.13
3:0.06
4:0.06
5:0.08
6:0.03
7:0.05
8:0.27
9:0.05
10:0.06
11:0.08
Negative Logits
Nap            -1.69
inav           -1.50
————           -1.50
advertisement  -1.46
ctor           -1.45
harvesting     -1.42
bath           -1.37
Grimm          -1.34
gart           -1.32
child          -1.32
Positive Logits
msec      1.75
stellar   1.54
reon      1.53
Jindal    1.51
consider  1.44
chance    1.42
redit     1.40
riot      1.39
itates    1.39
kHz       1.38
Activations Density 0.001%