INDEX
Explanations
phrases indicating resilience or perseverance
Head Attr Weights
0:0.02
1:0.01
2:0.43
3:0.09
4:0.10
5:0.03
6:0.03
7:0.05
8:0.04
9:0.03
10:0.07
11:0.05
Negative Logits
Vik          -1.60
spokeswoman  -1.60
Samar        -1.45
ocious       -1.43
Lithuania    -1.43
�            -1.41
ritical      -1.41
egu          -1.39
olor         -1.38
luaj         -1.35
Positive Logits
except      2.07
imaginable  1.91
except      1.86
whatsoever  1.62
soever      1.58
plet        1.53
excluding   1.47
Except      1.41
besides     1.41
NW          1.40
Activations Density 0.704%