INDEX
Explanations
phrases related to agreements and negotiations
New Auto-Interp
Head Attr Weights
0:0.05
1:0.01
2:0.11
3:0.19
4:0.02
5:0.30
6:0.01
7:0.08
8:0.05
9:0.02
10:0.09
11:0.01
Negative Logits
ointed
-2.07
huh
-1.97
htaking
-1.91
mort
-1.91
overe
-1.87
doub
-1.87
�
-1.86
inventoryQuantity
-1.86
soType
-1.86
fools
-1.82
POSITIVE LOGITS
wavelengths
1.97
improved
1.96
improvements
1.85
Skies
1.83
Width
1.80
Athletics
1.79
southwestern
1.76
intensity
1.75
improve
1.75
habitats
1.71
Activations Density 0.656%