INDEX
Explanations
phrases related to the implementation and consequences of laws or policies
New Auto-Interp
Head Attr Weights
0:0.02
1:0.01
2:0.14
3:0.05
4:0.11
5:0.05
6:0.18
7:0.12
8:0.05
9:0.03
10:0.09
11:0.09
Negative Logits
inventoryQuantity
-1.60
etting
-1.49
igi
-1.45
osphere
-1.41
owl
-1.36
ochemistry
-1.34
olerance
-1.33
isons
-1.33
ilage
-1.32
inent
-1.31
POSITIVE LOGITS
!--
1.45
////
1.42
)=(
1.36
=-=-=-=-
1.33
Deliver
1.32
:]
1.30
mos
1.29
($)
1.29
GMT
1.29
Phase
1.26
Activations Density 0.001%