INDEX
Explanations
contexts and conditions under which specific policies and regulations apply
New Auto-Interp
Head Attr Weights
0:0.01
1:0.05
2:0.06
3:0.03
4:0.01
5:0.04
6:0.07
7:0.03
8:0.04
9:0.09
10:0.06
11:0.45
Negative Logits
herty
-1.36
onen
-1.32
ult
-1.27
dayName
-1.26
hai
-1.24
cigarette
-1.24
brace
-1.24
umn
-1.20
orah
-1.19
achine
-1.18
POSITIVE LOGITS
certific
1.62
marginally
1.53
anymore
1.53
spor
1.51
peanuts
1.42
semblance
1.41
kidding
1.36
limited
1.33
finite
1.32
fraction
1.32
Activations Density 0.324%