INDEX
Explanations
phrases related to legal or judicial contexts
New Auto-Interp
Head Attr Weights
0:0.01
1:0.01
2:0.10
3:0.07
4:0.13
5:0.02
6:0.03
7:0.39
8:0.03
9:0.04
10:0.08
11:0.04
Negative Logits
Offline
-1.38
cens
-1.23
except
-1.22
eff
-1.18
deterior
-1.18
calmly
-1.15
kept
-1.12
��
-1.11
rity
-1.10
tolerated
-1.09
POSITIVE LOGITS
irgin
1.35
ader
1.17
Shad
1.15
DOI
1.13
itan
1.12
sovere
1.12
Ranger
1.10
Horizons
1.09
Franch
1.09
Flavoring
1.09
Activations Density 0.585%