INDEX
Explanations
phrases related to ongoing issues or problems, particularly in a legal context
New Auto-Interp
Head Attr Weights
0:0.05
1:0.03
2:0.22
3:0.09
4:0.22
5:0.05
6:0.03
7:0.02
8:0.09
9:0.09
10:0.05
11:0.02
Negative Logits
fines
-1.24
conspicuous
-1.12
wine
-1.09
bra
-1.09
stra
-1.06
tumblr
-1.05
societies
-1.05
anwhile
-1.03
berra
-1.03
Drum
-1.03
POSITIVE LOGITS
王
1.49
lishes
1.47
baugh
1.42
:]
1.40
cture
1.34
Prev
1.28
%]
1.23
hement
1.22
pei
1.19
atever
1.17
Activations Density 0.001%