INDEX
Explanations
parts of the text that indicate engagement with the content, such as likes or shares
New Auto-Interp
Head Attr Weights
0:0.02
1:0.02
2:0.16
3:0.04
4:0.09
5:0.02
6:0.23
7:0.18
8:0.04
9:0.05
10:0.05
11:0.04
Negative Logits
Judicial
-1.48
scheduling
-1.44
roma
-1.42
erning
-1.32
eln
-1.25
mans
-1.24
stopping
-1.23
appointing
-1.22
jun
-1.18
adjud
-1.17
POSITIVE LOGITS
guiName
1.40
dayName
1.40
stories
1.38
[|
1.32
ideshow
1.31
ILCS
1.29
erald
1.28
vez
1.27
Saban
1.26
attm
1.25
Activations Density 0.000%