INDEX
Explanations
phrases that indicate timing or correlation of events
New Auto-Interp
Head Attr Weights
0:0.01
1:0.02
2:0.09
3:0.19
4:0.02
5:0.03
6:0.04
7:0.06
8:0.05
9:0.26
10:0.06
11:0.11
Negative Logits
alcohol
-1.27
friends
-1.19
UFF
-1.13
Haunted
-1.10
ourage
-1.09
aunted
-1.08
excuses
-1.07
index
-1.07
vious
-1.06
strapped
-1.05
POSITIVE LOGITS
Interstitial
1.78
ieth
1.30
lishes
1.18
ablishment
1.14
◼
1.12
iversal
1.11
Nieto
1.09
yang
1.09
Massacre
1.06
Ramos
1.06
Activations Density 0.006%