INDEX
Explanations
references to seasons and episodes of television shows
New Auto-Interp
Head Attr Weights
0:0.03
1:0.02
2:0.08
3:0.06
4:0.05
5:0.06
6:0.03
7:0.05
8:0.04
9:0.08
10:0.27
11:0.18
Negative Logits
soType
-1.18
behavi
-1.12
igslist
-1.07
itarian
-1.07
rencies
-1.04
orgetown
-1.03
utsche
-1.03
��
-0.97
extradition
-0.97
straw
-0.95
POSITIVE LOGITS
Preview
1.34
numbered
1.30
isode
1.23
odcast
1.19
storyline
1.13
previews
1.11
FINAL
1.11
��
1.08
Theme
1.07
preview
1.05
Activations Density 0.137%