INDEX
Explanations
punctuation marks, specifically the comma
New Auto-Interp
Head Attr Weights
0:0.13
1:0.05
2:0.01
3:0.14
4:0.22
5:0.08
6:0.06
7:0.03
8:0.12
9:0.05
10:0.02
11:0.03
Negative Logits
)))
-2.27
��
-2.25
)))
-2.25
ander
-2.22
��
-2.20
ILCS
-2.18
ngth
-2.10
"))
-2.09
essor
-2.08
��
-2.07
POSITIVE LOGITS
Lyft
1.97
preseason
1.85
startups
1.85
Hats
1.83
premie
1.81
Philly
1.80
Jays
1.79
incentives
1.78
payday
1.77
rewards
1.77
Activations Density 0.000%