INDEX
Explanations
words indicating exclusivity or uniqueness
New Auto-Interp
Head Attr Weights
0:0.08
1:0.08
2:0.09
3:0.08
4:0.08
5:0.08
6:0.08
7:0.07
8:0.08
9:0.08
10:0.07
11:0.09
Negative Logits
soDeliveryDate
-3.52
Prompt
-3.48
Rx
-3.29
Bok
-3.15
Twist
-3.05
LIC
-3.03
Needs
-3.00
Veterinary
-2.99
SY
-2.98
Lyft
-2.94
POSITIVE LOGITS
aven
3.03
rawn
2.59
thickness
2.58
hee
2.58
pan
2.54
ogram
2.50
guarded
2.49
heres
2.48
colo
2.46
arch
2.46
Activations Density 0.000%