INDEX
Explanations
numbers, quantities, and comparisons
negations or phrases indicating limitations or specificity
New Auto-Interp
Negative Logits
ussions
-0.82
lb
-0.62
unsuccessfully
-0.62
acas
-0.61
rams
-0.61
ounces
-0.60
osures
-0.59
Remain
-0.59
Disk
-0.59
Alam
-0.57
POSITIVE LOGITS
soDeliveryDate
0.85
gonna
0.79
iour
0.75
funny
0.72
TY
0.70
eyebrow
0.70
kinda
0.69
gotta
0.69
ðŁij
0.69
ðŁ
0.69
Activations Density 0.319%