INDEX
Explanations
phrases expressing possibility or speculation
New Auto-Interp
Negative Logits
Dill
-0.65
Cumm
-0.62
tons
-0.61
quickShipAvailable
-0.61
Torch
-0.60
didn
-0.60
nih
-0.59
ducers
-0.57
grounds
-0.56
Bland
-0.56
POSITIVE LOGITS
afford
0.90
speculate
0.85
aspire
0.80
imagine
0.79
assume
0.77
dream
0.77
attain
0.75
survive
0.74
approximate
0.74
opa
0.72
Activations Density 0.055%