INDEX
Explanations
the word "cannot" and its variations
phrases expressing inability or impossibility
New Auto-Interp
Negative Logits
bats
-0.79
grounds
-0.70
flame
-0.68
soDeliveryDate
-0.67
Generation
-0.67
ItemThumbnailImage
-0.65
-0.63
estate
-0.63
essen
-0.62
senal
-0.62
POSITIVE LOGITS
afford
1.14
necessarily
1.04
reproduce
0.94
tolerate
0.93
adian
0.93
distinguish
0.88
't
0.86
abide
0.86
survive
0.86
penetrate
0.85
Activations Density 0.029%