INDEX
Explanations
mentions or instances of the word "can" along with a following action or state
phrases expressing capability or potential
Negative Logits
rejection    -0.67
Hunters      -0.64
bats         -0.61
guarding     -0.60
Rising       -0.59
revision     -0.58
Colony       -0.58
Critics      -0.57
danger       -0.57
mol          -0.56
Positive Logits
't           1.53
berra        1.08
NOT          1.07
adian        1.05
¶ħ           0.93
isters       0.91
afford       0.91
atell        0.89
easily       0.89
tell         0.88
Activations Density: 0.117%