INDEX
Explanations
instances where the word "can" is used
contextual verbs that imply possibility or capability
New Auto-Interp
Negative Logits
Fighter
-0.69
BAL
-0.63
Moz
-0.62
honoring
-0.61
spection
-0.58
edient
-0.58
Mant
-0.58
din
-0.58
Generation
-0.58
Quarter
-0.58
POSITIVE LOGITS
't
1.52
adian
1.24
berra
1.20
isters
1.09
NOT
1.03
ister
0.98
nery
0.93
afford
0.92
easily
0.92
vas
0.92
Activations Density 0.164%