INDEX
Explanations
instances of the word "can" and its variations related to possibility or ability
Negative Logits
Token        Logit
ically       -0.16
irror        -0.16
ought        -0.16
ialect       -0.15
iously       -0.15
atively      -0.15
themselves   -0.14
èm           -0.14
amt          -0.14
itself       -0.14
Positive Logits
Token        Logit
expect       0.25
always       0.24
expect       0.22
always       0.20
bet          0.20
either       0.20
Always       0.19
Expect       0.19
Expect       0.19
certainly    0.18
Activations Density 0.148%