INDEX
Explanations
instances where something could have been avoided
instances of the word "avoided" and related concepts
New Auto-Interp
Negative Logits
wow
-0.71
eem
-0.60
dyn
-0.59
vest
-0.58
Shard
-0.57
grant
-0.55
plex
-0.55
morph
-0.55
premiere
-0.54
-0.54
POSITIVE LOGITS
avoided
3.63
dodged
2.13
avoids
2.02
aver
1.88
avoiding
1.71
oided
1.68
avoid
1.68
avoid
1.62
skipped
1.55
prevented
1.51
Activations Density 0.013%