INDEX
Explanations
verbs related to actions not succeeding or functioning properly
occurrences of the word "work" and its variations in context
Negative Logits
anamo       -0.79
gow         -0.72
ailable     -0.70
unia        -0.69
vows        -0.67
ilings      -0.65
ript        -0.63
Verd        -0.62
ensor       -0.62
archives    -0.61
Positive Logits
heet        1.08
hops        0.90
overtime    0.88
bench       0.86
atically    0.84
synerg      0.81
seamlessly  0.77
flows       0.75
ethic       0.75
harder      0.75
Activations Density: 0.051%