INDEX
Explanations
phrases related to effort and determination
New Auto-Interp
Negative Logits
concerns
-0.68
\">
-0.66
aca
-0.66
Concern
-0.66
\-
-0.64
SOURCE
-0.64
concern
-0.62
ablishment
-0.61
RIP
-0.60
ufact
-0.60
POSITIVE LOGITS
unsuccessfully
0.98
harder
0.96
hardest
0.92
luck
0.89
trick
0.84
vain
0.83
experiment
0.83
patience
0.74
uden
0.70
futile
0.70
Activations Density 0.073%