INDEX
Explanations
phrases related to putting in effort or labor towards a goal
references to effort and the value associated with it
Negative Logits
Passage       -0.79
Places        -0.72
Dealer        -0.70
Wrap          -0.69
Kou           -0.66
passages      -0.65
warehouses    -0.65
berth         -0.64
Solitaire     -0.61
Britann       -0.60
Positive Logits
effort        1.07
lessness      0.98
ful           0.96
lessly        0.94
less          0.89
ional         0.89
hered         0.86
ary           0.84
arily         0.81
ting          0.81
Activations Density: 0.032%
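
As a rough illustration of what a density figure like this usually measures, the sketch below computes the fraction of sampled token positions on which a feature's activation is above zero. This is an assumption about the metric, not something the page states: the function name activation_density, the zero threshold, and the sample data are all hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of activations strictly above `threshold`.

    Assumption: "density" means the share of sampled token positions
    where the feature fires at all. The source does not define it.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 10,000 token positions, 3 of them active.
sample = [0.0] * 9997 + [1.2, 0.4, 2.5]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.032% would mean the feature fires on roughly 3 of every 10,000 sampled tokens, which fits a narrow feature tied to "effort" and its suffixes.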