Explanations
phrases related to working hard or putting effort into tasks
references to hard work or effort
Negative Logits
Token       Logit
Tot         -0.67
Nest        -0.66
DATA        -0.65
Quarter     -0.63
Kut         -0.63
Sv          -0.63
Salvation   -0.62
Sut         -0.61
Expend      -0.60
Quin        -0.60
Positive Logits
Token       Logit
esley       0.80
itud        0.79
enough      0.79
ened        0.79
balls       0.77
hard        0.73
entimes     0.73
ball        0.72
wired       0.72
harder      0.71
Activations Density: 0.027%