INDEX
Explanations
terms related to physical or mental effort
instances of the word "work" in various contexts related to effort and collaboration
New Auto-Interp
Negative Logits
ylon
-0.83
Bubble
-0.71
Ukrain
-0.71
anamo
-0.70
DragonMagazine
-0.69
netflix
-0.65
Flavoring
-0.65
emonic
-0.65
antha
-0.64
constitu
-0.62
POSITIVE LOGITS
ethic
1.19
flows
0.99
hops
0.95
bench
0.92
horse
0.89
tirelessly
0.87
manship
0.86
heet
0.83
harder
0.83
diligently
0.83
Activations Density 0.073%