INDEX
Explanations
mentions of working on various projects
instances of the word "worked."
Negative Logits
Token       Logit
ylon        -0.67
arta        -0.66
gran        -0.64
Bol         -0.63
idium       -0.63
ustomed     -0.63
Mae         -0.62
thur        -0.62
Tian        -0.62
venerable   -0.61

Positive Logits
Token       Logit
bench       0.94
hops        0.93
worked      0.88
hirt        0.88
ethic       0.86
collabor    0.85
heet        0.79
overtime    0.76
arrang      0.75
baugh       0.74
Activations Density: 0.021%