INDEX
Explanations
terms related to physical exercise, specifically focusing on weightlifting and strength training
mentions of benches
Negative Logits
alez          -0.74
lia           -0.73
yles          -0.69
actionDate    -0.69
pires         -0.68
lers          -0.66
mber          -0.66
ually         -0.66
odi           -0.66
Rings         -0.65
Positive Logits
warm          1.15
tops          0.86
mark          0.86
lain          0.85
marked        0.84
benches       0.82
top           0.79
iron          0.79
ing           0.78
warrants      0.77
Activations Density: 0.037%