Explanations
References to the concept of effort, or to the amount of effort involved in various activities.
Negative Logits
Token              Logit
ukone              -0.86
estekak            -0.86
:✨                 -0.77
PhysRev            -0.77
AssemblyCulture    -0.76
nahilalakip        -0.74
"}}                -0.73
trise              -0.72
PhysRevD           -0.71
caloosa            -0.69
Positive Logits
Token              Logit
effort             0.97
Effort             0.82
effort             0.80
Effort             0.75
controversy        0.74
contro             0.71
endorsements       0.70
endorsement        0.68
chef               0.64
sweat              0.64
Activations Density: 0.070%