INDEX
Explanations
phrases that describe the time or effort required to accomplish a task
expressions indicating the concept of effort and the time required to achieve something
New Auto-Interp
Negative Logits
tions
-0.70
rebell
-0.66
holm
-0.66
tor
-0.66
nesses
-0.65
entric
-0.63
Patron
-0.63
dor
-0.62
raid
-0.62
eers
-0.61
POSITIVE LOGITS
aways
0.94
aback
0.90
YR
0.82
advantage
0.81
overs
0.80
FINE
0.75
ioxide
0.75
reads
0.75
OVER
0.74
care
0.72
Activations Density 0.088%