INDEX
Explanations
phrases related to the amount of time a task will require
phrases indicating the duration or process of completing various actions
New Auto-Interp
Negative Logits
david
-0.69
agre
-0.69
Smile
-0.68
holm
-0.68
advertised
-0.64
eers
-0.62
hates
-0.60
Cong
-0.59
iege
-0.58
vine
-0.58
POSITIVE LOGITS
aways
1.01
advantage
0.94
precedence
0.90
care
0.87
overs
0.85
FINE
0.84
aback
0.83
ume
0.80
arnaev
0.78
inka
0.78
Activations Density 0.082%