INDEX
Explanations
references to quantities and comparisons in various contexts
Negative Logits
Token      Logit
ousse      -0.66
fed        -0.60
Staff      -0.60
tops       -0.58
item       -0.58
urities    -0.57
ensitive   -0.55
pd         -0.55
Desktop    -0.54
REP        -0.54
Positive Logits
Token       Logit
route       0.93
furthe      0.80
bye         0.73
Definitive  0.70
mile        0.69
downhill    0.69
distance    0.69
step        0.66
lengths     0.65
ither       0.64
Activations Density: 0.050%