INDEX
Explanations
phrases focused on cost and preservation in various contexts
New Auto-Interp
Negative Logits
dev
-0.17
Tig
-0.16
anywhere
-0.16
nic
-0.15
anken
-0.15
asher
-0.15
Nicholson
-0.15
Col
-0.15
ces
-0.14
rein
-0.14
POSITIVE LOGITS
costs
0.23
Costs
0.19
cost
0.18
costa
0.18
fare
0.17
cost
0.17
costo
0.16
Cost
0.16
-cost
0.16
(cost
0.16
Activations Density 0.009%