INDEX
Explanations
references to mistakes or errors in various contexts
New Auto-Interp
Negative Logits
ovna
-0.15
ITO
-0.15
actors
-0.14
257
-0.14
enty
-0.14
تز
-0.14
overpower
-0.14
disturbing
-0.13
Durch
-0.13
anvas
-0.13
POSITIVE LOGITS
cost
0.53
cost
0.49
Cost
0.47
-cost
0.43
Cost
0.43
COST
0.42
costs
0.41
_cost
0.39
costing
0.38
.cost
0.36
Activations Density 0.254%