INDEX
Explanations
phrases indicating something is being sacrificed or negatively impacted in favor of something else
references to sacrifices or costs incurred for the benefit of others
New Auto-Interp
Negative Logits
odes
-0.75
Aval
-0.68
arre
-0.67
DIT
-0.67
kens
-0.64
Entry
-0.61
fuzz
-0.61
ynt
-0.60
Nig
-0.60
gran
-0.60
POSITIVE LOGITS
expense
1.05
spared
0.84
iasis
0.81
incurred
0.80
liest
0.80
altar
0.78
ãĥĨãĤ£
0.74
llo
0.72
taxpayers
0.72
rophe
0.71
Activations Density 0.011%