Explanations
phrases related to the concept of loss
phrases indicating loss or decrease
Negative Logits
Token       Logit
erity       -0.67
better      -0.67
liest       -0.65
flix        -0.62
aceae       -0.62
gest        -0.61
eus         -0.61
gor         -0.60
downfall    -0.60
Better      -0.60
Positive Logits
Token            Logit
sorts            0.96
hostilities      0.84
consciousness    0.80
Funds            0.79
funds            0.78
limbs            0.74
course           0.72
goods            0.70
disbelief        0.68
fortune          0.66
Activations Density: 0.131%