INDEX
Explanations
phrases related to rejecting, getting rid of, or discarding something
phrases centered around the concept of "throwing out" or discarding
Negative Logits
APH         -0.80
ÃŁ          -0.77
ND          -0.76
CLA         -0.73
SET         -0.71
Marginal    -0.69
going       -0.68
achine      -0.68
Wem         -0.67
Ë           -0.67
Positive Logits
overboard   0.91
tant        0.80
insult      0.71
blame       0.70
grenades    0.68
Tuls        0.67
poon        0.62
Shal        0.62
rav         0.62
grenade     0.62
Activations Density: 0.110%