INDEX
Explanations
words related to actions in both a literal and figurative context
quid pro quo, ends meet, well done, déjà vu
New Auto-Interp
Negative Logits
-0.65
ագրություններ
-0.62
//
-0.61
anyahu
-0.60
رشف
-0.60
findpost
-0.59
MLLoader
-0.59
ſhip
-0.59
HasFactory
-0.58
Paglinawan
-0.58
POSITIVE LOGITS
we
0.34
well
0.33
ajudá
0.32
next
0.31
<bos>
0.30
very
0.29
mid
0.29
comes
0.28
come
0.28
being
0.28
Activations Density 0.039%