INDEX
Explanations
phrases related to the level of care and effort put into something, as well as mentions of competition and damage
New Auto-Interp
Negative Logits
iland
-0.75
ilet
-0.72
é¾
-0.67
vernight
-0.67
ãĤ´ãĥ³
-0.65
cffffcc
-0.62
uces
-0.62
REL
-0.62
ilaterally
-0.61
dry
-0.60
POSITIVE LOGITS
afforded
0.93
they
0.89
wrought
0.86
inherent
0.80
bestowed
0.80
she
0.75
he
0.74
antry
0.73
involved
0.72
emanating
0.70
Activations Density 0.373%