INDEX
Explanations
phrases related to change or improvement
phrases related to changing or improving a situation
New Auto-Interp
Negative Logits
gotten
-0.79
chem
-0.68
tatt
-0.64
enegger
-0.63
hip
-0.63
ged
-0.61
laid
-0.61
iston
-0.61
hetical
-0.61
nih
-0.59
POSITIVE LOGITS
wagen
0.84
ichick
0.76
itect
0.75
ruciating
0.74
allery
0.73
ernaut
0.72
erous
0.69
gressive
0.66
rocal
0.65
pter
0.65
Activations Density 0.017%