INDEX
Explanations
phrases indicating disappearance or absence
instances of the word "gone" and its variations, indicating absence or loss
New Auto-Interp
Negative Logits
emate
-0.62
advertisement
-0.61
inus
-0.61
rel
-0.60
role
-0.60
eering
-0.58
Deity
-0.58
ellation
-0.58
FIG
-0.56
step
-0.56
POSITIVE LOGITS
forever
0.94
overboard
0.85
unnoticed
0.84
bye
0.78
limp
0.78
viral
0.78
slack
0.76
rogue
0.76
feral
0.75
stale
0.72
Activations Density 0.042%