INDEX
Explanations
phrases related to things being new
repeated phrases emphasizing "new" and "and" in various contexts
New Auto-Interp
Negative Logits
hoe
-0.83
ashtra
-0.81
utterstock
-0.79
ãĤ©
-0.75
SHIP
-0.74
é¾
-0.73
Timeout
-0.68
argon
-0.67
kov
-0.66
oler
-0.66
POSITIVE LOGITS
leans
0.88
shiny
0.72
sund
0.70
angled
0.68
exclusive
0.68
exciting
0.68
refreshed
0.67
worldly
0.67
iations
0.67
debunked
0.65
Activations Density 0.138%