INDEX
Explanations
phrases indicating a comparison of increasing quantities or intensities
the repetition of the word "and" indicating a continued list or a buildup of ideas
New Auto-Interp
Negative Logits
aiden
-0.64
afety
-0.63
digy
-0.63
hoe
-0.58
aturday
-0.58
ONSORED
-0.57
dden
-0.57
edom
-0.55
oola
-0.55
ixie
-0.54
POSITIVE LOGITS
rogens
0.90
farther
0.84
more
0.82
clearer
0.81
better
0.81
rogen
0.81
louder
0.78
stronger
0.75
faster
0.74
more
0.73
Activations Density 0.027%