INDEX
Explanations
phrases expressing positivity, specifically focusing on "good news."
phrases that express positivity or good news
Negative Logits
edIn          -0.75
ascript       -0.68
ĸļ            -0.68
lished        -0.67
akeru         -0.66
appropriated  -0.66
uilding       -0.65
structed      -0.59
avorite       -0.59
20439         -0.58
Positive Logits
est     0.96
thing   0.95
iest    0.91
liest   0.82
ones    0.79
stuff   0.78
folks   0.77
news    0.76
side    0.76
ol      0.75
Activations Density: 0.114%