Explanations
news headlines with phrases indicating additional information or updates
occurrences of the word "more" or related phrases emphasizing an increase or continuation
Negative Logits
xtap        -0.91
ader        -0.81
uckle       -0.72
keeping     -0.72
wered       -0.69
ãĥ¼ãĥĨ      -0.68
mobi        -0.68
itude       -0.67
itudes      -0.67
ĺħ          -0.67
Positive Logits
than         1.09
HUD          0.79
ado          0.79
interesting  0.78
info         0.75
attractive   0.74
importantly  0.73
detailed     0.72
exciting     0.72
VIDEOS       0.71
Activations Density 0.042%