INDEX
Explanations
information on news articles and must-watch videos containing important or urgent topics
phrases emphasizing the urgency or importance of watching specific content, particularly videos
Negative Logits
intent           -0.69
bats             -0.67
lured            -0.66
brainstorm       -0.65
academia         -0.64
ura              -0.64
metro            -0.62
nowhere          -0.61
redevelopment    -0.61
plagiar          -0.61
Positive Logits
WATCH             1.13
VIDEOS            1.02
Thumbnails        0.86
WATCHED           0.86
Watching          0.81
esome             0.80
ARDS              0.80
CLASSIFIED        0.79
!/                0.76
rou               0.75
Activations Density: 0.008%
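
A rough sketch of how a density figure like this could be computed, assuming it means the share of sampled tokens on which the feature's activation is above zero. The source does not define the term, so that reading, the function name activation_density, the threshold parameter, and the sample data are all hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of tokens whose activation exceeds threshold.

    Assumption: "density" is the fraction of tokens on which the feature
    fires. The source page does not state its exact definition.
    """
    if not activations:
        return 0.0
    firing = sum(1 for a in activations if a > threshold)
    return 100.0 * firing / len(activations)


# Hypothetical sample: the feature fires on 8 of 100,000 tokens.
sample = [0.0] * 99_992 + [1.5] * 8
density = activation_density(sample)
print(f"{density:.3f}%")
```

Under that reading, a density of 0.008% would correspond to roughly 8 firing tokens per 100,000, which fits a feature tied to a narrow set of "watch this video" phrasings.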