INDEX
Explanations
specific commands for the audience to watch certain videos or read specific articles
phrases that indicate recommended viewing or importance
New Auto-Interp
Negative Logits
nowhere
-0.68
abba
-0.66
intent
-0.66
ura
-0.65
recl
-0.64
Midlands
-0.60
wound
-0.60
ibel
-0.60
depressed
-0.60
bleeding
-0.60
POSITIVE LOGITS
VIDEOS
1.09
WATCH
0.93
esome
0.88
Thumbnails
0.86
}}}
0.81
IMAGES
0.80
!]
0.77
WATCH
0.77
ARDS
0.75
WATCHED
0.74
Activations Density 0.008%