INDEX
Explanations
phrases that lead into video content or call for continued engagement
New Auto-Interp
Head Attr Weights
0:0.01
1:0.01
2:0.06
3:0.13
4:0.09
5:0.02
6:0.21
7:0.16
8:0.05
9:0.05
10:0.09
11:0.06
Negative Logits
restrial
-1.50
Directive
-1.36
76561
-1.36
ocally
-1.34
goodwill
-1.26
Lisp
-1.25
itself
-1.25
innate
-1.24
consensual
-1.23
embodied
-1.22
POSITIVE LOGITS
croft
1.99
enlarge
1.45
circles
1.42
]'
1.40
slideshow
1.39
clip
1.38
Featured
1.38
captcha
1.35
sidebar
1.34
>>
1.33
Activations Density 0.001%