INDEX
Explanations
transcripts and presentations
New Auto-Interp
Negative Logits
▀
0.36
joggers
0.35
hitters
0.35
elé
0.34
ponsorship
0.34
renovations
0.33
izedBox
0.33
amputation
0.33
牰
0.33
wrench
0.33
POSITIVE LOGITS
slides
1.18
Slides
1.17
slide
1.11
Slide
1.07
Slide
1.07
slide
1.05
Slides
1.05
slides
1.04
Presentation
0.98
Transcript
0.97
Activations Density 0.003%