INDEX
Explanations
references to films and video content, particularly in connection to sports and documentaries
New Auto-Interp
Head Attr Weights
0:0.08
1:0.03
2:0.04
3:0.07
4:0.04
5:0.08
6:0.03
7:0.03
8:0.39
9:0.07
10:0.07
11:0.03
Negative Logits
recol
-1.32
RP
-1.23
RC
-1.22
ⓘ
-1.19
ainers
-1.15
Precision
-1.15
sourcing
-1.12
collaborators
-1.09
aliases
-1.09
Option
-1.07
POSITIVE LOGITS
unfold
1.94
flix
1.61
deterior
1.50
cartoons
1.39
closely
1.35
evolve
1.31
aloud
1.29
prosper
1.25
commercials
1.24
clips
1.20
Activations Density 0.065%