INDEX
Explanations
elements related to media and publishing, particularly focusing on film and photo content
New Auto-Interp
Head Attr Weights
0:0.04
1:0.02
2:0.16
3:0.37
4:0.06
5:0.02
6:0.05
7:0.06
8:0.03
9:0.04
10:0.04
11:0.05
Negative Logits
,'"
-3.16
',"
-2.92
"—
-2.85
),"
-2.58
,"
-2.48
'"
-2.21
"(
-2.12
"[
-2.07
".[
-2.02
}"
-2.02
POSITIVE LOGITS
)
2.22
);
1.92
↵
1.92
):
1.87
.)
1.86
})
1.85
:=
1.81
).
1.79
]
1.77
1.72
Activations Density 0.024%