INDEX
Explanations
hyperlinks and titles indicating a viewer's perspective to direct attention to specific content
occurrences of the word "View" and its variations in the text
New Auto-Interp
Negative Logits
cffff
-0.68
ussian
-0.66
vernment
-0.66
indo
-0.64
ascal
-0.64
iques
-0.61
apo
-0.60
abstinence
-0.60
shroud
-0.59
bara
-0.59
POSITIVE LOGITS
ership
1.24
largeDownload
1.20
Caption
1.00
ById
0.96
ers
0.92
Transcript
0.90
Images
0.88
points
0.85
photos
0.84
topic
0.84
Activations Density 0.020%