INDEX
Explanations
references to the streaming platform "Netflix"
mentions of Netflix in various contexts
New Auto-Interp
Negative Logits
manuel
-0.80
uate
-0.77
tera
-0.73
oral
-0.68
orically
-0.68
imer
-0.68
cised
-0.67
psy
-0.67
inence
-0.67
otic
-0.67
POSITIVE LOGITS
Streaming
0.87
Netflix
0.84
Film
0.81
Observatory
0.79
Orig
0.79
streaming
0.77
Netflix
0.76
HQ
0.76
Studios
0.75
Unlimited
0.73
Activations Density 0.013%