Explanations
proper nouns, possibly related to press articles or photos
mentions of specific names or titles related to individuals or artworks
Negative Logits
odore     -0.61
daq       -0.58
'."       -0.58
nil       -0.58
esame     -0.57
dinand    -0.57
atre      -0.55
redes     -0.55
',"       -0.55
]."       -0.55
Positive Logits
)         1.76
):        1.74
),        1.69
);        1.66
]         1.64
)]        1.61
).        1.57
].        1.57
))        1.53
])        1.52
Activations Density 0.659%