INDEX
Explanations
captions or subtitles in text data
references to captions or subtitles in media content
New Auto-Interp
Negative Logits
prus
-0.68
ces
-0.68
cester
-0.63
lli
-0.63
dime
-0.62
Expect
-0.60
atra
-0.60
suspic
-0.59
ndra
-0.59
dinand
-0.59
POSITIVE LOGITS
caption
0.98
deck
0.88
edin
0.85
ed
0.83
acters
0.83
Redditor
0.77
edly
0.76
TextColor
0.76
escription
0.75
xual
0.74
Activations Density 0.008%