INDEX
    Explanations

    media captions and related textual descriptions

    captions in media-related content

    New Auto-Interp
    Negative Logits
    ĪĴ
    -0.79
    walker
    -0.72
    Magikarp
    -0.72
    nesia
    -0.70
    vals
    -0.69
    Ö¼
    -0.66
     bowling
    -0.64
    urat
    -0.64
    atto
    -0.64
    »Ĵ
    -0.64
    POSITIVE LOGITS
     WATCH
    0.83
    acters
    0.77
     Theresa
    0.72
     Corbyn
    0.69
     ITV
    0.69
     BBC
    0.69
     Natasha
    0.67
     Prof
    0.67
     Survive
    0.66
    =-=-=-=-=-=-=-=-
    0.66
    Act Density 0.004%

    No Known Activations