INDEX
    Explanations

    descriptions related to cartoons

    Negative Logits
    forces          -0.71
    alez            -0.71
    govern          -0.69
    acia            -0.66
    utherford       -0.66
    FUL             -0.65
    CI              -0.64
    forced          -0.64
    ttp             -0.62
    vae             -0.62
    Positive Logits
     cartoons       1.18
    ishly           1.06
     cartoon        1.00
     caric          0.95
     frog           0.94
     Cartoon        0.89
     caricature     0.86
     sketches       0.86
     Hebdo          0.85
    eers            0.83
    Act Density 0.012%

    No Known Activations