INDEX
    Explanations

    references to diversity and representation issues in media

    New Auto-Interp
    Negative Logits
    
    -0.54
     estekak
    -0.50
    esgue
    -0.49
     rechter
    -0.49
    }],
    
    -0.48
    ioutil
    -0.47
     indipendente
    -0.47
    riwal
    -0.46
     sardines
    -0.46
     ]
    
    -0.45
    POSITIVE LOGITS
     Skywalker
    0.80
     Marvel
    0.79
     lightsaber
    0.75
    olkien
    0.75
     Jedi
    0.70
    imetsu
    0.70
     movie
    0.70
     MCU
    0.68
    Marvel
    0.68
     Thanos
    0.66
    Act Density 0.190%

    No Known Activations