INDEX
Explanations
references to diversity and representation issues in media
New Auto-Interp
Negative Logits
-0.54
estekak
-0.50
esgue
-0.49
rechter
-0.49
}],
-0.48
ioutil
-0.47
indipendente
-0.47
riwal
-0.46
sardines
-0.46
]
-0.45
POSITIVE LOGITS
Skywalker
0.80
Marvel
0.79
lightsaber
0.75
olkien
0.75
Jedi
0.70
imetsu
0.70
movie
0.70
MCU
0.68
Marvel
0.68
Thanos
0.66
Activations Density 0.190%