Explanations
occurrences of the word "stereotype" and its variations
Negative Logits
shell      -0.17
itan       -0.16
jan        -0.15
jar        -0.15
tera       -0.15
ãĥ¡ãĥ©     -0.15
icl        -0.15
ter        -0.14
izontal    -0.14
ismet      -0.14
Positive Logits
otypical      0.33
otyp          0.27
otyping       0.24
stere         0.24
stereotype    0.23
otypes        0.22
Ster          0.22
opsis         0.21
_typ          0.21
stereotypes   0.20
Activations Density 0.006%
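
The negative-logit token `ãĥ¡ãĥ©` looks like the printable byte-level form that GPT-2-style BPE tokenizers use for non-ASCII text. The sketch below is a minimal illustration under that assumption (the page does not name the model or tokenizer): it rebuilds the byte-to-character table and maps a token back to its UTF-8 text. The function names `bytes_to_unicode` and `decode_token` are chosen here for illustration and are not taken from the dashboard.

```python
def bytes_to_unicode() -> dict:
    """Byte -> printable character table, as in GPT-2-style byte-level BPE (assumed)."""
    printable = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    # Printable bytes map to themselves.
    table = {b: chr(b) for b in printable}
    # Remaining bytes are shifted to code points 256, 257, ... in ascending order.
    shift = 0
    for b in range(256):
        if b not in table:
            table[b] = chr(256 + shift)
            shift += 1
    return table


def decode_token(token: str) -> str:
    """Map a byte-level token string back to the UTF-8 text it encodes."""
    char_to_byte = {c: b for b, c in bytes_to_unicode().items()}
    raw = bytes(char_to_byte[ch] for ch in token)
    # A single token may hold a partial UTF-8 sequence, so replace bad bytes.
    return raw.decode("utf-8", errors="replace")


print(decode_token("ãĥ¡ãĥ©"))
```

If the tokenizer assumption holds, the token corresponds to the bytes `E3 83 A1 E3 83 A9`, that is, the katakana pair メラ. Plain ASCII tokens such as `stere` pass through unchanged.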