Explanations
terms related to diversity
references to the concept of diversity
Negative Logits
Token       Logit
ENA         -0.88
DA          -0.80
amina       -0.78
mentioned   -0.76
FIN         -0.76
ibur        -0.73
cise        -0.72
cel         -0.70
INK         -0.69
hiba        -0.68
Positive Logits
Token       Logit
diversity   1.25
Diversity   1.14
iveness     0.90
ensical     0.87
ĸļ          0.83
atility     0.80
icity       0.79
icultural   0.77
richness    0.75
yip         0.75
Activations Density: 0.009%