INDEX
Explanations
terms associated with elitism and influential individuals
New Auto-Interp
Negative Logits
RLF
-0.17
ral
-0.16
bara
-0.15
yectos
-0.15
egen
-0.14
iani
-0.14
unci
-0.14
unts
-0.14
seau
-0.14
Gür
-0.14
POSITIVE LOGITS
abeth
0.30
orado
0.19
quent
0.17
vier
0.17
beth
0.16
asticsearch
0.15
/el
0.15
izabeth
0.15
اذ
0.15
ucid
0.15
Activations Density 0.047%