INDEX
Explanations
references to celebrity status or recognition
New Auto-Interp
Negative Logits
estekak
-0.63
poptosis
-0.54
edance
-0.51
kháu
-0.51
TextEditing
-0.50
meeting
-0.50
achal
-0.50
lâm
-0.50
Билгалдахарш
-0.49
noDo
-0.49
POSITIVE LOGITS
star
2.37
stars
1.89
estrella
1.60
estrela
1.58
звезда
1.17
superstar
1.17
звез
1.16
estrellas
1.13
estrelas
1.13
yıldız
1.07
Activations Density 0.246%