INDEX
Negative Logits
Vladimir
-0.32
Vlad
-0.30
agina
-0.28
Dmit
-0.28
Ukr
-0.27
sez
-0.26
avr
-0.26
交æį¢
-0.26
Soviets
-0.26
ilater
-0.26
POSITIVE LOGITS
idata
0.31
缸åĬ©
0.29
minor
0.29
麻辣
0.28
Minor
0.27
dio
0.26
quanto
0.26
Minor
0.25
è¿Ļç§įäºĭæĥħ
0.24
alth
0.24
Activations Density 0.022%