INDEX
Negative Logits
akers
-0.16
ointed
-0.16
ÑĢÑĸÑĩ
-0.15
ipples
-0.15
rone
-0.15
алеж
-0.15
å©ļ
-0.14
Tiger
-0.14
ched
-0.14
neider
-0.14
POSITIVE LOGITS
ury
0.39
urious
0.38
uries
0.35
embourg
0.35
emb
0.32
URY
0.30
uri
0.26
uria
0.23
ur
0.21
ardo
0.19
Activations Density 0.008%