INDEX
Negative Logits
bour
0.40
Throughout
0.38
dou
0.38
seamen
0.38
tasted
0.37
guér
0.36
Sé
0.36
Throughout
0.35
roir
0.35
ചര്യ
0.35
POSITIVE LOGITS
Lis
0.89
lis
0.80
Lis
0.79
Liss
0.72
Lisbon
0.72
Lisboa
0.71
LIS
0.69
liz
0.68
lis
0.67
Liz
0.66
Activations Density 0.003%