INDEX
Negative Logits
𒀸
0.49
್ರಾ
0.44
ल्लाला
0.44
gras
0.44
leyball
0.41
neigh
0.39
ungkan
0.38
prac
0.38
grim
0.37
ateful
0.37
POSITIVE LOGITS
than
0.50
that
0.47
decât
0.46
tỷ
0.41
rằng
0.41
kuin
0.41
Fraction
0.41
fraction
0.40
身影
0.40
That
0.39
Activations Density 0.002%