INDEX
Negative Logits
ori
-0.14
asco
-0.14
fid
-0.14
Mer
-0.14
Garn
-0.14
052
-0.13
Reb
-0.13
Abbas
-0.13
peg
-0.13
mer
-0.13
POSITIVE LOGITS
amet
0.21
itsu
0.18
овоÑĢ
0.17
اÙĦصÙģ
0.16
erosis
0.15
ázev
0.15
uze
0.15
iek
0.15
athers
0.15
çī
0.14
Activations Density 0.024%