INDEX
Negative Logits
6
1.39
3
1.31
2
1.10
م
1.07
4
1.02
7
1.01
.
0.93
非
0.91
m
0.90
9
0.90
POSITIVE LOGITS
bustle
0.85
the
0.76
in
0.76
speople
0.72
archipelago
0.72
tains
0.70
ている
0.70
difer
0.69
도시
0.69
oes
0.69
Activations Density 0.002%
6
3
2
م
4
7
.
非
m
9
bustle
the
in
speople
archipelago
tains
ている
difer
도시
oes