INDEX
Negative Logits
orda
-0.15
ReadWrite
-0.15
STALL
-0.15
.Listener
-0.14
less
-0.14
Tem
-0.14
dest
-0.14
sel
-0.14
ifik
-0.13
Diff
-0.13
POSITIVE LOGITS
ạp
0.15
chat
0.15
ond
0.15
957
0.14
Reserve
0.14
å¨
0.14
reserve
0.14
éĻ
0.14
acks
0.14
ivre
0.14
Activations Density 0.008%