INDEX
Negative Logits
ï
0.70
ad
0.60
‹
0.59
0.59
kurul
0.58
ile
0.56
ig
0.56
드를
0.56
atore
0.56
oked
0.56
POSITIVE LOGITS
↵↵
1.44
↵↵↵
1.42
↵↵↵↵
1.31
↵↵↵↵↵
1.24
↵↵↵↵↵↵
1.07
↵↵↵↵↵↵↵↵
1.06
↵↵↵↵↵↵↵↵↵
1.06
)。
1.04
↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵
1.00
↵↵↵↵↵↵↵
0.99
Activations Density 3.980%