Negative Logits
almost      0.78
बहु          0.73
promotion   0.72
promote     0.71
उत्          0.70
Whether     0.70
উত্তম        0.69
nearly      0.68
as          0.67
Although    0.67
Positive Logits
hey          1.06
persevere    1.04
persever     1.00
Hey          1.00
nonetheless  0.97
recompens    0.96
Reward       0.95
rewards      0.94
Rewards      0.92
rewarding    0.91
Activations Density: 0.092%