INDEX
Negative Logits
hope        0.44
Hope        0.41
নিজেও       0.40
hopes       0.40
HOPE        0.39
admires     0.39
enjoy       0.38
희          0.37
curr        0.37
cannot      0.36
Positive Logits
Hmmm          0.53
Oh            0.52
Hmm           0.52
Yeah          0.51
hmm           0.51
oh            0.49
Interestingly 0.49
Okay          0.48
Finally       0.47
Arkadaşlar    0.47
Activations Density: 0.000%