INDEX
Negative Logits
かっこ
0.47
klassischen
0.43
kreativ
0.43
beeindruck
0.42
を実行
0.42
postulated
0.42
<0x00>
0.41
sinnvoll
0.41
𝕌
0.41
憧
0.41
POSITIVE LOGITS
unfortunately
0.59
сожалению
0.58
apologize
0.56
apologies
0.54
हमारा
0.53
apologise
0.53
हमारी
0.52
apology
0.52
确实
0.52
our
0.51
Activations Density 0.033%