INDEX
Explanations
dialogue and translation prompts
Negative Logits
Token        Logit
need         1.26
needs        1.21
cần          1.21
needed       1.18
need         1.13
needed       1.12
needs        1.10
необходимо   1.09
Need         1.06
需要在       1.03
Positive Logits
Token        Logit
あなた       0.80
USING        0.77
mittens      0.73
私           0.73
Gossip       0.73
😏           0.72
BECAUSE      0.71
dones        0.71
Using        0.71
partners     0.70
Activations Density 0.116%
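
The page does not state how the density figure is computed. A common convention for dashboards of this kind is the fraction of sampled token positions on which the feature has a nonzero activation. The sketch below illustrates that reading only; the function name `activation_density`, the `threshold` parameter, and the toy data are all hypothetical and are not taken from this page.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of sampled positions where the
    feature fires. The source page does not confirm this definition.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy data: the feature fires on 1 of 8 positions.
toy = [0.0, 0.0, 0.0, 2.5, 0.0, 0.0, 0.0, 0.0]
density = activation_density(toy)
print(f"{density:.3%}")  # prints "12.500%"
```

Under this reading, a density of 0.116% would correspond to the feature firing on roughly 1 in every 860 sampled positions.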