INDEX
Explanations
Ultimately, moreover, consequently
New Auto-Interp
Negative Logits
didn
1.47
you
1.43
:)
1.42
YOU
1.40
тоже
1.38
kinda
1.31
你
1.30
sooo
1.29
lots
1.28
tienes
1.28
POSITIVE LOGITS
Despite
1.57
Despite
1.47
Ultimately
1.46
Moreover
1.42
Moreover
1.42
Nevertheless
1.36
Consequently
1.34
Consequently
1.34
Indeed
1.30
Beyond
1.29
Activations Density 0.135%