INDEX
Explanations
sometimes (multiple languages)
New Auto-Interp
Negative Logits
ანს
0.42
Lorsque
0.40
mtrl
0.40
abortion
0.39
ζε
0.38
നും
0.37
श्रृ
0.37
htb
0.37
াড়ার
0.37
usste
0.37
POSITIVE LOGITS
sometimes
0.44
инсу
0.43
sometimes
0.39
чнее
0.39
иногда
0.39
有时候
0.38
implementations
0.38
oftentimes
0.38
vielen
0.38
辶
0.38
Activations Density 0.000%