INDEX
Explanations
hesitation and thinking sounds
Negative Logits
+{\
2.38
Ĝ
2.29
2.28
larni
2.20
către
2.18
ită
2.17
owego
2.16
giphy
2.16
➋
2.16
محض
2.15
Positive Logits
}-
2.22
்
2.11
Ventilation
1.92
ोग
1.87
Hedgehog
1.82
люд
1.82
🤔
1.81
fw
1.79
typo
1.74
躇
1.74
Activations Density 0.029%