INDEX
    Explanations

    connecting or listing ideas

    New Auto-Interp
    Negative Logits
    2.11
    1.83
    1.83
    1.82
    フォ
    1.78
    ِي
    1.75
    ファイル
    1.75
    調
    1.70
    Воз
    1.68
    Рис
    1.68
    POSITIVE LOGITS
     bundan
    1.99
     gerçekten
    1.98
     δεν
    1.97
     bunu
    1.96
     hiçbir
    1.91
     hiç
    1.88
     sadece
    1.86
     özellikle
    1.84
     yalnızca
    1.78
     bugün
    1.76
    Act Density 0.036%

    No Known Activations