INDEX
    Explanations

    multi-lingual sentence endings

    New Auto-Interp
    Negative Logits
    0.62
    0.61
    0.57
     त्यानंतर
    0.55
     sebagainya
    0.55
    ia
    0.54
    okban
    0.54
     Saturday
    0.51
    ו
    0.50
    on
    0.50
    POSITIVE LOGITS
    0.81
    0.80
     as
    0.78
    ٹ
    0.76
     in
    0.73
    ق
    0.72
    ۔
    0.71
    گ
    0.70
     for
    0.69
    ن
    0.68
    Act Density 0.103%

    No Known Activations