    Explanations

    indicators of comparison or similarity in context

    Negative Logits
    " also"          -0.51
    " led"           -0.50
    "دين"            -0.49
    " so"            -0.49
    "ToDelete"       -0.48
    "بوابة"          -0.47
    "chọn"           -0.46
    "ruitment"       -0.46
    " see"           -0.46
    " come"          -0.45

    Positive Logits
    "While"          1.11
    "Indeed"         1.08
    " Indeed"        1.05
    "Fortunately"    1.03
    "Perhaps"        1.03
    "Because"        1.02
    " Essentially"   1.02
    "Basically"      1.01
    "Essentially"    1.01
    "Whereas"        1.01
    Activation Density: 0.344%

    No Known Activations