INDEX
    Explanations

    lists of items or states

    Negative Logits
    并不  0.94
    Gli  0.89
     این  0.88
    Jadi  0.87
     sogenannten  0.87
    不僅  0.83
     Основ  0.82
     Besonders  0.82
     sfera  0.81
     اين  0.81
    Positive Logits
     likewise  0.96
     equally  0.95
     correspondingly  0.91
     ones  0.90
     similarly  0.83
     others  0.81
     also  0.79
     conversely  0.79
    0.73
     etc  0.69
    Act Density 2.516%

    No Known Activations