INDEX
    Explanations
    NEGATIVE LOGITS
     còn
    0.86
     ainda
    0.81
    0.81
    还可以
    0.77
    還可以
    0.77
     еще
    0.75
    还会
    0.75
     noch
    0.74
     todavía
    0.74
     masih
    0.73
    POSITIVE LOGITS
    了解
    0.96
     understand
    0.73
     understands
    0.66
     NOR
    0.65
    NOR
    0.64
     slightest
    0.63
     paham
    0.63
    understand
    0.62
     Understand
    0.61
    0.61
    Activation Density 0.002%

    No Known Activations