INDEX
    Explanations

    descriptions, observations, and relationships

    New Auto-Interp
    Negative Logits
     fréquemment
    0.69
     frequentemente
    0.68
     häufig
    0.66
     เกี่ยว
    0.66
    częściej
    0.65
     нередко
    0.64
     primarily
    0.63
     predominantly
    0.63
     traditionally
    0.63
     complicating
    0.63
    POSITIVE LOGITS
    简直
    0.93
    真是
    0.80
    实在是
    0.71
    真的是
    0.71
     looks
    0.68
     ನನಗೆ
    0.66
     amazed
    0.64
    looks
    0.62
    really
    0.61
     정말
    0.61
    Act Density 0.260%

    No Known Activations