INDEX
    Explanations

    questions or phrases seeking clarification and meaning

    New Auto-Interp
    Negative Logits
    得以
    -0.36
     tev
    -0.31
    }}$\\
    -0.30
     kark
    -0.29
     дописавши
    -0.29
    ()}}
    -0.28
    "));
    -0.27
     contribue
    -0.26
     lectura
    -0.26
    ↵↵↵
    -0.26
    POSITIVE LOGITS
     gemeint
    0.86
     referring
    0.86
     dimaksud
    0.82
     bedo
    0.75
     nahilalakip
    0.75
    AndEndTag
    0.73
    OGND
    0.71
     Referring
    0.70
    指的是
    0.69
     chodzi
    0.69
    Act Density 0.553%

    No Known Activations