INDEX
    Explanations

    programming code

    New Auto-Interp
    Negative Logits
    ly
    -0.66
     dàng
    -0.65
    indest
    -0.65
    langes
    -0.60
    linge
    -0.60
     Liege
    -0.58
    lang
    -0.57
    langs
    -0.57
    tedly
    -0.56
    erçe
    -0.54
    POSITIVE LOGITS
    i
    0.96
    e
    0.85
    a
    0.75
    expandindo
    0.69
    y
    0.60
    ה
    0.60
    z
    0.57
    intios
    0.56
    й
    0.55
    iis
    0.52
    Act Density 0.111%

    No Known Activations