INDEX
    Explanations

    instances of specific technical terms and instructions related to functionality and control

    overhang, grasp, injunction, supervision

    Negative Logits
    <bos>       -1.55
    enskap      -0.73
    ítmény      -0.71
    𝓴           -0.63
    𝓲           -0.62
    𝓫           -0.61
    𝓸           -0.60
    𝓵           -0.59
    𝓭           -0.59
    isa         -0.58
    Positive Logits
    s           1.05
    ים          0.71
    ième        0.70
     sánchez    0.66
    sweise      0.63
     parís      0.63
     rodríguez  0.60
    stanbul     0.59
     dezelve    0.59
     saveiro    0.59
    Act Density 0.154%

    No Known Activations