    Explanations

    back of, behind, backend

    Negative Logits
    Token            Value
    " appelez"       0.57
    " događ"         0.55
    " yana"          0.55
    " jis"           0.52
    " différence"    0.52
    " labios"        0.52
    " în"            0.52
    " ý"             0.52
    " têm"           0.52
    " nDims"         0.52
    Positive Logits
    Token            Value
    "Back"           1.05
    " Back"          1.04
    "back"           1.02
                     1.00
    " belakang"      0.99
    " behind"        0.95
                     0.91
    " الخلف"         0.91
    " back"          0.90
    " पीछे"           0.90
    Activation Density: 0.051%

    No Known Activations