INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    _hit
    -0.06
     oppose
    -0.06
    aspect
    -0.06
    paragraph
    -0.06
    erro
    -0.06
    oví
    -0.06
    osal
    -0.06
     majors
    -0.06
    amines
    -0.06
     System
    -0.06
    POSITIVE LOGITS
    !↵
    0.06
    .loc
    0.06
    woord
    0.06
     Stand
    0.06
     upstream
    0.06
    Write
    0.06
     Ö
    0.06
    
    0.06
     stood
    0.06
    0.06
    Act Density 0.020%

    No Known Activations