INDEX
    Explanations

    references to the concept of scale, particularly in relation to dimensions or levels of measurement

    New Auto-Interp
    Negative Logits
    zelf
    -0.18
    ernals
    -0.17
    veau
    -0.15
    sell
    -0.15
    assis
    -0.15
    uries
    -0.15
    iates
    -0.15
     Hakk
    -0.15
    ession
    -0.14
    respond
    -0.14
    POSITIVE LOGITS
    -down
    0.28
    -up
    0.27
    able
    0.23
    out
    0.22
    -out
    0.20
    way
    0.20
    ToFit
    0.20
    tron
    0.18
    up
    0.18
    azy
    0.18
    Act Density 0.020%

    No Known Activations