    Explanations

    references to feeding or food-related actions

    NEGATIVE LOGITS
    ogl
    -0.16
    opus
    -0.15
    heit
    -0.15
    epad
    -0.15
    ophon
    -0.14
    ity
    -0.14
     ведь
    -0.14
    ely
    -0.14
     Klopp
    -0.14
    ue
    -0.14
    POSITIVE LOGITS
    /feed
    0.21
    -feed
    0.19
    .feed
    0.16
    /drivers
    0.16
    ruary
    0.15
    衆
    0.15
    rought
    0.15
     uống
    0.14
    _feed
    0.14
    linkplain
    0.14
    Act Density 0.028%

    No Known Activations