INDEX
    Explanations

    indigestion, heartburn

    New Auto-Interp
    Negative Logits
    、『
    -0.07
     населения
    -0.07
     Positioned
    -0.07
     follando
    -0.07
    liness
    -0.06
    actly
    -0.06
     holes
    -0.06
     slit
    -0.06
     brut
    -0.06
    ленных
    -0.06
    POSITIVE LOGITS
    kem
    0.07
    away
    0.06
    urally
    0.06
    	uint
    0.06
     kindly
    0.06
    rim
    0.06
     kart
    0.06
    IN
    0.06
     rom
    0.06
    0.06
    Act Density 0.001%

    No Known Activations