INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Algorithm
    -0.07
     algorithm
    -0.07
    IBM
    -0.07
     slang
    -0.07
     precision
    -0.07
     Deine
    -0.07
    Algorithm
    -0.07
     IBS
    -0.07
     CI
    -0.07
     istit
    -0.07
    POSITIVE LOGITS
     fireplaces
    0.11
     fireplace
    0.10
    /lounge
    0.10
     spacious
    0.10
     помещения
    0.09
    (er
    0.09
     cozy
    0.09
    0.09
     noho
    0.09
    0.09
    Act Density 0.013%

    No Known Activations