INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Linear
    -0.08
    Linear
    -0.08
     core
    -0.07
     linear
    -0.07
     прор
    -0.07
    isbn
    -0.07
     Colomb
    -0.07
    where
    -0.06
    .Linear
    -0.06
     Grand
    -0.06
    POSITIVE LOGITS
     mist
    0.16
     Mist
    0.16
    mist
    0.10
     Frost
    0.09
     mistress
    0.08
    --------------------------------------------------------------------------------
    0.07
    .assertNull
    0.07
     MSC
    0.07
    0.07
     overlooking
    0.07
    Act Density 0.003%

    No Known Activations