INDEX
    Explanations
    Negative Logits
     libri
    -0.08
     centimet
    -0.08
     boeken
    -0.08
    	book
    -0.08
     Libro
    -0.08
     પુસ્ત
    -0.08
     પુસ્તક
    -0.08
     nieder
    -0.08
     पुस्त
    -0.07
    -0.07
    Positive Logits
     closures
    0.08
    arto
    0.07
     مشكلة
    0.07
     expensive
    0.07
     coût
    0.07
     gated
    0.07
    тя
    0.07
     companionship
    0.07
    property
    0.07
    ataan
    0.07
    Act Density 0.005%

    No Known Activations