    Negative Logits
    Token            Logit
    " friable"       -0.75
    "FORMANCE"       -0.69
    " cantile"       -0.67
    " ductile"       -0.65
    " gtx"           -0.64
    " Shakspeare"    -0.63
    " idolat"        -0.63
    " ocze"          -0.62
    "certainty"      -0.62
    " scanty"        -0.61
    Positive Logits
    Token            Logit
    " MÁ"            1.01
    " Lég"           0.99
    " Février"       0.94
    " Lombar"        0.94
    " Ră"            0.93
    " Châ"           0.92
    " Gén"           0.92
    " geograf"       0.92
    " Regula"        0.91
    " Că"            0.91
    Activation Density: 0.143%

    No Known Activations