INDEX
    Explanations

    Roman numerals IX (9) and X (10)

    Negative Logits
    Token                                                         Logit
    "--------------------------------------------------------"    -0.71
    " rule"                                                       -0.65
    " clutch"                                                     -0.63
    "TPS"                                                         -0.62
    " mount"                                                      -0.58
    "fixes"                                                       -0.58
    "thritis"                                                     -0.58
    "xon"                                                         -0.58
    " Serious"                                                    -0.57
    "forth"                                                       -0.56

    Positive Logits
    Token         Logit
    "iew"         1.10
    "irus"        1.09
    "isions"      1.09
    "olution"     1.06
    "isible"      1.03
    "entric"      1.00
    "ision"       1.00
    "itamin"      0.97
    "iral"        0.97
    "orst"        0.97
    Act Density 0.030%

    No Known Activations