INDEX
    Explanations

    numbers and Roman numerals followed by commas or letters

    Negative Logits
    Token       Logit
    "0"         -1.79
    " you"      -1.72
    " even"     -1.65
    "c"         -1.55
    "f"         -1.48
    "s"         -1.46
    "ly"        -1.46
    " there"    -1.43
    "w"         -1.41
    "b"         -1.38
    Positive Logits
    Token         Logit
    "</strong>"   1.69
    " cortada"    1.66
    "lepší"       1.63
    "ambut"       1.63
    "islamic"     1.61
    "när"         1.59
    "notre"       1.56
    "garmin"      1.55
    "motorola"    1.54
    "gabung"      1.53
    Act Density 0.062%

    No Known Activations