    Explanations

    quantitative comparisons using the word "times"

    phrases indicating multiplicative comparisons or ratios

    Negative Logits
    Token                               Logit
    "cember"                            -0.78
    "services"                          -0.69
    "################################"  -0.64
    "rals"                              -0.64
    "iku"                               -0.62
    "apolis"                            -0.61
    "LCS"                               -0.60
    "uctions"                           -0.60
    "liction"                           -0.59
    "bies"                              -0.59

    Positive Logits
    Token                               Logit
    " slower"                           0.89
    " stronger"                         0.88
    " greater"                          0.88
    " faster"                           0.87
    " louder"                           0.84
    "avier"                             0.84
    " cheaper"                          0.83
    " hotter"                           0.83
    " worse"                            0.81
    " heavier"                          0.80

    Activation Density: 0.041%

    No Known Activations