INDEX
    Explanations

    instances where comparisons are made between different situations or entities

    New Auto-Interp
    Negative Logits
     ftu
    -1.00
     igno
    -0.94
     secon
    -0.94
     fta
    -0.93
     inder
    -0.92
     fto
    -0.91
     seiz
    -0.90
     uniqu
    -0.88
     fup
    -0.88
     unil
    -0.88
    POSITIVE LOGITS
     still
    1.39
    still
    1.31
    Still
    1.23
     Still
    1.16
     STILL
    1.03
     vẫn
    0.95
     nevertheless
    0.88
     nonetheless
    0.85
     remain
    0.83
     nadal
    0.82
    Act Density 0.330%

    No Known Activations