INDEX
    Explanations

    exploring themes without explicit details

    Negative Logits
    Token             Logit
    " especially"     0.87
    " quick"          0.87
    " meteen"         0.83
    " особенно"       0.83
    "especially"      0.82
    " sofort"         0.82
    "Immediate"       0.82
    " क्विक"           0.81
    " immediate"      0.81
    " cepat"          0.80
    Positive Logits
    Token             Logit
    " nevertheless"   1.45
    " nonetheless"    1.43
    " Nonetheless"    1.28
    "Nonetheless"     1.25
    " Nevertheless"   1.22
    "Nevertheless"    1.21
    " analogous"      1.16
    " dennoch"        1.12
    " comunque"       1.11
    " néanmoins"      1.07
    Activation Density: 1.001%

    No Known Activations