INDEX
    Explanations

    conjunctions indicating contrast or opposition

    Negative Logits
     Majefty
    -0.66
     comfy
    -0.66
     Shakspeare
    -0.65
     ͡°
    -0.59
     grandkids
    -0.58
     Efq
    -0.58
    Продам
    -0.57
     שלנו
    -0.55
     לכם
    -0.55
     loving
    -0.54
    Positive Logits
     although
    0.84
     tuttavia
    0.81
     however
    0.77
    findpost
    0.77
     toutefois
    0.73
    новниш
    0.72
    AddTagHelper
    0.72
     tačiau
    0.71
     However
    0.68
     تضيفلها
    0.67
    Activation Density: 0.254%

    No Known Activations