INDEX
    Explanations

    sentences that indicate strong statements of benefit or effectiveness

    New Auto-Interp
    Negative Logits
     [â̦
    -0.15
    â̦the
    -0.14
    ãn
    -0.14
    â̦
    -0.14
    â̦↵
    -0.14
     [â̦]↵
    -0.13
     Elev
    -0.13
    â̦.
    -0.13
    â̦and
    -0.13
     [â̦]
    -0.12
    POSITIVE LOGITS
    æ±Ĺ
    0.14
    -sama
    0.13
    CJK
    0.13
    #{@
    0.13
     fant
    0.13
     اÙĦعظ
    0.13
     minh
    0.13
    लà¤Ĺ
    0.13
    à¥ĩà¤ķर
    0.13
    abbo
    0.13
    Act Density 0.000%

    No Known Activations