INDEX
    Explanations

    contrasting or alternative ideas presented with 'instead' or 'rather'

    Negative Logits
    不是
    -0.25
    不能
    -0.24
     tidak
    -0.23
     nicht
    -0.23
     ikke
    -0.22
     não
    -0.22
     not
    -0.22
     cannot
    -0.22
     không
    -0.22
    不会
    -0.21
    Positive Logits
     merely
    0.20
    gaard
    0.16
    eti
    0.15
     opting
    0.14
    hen
    0.14
    نش
    0.14
    848
    0.14
    .cgi
    0.14
     pá
    0.13
    licht
    0.13
    Activation Density 0.059%

    No Known Activations