INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     have
    -0.81
     you
    -0.78
     are
    -0.77
     there
    -0.75
     can
    -0.74
     while
    -0.73
     will
    -0.73
     cannot
    -0.71
     therefore
    -0.70
     and
    -0.69
    POSITIVE LOGITS
     ivelany
    0.52
     تضيفلها
    0.47
    EDEFAULT
    0.46
     Савезне
    0.45
    agisse
    0.45
    >())
    0.44
    himself
    0.43
    astă
    0.42
     fibrillation
    0.42
     насељу
    0.42
    Act Density 0.003%

    No Known Activations