INDEX
    Explanations

    location/site

    New Auto-Interp
    Negative Logits
     of
    -0.65
    HtmlAttribute
    -0.59
    ContextCompat
    -0.57
    KEYCODE
    -0.55
    styleType
    -0.54
    RectangleBorder
    -0.54
    <bos>
    -0.53
    CloseOperation
    -0.49
    InstanceState
    -0.48
    SPATH
    -0.47
    POSITIVE LOGITS
     nahilalakip
    0.70
     تضيفلها
    0.65
    addContainerGap
    0.61
     away
    0.60
     niacin
    0.57
     Vikipedi
    0.56
    alians
    0.53
     ashore
    0.53
     feminina
    0.53
     foxes
    0.53
    Act Density 0.016%

    No Known Activations