    Negative Logits
    -0.56  "jboss"
    -0.54  " General"
    -0.52  " who"
    -0.49  " Gemein"
    -0.49  " ring"
    -0.49  "ochond"
    -0.49  " general"
    -0.47  "RING"
    -0.47  "ring"
    -0.46  " Ren"
    Positive Logits
    0.75  " ProtoMessage"
    0.71  " تضيفلها"
    0.70  " متعلقه"
    0.67  " سكانية"
    0.66  " transfieras"
    0.65  " للاسماء"
    0.64
    0.64  "цездатний"
    0.63  "'}>"
    0.63  "__);"
    Activation Density 0.223%

    No Known Activations