INDEX
    Explanations

    instances of the word "self" indicating object-oriented programming concepts

    New Auto-Interp
    Negative Logits
    OGND
    -0.87
    RegistryLite
    -0.78
     للمعارف
    -0.78
    __':
    
    -0.75
    PreferredItem
    -0.65
     ویکی‌پدی
    -0.65
     يتيمه
    -0.63
    ].)
    -0.62
    Personendaten
    -0.61
    :✨
    -0.61
    POSITIVE LOGITS
    lasma
    0.49
    Jvm
    0.48
    contentView
    0.48
     Sp
    0.48
    lar
    0.47
    MOUTH
    0.47
    tawesome
    0.46
    destruct
    0.46
     P
    0.45
    scaleX
    0.45
    Act Density 0.040%

    No Known Activations