INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     please
    -0.99
    Please
    -0.96
    please
    -0.93
     Please
    -0.88
     bitte
    -0.84
     пожалуйста
    -0.79
    PLEASE
    -0.71
     PLEASE
    -0.70
     Bitte
    -0.69
     pls
    -0.65
    POSITIVE LOGITS
     propOrder
    0.92
     дописавши
    0.81
    InitVars
    0.75
    Identyfik
    0.75
    ArrowToggle
    0.73
     calendriers
    0.73
    AccessorTable
    0.72
    BeginInit
    0.72
    :✨
    0.72
    IsMutable
    0.71
    Act Density 0.059%

    No Known Activations