    Explanations

    instances of phrases indicating starting or initiating actions

    Negative Logits
    Token               Logit
    ".scalablytyped"    -0.18
    " اولین"            -0.18
    " Starting"         -0.17
    "入口"              -0.17
    "Starting"          -0.17
    "(first"            -0.16
    "第一"              -0.15
    " starting"         -0.15
    " Entry"            -0.15
    " first"            -0.14
    Positive Logits
    Token               Logit
    " begin"            0.52
    "begin"             0.47
    ".begin"            0.43
    " Begin"            0.41
    " begins"           0.40
    "Begin"             0.40
    " began"            0.39
    " BEGIN"            0.37
    "BEGIN"             0.35
    " begun"            0.35
    Activation Density: 0.019%

    No Known Activations