INDEX
    Explanations

    phrases indicating a functional role or purpose

    New Auto-Interp
    Negative Logits
    InlineData
    -0.56
     enem
    -0.54
     indeed
    -0.52
    expandindo
    -0.51
    indeed
    -0.51
     enjo
    -0.51
     rospy
    -0.49
     informée
    -0.48
     Sanger
    -0.48
    dflare
    -0.48
    POSITIVE LOGITS
     toimi
    0.74
     autorytatywna
    0.72
    adaptiveStyles
    0.71
     serve
    0.65
     serves
    0.64
     functioned
    0.62
    RunAsync
    0.62
     sirven
    0.60
     fornecer
    0.59
     servem
    0.59
    Act Density 0.265%

    No Known Activations