INDEX
    Explanations

    various phrases that convey actions and methodologies

    New Auto-Interp
    Negative Logits
     somehow
    -0.14
    rán
    -0.14
    ocus
    -0.13
    ÏĦά
    -0.13
     Favor
    -0.13
    aim
    -0.13
     Ðļод
    -0.13
     cr
    -0.13
    onaut
    -0.13
    289
    -0.13
    POSITIVE LOGITS
     approach
    0.22
     approached
    0.20
     Approach
    0.20
    appro
    0.19
     expression
    0.19
     approaching
    0.19
     вÑĭÑĢаж
    0.18
     expressing
    0.17
    Appro
    0.17
    express
    0.17
    Act Density 0.081%

    No Known Activations