INDEX
    Explanations

    verbs and phrases indicating an action or movement

    Negative Logits
     Yours          -0.16
     yourselves     -0.15
    flix            -0.15
    orno            -0.14
    ạt              -0.14
     ours           -0.14
    basePath        -0.14
    iltr            -0.13
    bilt            -0.13
     ########.      -0.13
    Positive Logits
     sua             0.38
     seu             0.37
     his             0.33
     suo             0.32
     seus            0.30
     suas            0.29
     svůj            0.28
     seine           0.27
     her             0.27
     their           0.26
    Act Density 0.068%

    No Known Activations