    Explanations

    actions involving movement or physical interaction

    Negative Logits
    Token            Logit
    "igi"            -0.15
    " Overnight"     -0.15
    "ownik"          -0.14
    "onaut"          -0.14
    " primitive"     -0.14
    "ride"           -0.13
    "avicon"         -0.13
    "onom"           -0.13
    "banner"         -0.13
    "fits"           -0.13

    Positive Logits
    Token            Logit
    "offee"          0.15
    "ghan"           0.15
    "èĸ"             0.15
    "Calibri"        0.14
    "欲"             0.14
    "marshaller"     0.14
    " walking"       0.14
    "alloca"         0.14
    "ummer"          0.14
    "walking"        0.14
    Act Density 0.176%

    No Known Activations