INDEX
    Explanations

    occurrences of verbs and phrases indicating actions and processes

    New Auto-Interp
    Negative Logits
    _CAT
    -0.16
     Kew
    -0.15
    /
    -0.15
     MAV
    -0.14
     Interpret
    -0.14
    yon
    -0.14
    è«ĸ
    -0.14
    fo
    -0.14
     Gew
    -0.14
     Slee
    -0.14
    POSITIVE LOGITS
    _MACRO
    0.15
    .Listener
    0.15
    oyer
    0.15
    ogui
    0.15
    -alist
    0.15
    ODB
    0.15
    .jpa
    0.14
    ÏĥÏĦά
    0.14
    Absent
    0.14
    _AUX
    0.14
    Act Density 0.002%

    No Known Activations