INDEX
    Explanations

    indicating a tool action

    New Auto-Interp
    Negative Logits
    قتي
    0.44
    kke
    0.42
    ха
    0.41
    0.39
    сова
    0.39
    Kuota
    0.38
    0.37
    ginger
    0.37
    voucher
    0.37
    ंख्य
    0.36
    POSITIVE LOGITS
     Action
    0.60
     action
    0.55
    action
    0.51
     Auro
    0.50
     Association
    0.46
     Input
    0.44
    Action
    0.44
     Next
    0.44
     ACTION
    0.43
     Execution
    0.43
    Act Density 0.003%

    No Known Activations