INDEX
    Explanations

    phrases that indicate actions or processes associated with orders and instructions

    New Auto-Interp
    Negative Logits
     Jefus
    -1.46
     pleaſure
    -1.44
     Efq
    -1.40
     houſe
    -1.39
     Monfieur
    -1.37
     myſelf
    -1.36
     Majefty
    -1.33
     Theſe
    -1.32
     faſt
    -1.32
     themſelves
    -1.30
    POSITIVE LOGITS
     afin
    0.88
     inorder
    0.83
     أجل
    0.81
     чтобы
    0.77
    是为了
    0.74
     to
    0.73
     upang
    0.71
    kében
    0.70
    Чтобы
    0.70
     כדי
    0.70
    Act Density 0.071%

    No Known Activations