INDEX
    Explanations

    phrases related to personal and physical actions

    New Auto-Interp
    Negative Logits
     pama
    -0.92
     guma
    -0.86
    <bos>
    -0.84
     susun
    -0.82
     ilang
    -0.79
     katun
    -0.77
     tanong
    -0.76
     maging
    -0.75
    membrance
    -0.74
     pinak
    -0.74
    POSITIVE LOGITS
     pregn
    0.99
     reluct
    0.98
     himself
    0.98
    himself
    0.97
     disreg
    0.96
     peppa
    0.95
     michelin
    0.93
     unden
    0.92
     intermitt
    0.91
     impra
    0.89
    Act Density 0.687%

    No Known Activations