INDEX
    Explanations

    phrases emphasizing the importance and necessity of certain actions or precautions

    New Auto-Interp
    Negative Logits
    аки
    -0.15
     Ridley
    -0.15
    gem
    -0.15
    hd
    -0.14
    仲
    -0.14
    eken
    -0.14
     Gem
    -0.14
     gem
    -0.14
    æīįèĥ½
    -0.14
    iken
    -0.14
    POSITIVE LOGITS
     wise
    0.34
    wise
    0.31
     Wise
    0.30
    -wise
    0.26
     helpful
    0.24
     wisdom
    0.23
     advisable
    0.23
     prudent
    0.23
     worthwhile
    0.23
     Wisdom
    0.22
    Act Density 0.201%

    No Known Activations