INDEX
    Explanations

    terms related to effects and impacts of actions or phenomena

    New Auto-Interp
    Negative Logits
    Ïģιά
    -0.17
    ÅĻet
    -0.16
    arie
    -0.15
    ria
    -0.14
    ató
    -0.14
     ones
    -0.13
    oba
    -0.13
    someone
    -0.13
    contained
    -0.13
    ẳn
    -0.13
    POSITIVE LOGITS
     of
    0.20
    /effects
    0.19
     played
    0.19
     cá»§a
    0.18
     Played
    0.15
    /import
    0.14
     wrought
    0.14
    à¸Ĥà¸Ńà¸ĩ
    0.14
    OfClass
    0.14
    .of
    0.14
    Act Density 0.056%

    No Known Activations