INDEX
    Explanations

    phrases related to change or transformation

    phrases indicating actions or processes involving interaction or alteration

    New Auto-Interp
    Negative Logits
    War
    -0.69
    Shin
    -0.68
    Wall
    -0.67
     Wiz
    -0.66
     Okin
    -0.65
     Allied
    -0.63
     Winged
    -0.61
     Walton
    -0.61
    Image
    -0.60
     Wall
    -0.60
    POSITIVE LOGITS
    etheless
    0.98
    mosp
    0.89
    rontal
    0.87
    terday
    0.82
     UNIVERS
    0.78
    acters
    0.78
    ilogy
    0.78
    FTWARE
    0.77
    anwhile
    0.76
    ossibility
    0.73
    Act Density 0.331%

    No Known Activations