INDEX
    Explanations

    words related to changes or modifications

    references to significant alterations or adjustments in various contexts

    New Auto-Interp
    Negative Logits
    amina
    -0.78
    ¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯
    -0.72
     AFB
    -0.72
    xious
    -0.69
     DRAGON
    -0.68
    ECD
    -0.67
    Äĩ
    -0.67
    ZE
    -0.67
    BILITY
    -0.67
    âĸ¬
    -0.66
    POSITIVE LOGITS
     effected
    0.91
    atile
    0.90
     wrought
    0.86
    hift
    0.85
    ettings
    0.83
    uits
    0.83
    ĸļ
    0.82
    oodoo
    0.81
    undown
    0.79
    ilver
    0.77
    Act Density 0.023%

    No Known Activations