INDEX
    Explanations

    standing by no matter what

    New Auto-Interp
    Negative Logits
     collaps
    -0.09
    átka
    -0.09
     unre
    -0.09
    ursed
    -0.08
     horn
    -0.08
     handed
    -0.08
    akat
    -0.08
    alem
    -0.08
    SEG
    -0.08
     uncert
    -0.08
    POSITIVE LOGITS
     loyalty
    0.25
    loy
    0.23
     loyal
    0.22
     Loy
    0.22
    LOY
    0.18
     backs
    0.15
     stick
    0.15
     support
    0.15
     Stick
    0.15
     defend
    0.14
    Act Density 0.050%

    No Known Activations