    NEGATIVE LOGITS
    " appContext"       0.38
    " मस्त"              0.38
    " হইতেছে"            0.37
    "рована"            0.37
    " Eh"               0.36
    "embros"            0.36
    " acord"            0.35
    " BlueprintName"    0.35
    "eleron"            0.35
    " внимания"         0.35
    POSITIVE LOGITS
    " advantage"        1.45
    "优势"               1.44
    " vantagem"         1.37
    " ventaja"          1.35
    " advantages"       1.27
    " преимуще"         1.25
    "advantage"         1.24
    " vantaggio"        1.23
    " superiority"      1.18
    " Advantage"        1.16
    Activation Density: 0.015%

    No Known Activations