INDEX
    Explanations

    following your instructions

    New Auto-Interp
    Negative Logits
     følgende
    0.45
     FOLLOWING
    0.39
     उल्लंघन
    0.38
     કરવાનો
    0.38
    ộc
    0.37
     khỏi
    0.37
     folgenden
    0.37
     följande
    0.37
    enerbah
    0.36
    годи
    0.36
    POSITIVE LOGITS
     suit
    1.15
     closely
    0.94
     along
    0.91
    suit
    0.85
     Suit
    0.79
    Along
    0.72
    along
    0.71
     Along
    0.70
    Suit
    0.70
    スーツ
    0.60
    Act Density 0.026%

    No Known Activations