INDEX
    Explanations

    phrases and variations of the word "introduce."

    New Auto-Interp
    Negative Logits
    رى
    -0.72
    ma
    -0.67
    ar
    -0.67
    5
    -0.64
    na
    -0.63
     kas
    -0.63
    mo
    -0.62
    m
    -0.61
    nasa
    -0.61
     B
    -0.61
    POSITIVE LOGITS
     Introduce
    1.67
     introduces
    1.55
     introductions
    1.54
    Introduce
    1.52
    introduce
    1.49
     introduce
    1.48
     introduction
    1.48
     introdu
    1.44
     Introducing
    1.44
     introducing
    1.42
    Act Density 0.060%

    No Known Activations