INDEX
    Explanations

    phrases related to the introduction of something

    New Auto-Interp
    Negative Logits
    pots
    -0.83
    ply
    -0.79
    conserv
    -0.71
    pot
    -0.69
     knots
    -0.68
    vest
    -0.67
    tmp
    -0.66
    licts
    -0.63
     owed
    -0.63
    eters
    -0.62
    POSITIVE LOGITS
     introduction
    3.80
     introdu
    2.54
     Introduction
    2.16
    introdu
    2.13
     introducing
    1.91
     introduce
    1.88
     reintrodu
    1.81
     introductory
    1.66
    Introduction
    1.65
     Introdu
    1.62
    Act Density 0.009%

    No Known Activations