INDEX
    Explanations

    the presence of punctuation, specifically periods, indicating the end of sentences

    New Auto-Interp
    Negative Logits
    vaux
    -0.68
    zha
    -0.68
    InputModule
    -0.64
    babwe
    -0.60
     InputDecoration
    -0.60
     GenerationType
    -0.59
    IRECT
    -0.58
    稲田
    -0.57
    cheidet
    -0.57
    metheus
    -0.56
    POSITIVE LOGITS
    .)
    2.00
     .)
    1.54
    ,)
    1.54
    .)}
    1.52
    .]
    1.49
    。)
    1.49
    .))
    1.48
    .”)
    1.47
    ].)
    1.42
    .")
    1.32
    Act Density 0.297%

    No Known Activations