INDEX
    Explanations

    sentences that use the word "like" to initiate comparisons or examples

    New Auto-Interp
    Negative Logits
    åłĤ
    -0.18
     EITHER
    -0.16
     Verfügung
    -0.16
    either
    -0.16
    illin
    -0.14
    onical
    -0.14
    .Îķ
    -0.14
    plemented
    -0.14
    orno
    -0.14
    ifix
    -0.13
    POSITIVE LOGITS
     many
    0.41
     most
    0.33
    many
    0.33
     any
    0.27
    许å¤ļ
    0.26
    Many
    0.25
     MANY
    0.25
     Many
    0.24
     with
    0.23
     everything
    0.23
    Act Density 0.073%

    No Known Activations