INDEX
    Explanations

    the occurrence of the word "only."

    New Auto-Interp
    Head Attr Weights
    0:0.08
    1:0.06
    2:0.08
    3:0.07
    4:0.07
    5:0.08
    6:0.08
    7:0.07
    8:0.09
    9:0.08
    10:0.07
    11:0.10
    Negative Logits
     mute
    -3.20
     crow
    -3.08
     Mandarin
    -3.03
     Yel
    -2.73
     Monk
    -2.72
     Rahman
    -2.71
     audible
    -2.63
     hawk
    -2.60
     subscrib
    -2.59
     bub
    -2.59
    POSITIVE LOGITS
    idth
    3.21
    anton
    3.01
    toc
    2.99
     [/
    2.97
    groups
    2.85
    arget
    2.72
    breeding
    2.65
    abor
    2.65
    IDA
    2.65
     Compact
    2.59
    Act Density 0.000%

    No Known Activations