INDEX
    Explanations

    phrases related to frequency or repetition

    Negative Logits
    Token               Logit
    "olas"              -0.76
    "phabet"            -0.73
    " Metatron"         -0.67
    "zai"               -0.66
    "dict"              -0.66
    " Viz"              -0.64
    "hess"              -0.62
    " notwithstanding"  -0.61
    " Osw"              -0.61
    "Ü"                 -0.58
    Positive Logits
    Token               Logit
    "THING"             1.44
    " conceivable"      1.03
    "where"             0.98
    "single"            0.95
    " imaginable"       0.92
    " single"           0.91
    "things"            0.90
    "WHERE"             0.87
    "Ĥª"                0.82
    "body"              0.81
    Activation Density: 0.044%

    No Known Activations