INDEX
    Explanations

    phrases that describe daily activities or experiences

    New Auto-Interp
    Negative Logits
    /INFO
    -0.15
    UNCH
    -0.15
    ugin
    -0.14
     grandma
    -0.14
    boa
    -0.14
    inely
    -0.13
    pData
    -0.13
    UID
    -0.13
    UGIN
    -0.13
    izik
    -0.13
    POSITIVE LOGITS
     Hub
    0.45
     hubs
    0.42
     hub
    0.42
    hub
    0.40
    Hub
    0.39
     Husband
    0.38
     dh
    0.37
     DH
    0.36
     Hubb
    0.35
    DH
    0.34
    Act Density 0.277%

    No Known Activations