INDEX
    Explanations

    terms and variations related to the word "head."

    New Auto-Interp
    Negative Logits
     nahilalakip
    -0.78
    solicit
    -0.71
    Allo
    -0.71
    momix
    -0.70
     disponibilités
    -0.70
    UNITY
    -0.70
    nique
    -0.68
     Urqu
    -0.68
    rungsseite
    -0.68
    ]]);
    -0.66
    POSITIVE LOGITS
     HEAD
    1.86
     head
    1.76
     Head
    1.76
     heads
    1.75
    Head
    1.66
     Heads
    1.64
    head
    1.59
    HEAD
    1.56
    heads
    1.47
    Heads
    1.44
    Act Density 0.054%

    No Known Activations