INDEX
    Explanations

    references to the concept of "head" and its various contexts

    New Auto-Interp
    Negative Logits
    Allo
    -0.72
     nahilalakip
    -0.72
     disponibilités
    -0.71
    UNITY
    -0.68
    ="{{$
    -0.66
     Chisholm
    -0.64
    miracle
    -0.64
     Thurman
    -0.63
     Lester
    -0.63
    -0.63
    POSITIVE LOGITS
     head
    2.69
     Head
    2.57
     HEAD
    2.55
    Head
    2.47
     heads
    2.41
    head
    2.31
    HEAD
    2.23
     Heads
    2.15
    heads
    2.03
    Heads
    1.92
    Act Density 0.037%

    No Known Activations