INDEX
Explanations
references to the concept of "head" and its various contexts
New Auto-Interp
Negative Logits
Allo
-0.72
nahilalakip
-0.72
disponibilités
-0.71
UNITY
-0.68
="{{$-0.66
Chisholm
-0.64
miracle
-0.64
Thurman
-0.63
Lester
-0.63
ⓘ
-0.63
POSITIVE LOGITS
head
2.69
Head
2.57
HEAD
2.55
Head
2.47
heads
2.41
head
2.31
HEAD
2.23
Heads
2.15
heads
2.03
Heads
1.92
Activations Density 0.037%