INDEX
Explanations
instances of the word "hit" in various contexts
New Auto-Interp
Negative Logits
Tundra
-0.69
🔹
-0.69
Dahl
-0.68
McCartney
-0.66
Jacobsen
-0.65
]_
-0.65
esinde
-0.64
°)
-0.63
()}
-0.63
надцать
-0.63
POSITIVE LOGITS
HIT
1.56
Hit
1.53
hit
1.52
HIT
1.48
Hit
1.46
hits
1.46
hit
1.45
Hits
1.39
hitting
1.36
+#+#
1.33
Activations Density 0.034%