INDEX
Explanations
instances of the word "hit" in various contexts
New Auto-Interp
Negative Logits
hire
-0.17
ei
-0.17
iac
-0.17
ered
-0.16
erge
-0.15
htdocs
-0.15
hotel
-0.15
Bor
-0.15
emente
-0.14
hem
-0.14
POSITIVE LOGITS
achi
0.22
TING
0.21
ting
0.20
ACHI
0.17
parade
0.17
arget
0.16
omi
0.16
REC
0.16
maker
0.15
.Hit
0.15
Activations Density 0.018%