INDEX
Explanations
phrases related to noticing or grabbing attention
catching attention or catching up
Negative Logits
WriteBarrier    -0.47
betweenstory    -0.41
tryck           -0.40
ariatric        -0.39
UserDao         -0.39
lesias          -0.38
mX              -0.37
verständ        -0.37
essential       -0.37
%)$             -0.37
Positive Logits
catching        1.34
caught          1.29
catches         1.28
catch           1.25
Caught          1.19
Caught          1.18
caught          1.17
Catch           1.12
CATCH           1.11
catching        1.09
Activations Density 0.009%