INDEX
Explanations
references to giving or trying something, particularly in a context of effort or exploration
After the tokens "a" or "one"
articles followed by actions
New Auto-Interp
Negative Logits
+#+#
-0.56
DialogInterface
-0.55
Riproduzione
-0.55
soie
-0.53
Vidite
-0.53
Rapport
-0.51
مرئيه
-0.50
occuper
-0.50
createState
-0.50
sonriendo
-0.49
POSITIVE LOGITS
shot
1.65
shot
1.36
go
1.29
crack
1.28
Shot
1.27
shots
1.26
SHOT
1.21
Shot
1.17
try
1.16
SHOT
1.14
Activations Density 0.191%