INDEX
Explanations
phrases indicating attempts or efforts
attempt or shot
New Auto-Interp
Negative Logits
hyö
-0.50
connexe
-0.47
kyllä
-0.46
tärke
-0.42
confirmer
-0.41
tė
-0.41
détruire
-0.40
confirmé
-0.40
näky
-0.39
őket
-0.39
POSITIVE LOGITS
attempt
1.12
attempt
1.08
attempts
1.08
Attempt
1.03
Attempt
1.02
attempted
1.00
Attempts
0.99
attempts
0.94
attempting
0.93
Attempts
0.92
Activations Density 0.061%