INDEX
Explanations
punctuation followed by Spanish phrases
New Auto-Interp
Negative Logits
egregious
1.02
proactive
0.99
limited
0.96
overarching
0.96
pesky
0.95
needing
0.95
tailored
0.93
decent
0.92
ethical
0.92
intelligent
0.92
POSITIVE LOGITS
Pentru
1.10
În
1.01
mostrar
1.01
Untuk
1.01
если
1.01
Esta
0.99
aparición
0.98
desde
0.97
Dengan
0.95
μπορεί
0.95
Activations Density 0.660%