INDEX
Explanations
No Explanations Found
New Auto-Interp
Negative Logits
não
0.80
לא
0.74
não
0.72
ikke
0.70
גם
0.69
не
0.65
না
0.63
nicht
0.63
NÃO
0.62
non
0.60
POSITIVE LOGITS
necessarily
1.75
necessarily
1.32
necessariamente
1.22
necesariamente
1.14
obstante
0.94
nécessairement
0.93
orious
0.90
withstanding
0.89
anymore
0.87
forcément
0.85
Activations Density 0.766%