INDEX
Explanations
definite articles and their variations in different languages
New Auto-Interp
Negative Logits
kautta
-0.67
regionales
-0.66
Esau
-0.64
Pflanze
-0.64
küche
-0.63
pernas
-0.62
tarko
-0.61
învă
-0.60
juridiques
-0.60
temporales
-0.60
POSITIVE LOGITS
')],
0.98
"):
0.98
'):
0.92
%")
0.90
")));
0.90
%";
0.87
"])
0.87
)];
0.85
]='\
0.85
>",
0.84
Activations Density 0.022%