INDEX
Explanations
the article "a" and its variations, indicating a focus on singular nouns
New Auto-Interp
Negative Logits
صوتيه
-0.74
للمعارف
-0.73
RegressionTest
-0.71
kasarigan
-0.71
Hentet
-0.69
ujednoznacz
-0.66
SEGUIR
-0.63
Chwiliwch
-0.63
المكان
-0.61
GOTREF
-0.60
POSITIVE LOGITS
with
0.94
WITH
0.87
With
0.87
With
0.85
with
0.81
WITH
0.71
avec
0.69
עם
0.67
Avec
0.66
با
0.63
Activations Density 0.047%