INDEX
Explanations
phrases indicating comparisons and contrasts
New Auto-Interp
Negative Logits
لس
-0.47
hun
-0.47
nour
-0.47
גב
-0.44
ANS
-0.43
ueses
-0.43
朋
-0.43
eps
-0.43
Class
-0.42
sika
-0.42
POSITIVE LOGITS
disambiguazione
0.86
مرئيه
0.77
MLLoader
0.71
تضيفلها
0.71
Савезне
0.71
SBATCH
0.69
compared
0.66
endregion
0.66
GIVEREF
0.65
كومونز
0.65
Activations Density 0.264%