INDEX
Explanations
specific nouns and phrases in a descriptive or explanatory context
New Auto-Interp
Negative Logits
Roskov
-0.87
estekak
-0.84
Rüyada
-0.81
Paglinawan
-0.77
Signalez
-0.76
цездатний
-0.74
rungsseite
-0.73
bootstrapcdn
-0.72
astify
-0.72
>=",
-0.72
POSITIVE LOGITS
yalnızca
0.60
şun
0.52
genellikle
0.50
küpe
0.50
mümkün
0.49
karş
0.49
vektör
0.49
kesin
0.48
kılıf
0.48
tamamen
0.48
Activations Density 0.004%