INDEX
Explanations
phrases that express affirmation or correctness
New Auto-Interp
Negative Logits
“
-0.50
[
-0.45
”
-0.44
جهت
-0.43
post
-0.43
ække
-0.41
/
-0.41
(
-0.40
milieux
-0.40
demo
-0.40
POSITIVE LOGITS
AccessorTable
0.89
Signalez
0.87
Personensuche
0.86
Paglinawan
0.86
تقاوى
0.86
المعيارى
0.85
GEBURTSDATUM
0.84
principalColumn
0.80
kasarigan
0.79
脚注の使い方
0.78
Activations Density 0.196%