INDEX
Explanations
numerical values and specific formatting used in scientific papers and reports
Negative Logits
Token       Logit
d           -0.48
kre         -0.44
growing     -0.39
ger         -0.38
ynes        -0.37
Da          -0.36
//          -0.35
require     -0.35
#!/         -0.34
scroll      -0.34
Positive Logits
Token       Logit
المكان      0.97
OGND        0.96
lenker      0.95
ьаж         0.94
دانشنامهٔ   0.93
дописавши   0.91
beginnetje  0.90
archiviato  0.90
للاسماء     0.85
الحياه      0.85
Activations Density: 0.377%