INDEX
Explanations
statements indicating meaning or implications
New Auto-Interp
Negative Logits
sight
-0.57
W
-0.55
hä
-0.46
ub
-0.46
Kob
-0.46
Cordialement
-0.45
自行
-0.44
py
-0.43
trends
-0.43
ynomial
-0.42
POSITIVE LOGITS
مرئيه
1.09
للاسماء
0.96
MEANS
0.93
意味着
0.92
means
0.91
évaluateur
0.88
DockStyle
0.88
Means
0.85
artinya
0.83
gynhyrchwyd
0.83
Activations Density 0.210%