Negative Logits
Token       Logit
dry         -1.08
dry         -1.02
Dry         -0.98
Dry         -0.94
يتيمه       -0.90
DRY         -0.90
للاسماء     -0.89
DRY         -0.85
期刊论文    -0.80
dryness     -0.79
Positive Logits
Token       Logit
out         0.49
outs        0.48
time        0.43
Denna       0.41
i           0.41
зами        0.40
worms       0.39
time        0.39
urp         0.39
wall        0.39
Activations Density: 0.003%