INDEX
Explanations
allowing natural development
New Auto-Interp
Negative Logits
असतात
0.51
amental
0.43
asher
0.43
doesn
0.42
refusing
0.41
असतो
0.41
õe
0.40
entiende
0.40
are
0.39
immediately
0.38
POSITIVE LOGITS
freely
0.76
自然
0.74
undisturbed
0.74
자연
0.69
naturali
0.69
自然的
0.69
自由
0.68
uninterrupted
0.68
natural
0.68
自由に
0.68
Activations Density 0.013%