INDEX
Explanations
slight or marginal differences
New Auto-Interp
Negative Logits
ૌ
0.42
मर्या
0.41
പല
0.41
Muchas
0.40
simpler
0.39
далеко
0.39
birçok
0.38
πολλ
0.38
不像
0.38
অনেক
0.37
POSITIVE LOGITS
slightly
2.13
slightly
1.90
Slightly
1.78
légèrement
1.70
slight
1.68
ligeramente
1.63
약간
1.55
marginally
1.51
немного
1.45
Slight
1.45
Activations Density 0.068%