INDEX
Explanations
range, probability, current, target
New Auto-Interp
Negative Logits
đều
0.45
obnie
0.42
çeşit
0.42
sorun
0.41
难
0.41
problemen
0.41
সমস্যায়
0.41
这边
0.41
法轮
0.40
驷
0.40
POSITIVE LOGITS
highest
0.58
current
0.55
estimated
0.55
your
0.55
current
0.54
amount
0.54
total
0.54
perceived
0.53
value
0.50
weighted
0.50
Activations Density 0.314%