INDEX
Explanations
desire, need, positioning, shine
New Auto-Interp
Negative Logits
bamb
0.47
┑
0.43
spaceShip
0.43
پول
0.43
einmal
0.42
インタ
0.42
គ្រឿង
0.41
ড়ের
0.41
pamię
0.41
鉆
0.41
POSITIVE LOGITS
hyp
0.37
'
0.37
abler
0.37
Hooper
0.36
0.35
fulness
0.35
eto
0.35
stift
0.35
imed
0.34
-
0.34
Activations Density 0.000%