Explanations
Asking for a definition or exact meaning
Negative Logits
सकती        0.31
nemus       0.30
뻔          0.29
والأ        0.29
别人        0.29
ujarnya     0.29
blico       0.28
How         0.28
temas       0.28
त्यावेळी     0.28
Positive Logits
exactly      0.56
actually     0.54
meant        0.50
causing      0.49
sebenarnya   0.49
exactly      0.48
egent        0.47
entails      0.46
meaning      0.46
constitutes  0.45
Activations Density 0.036%