INDEX
Explanations
phrasal verbs or definitions
New Auto-Interp
Negative Logits
redi
0.49
ley
0.49
red
0.48
illen
0.46
تور
0.46
깍
0.46
ുകെ
0.45
iav
0.44
고
0.44
놀
0.44
POSITIVE LOGITS
сюда
0.48
Tattha
0.44
करतात
0.43
Goodbye
0.43
totality
0.42
दिस
0.42
needing
0.42
暐
0.41
requiring
0.41
Uns
0.41
Activations Density 0.000%