INDEX
Explanations
descriptive nouns and verbs
Negative Logits
厷  0.43
價  0.41
٧  0.41
十  0.40
económicos  0.40
varit  0.39
Ә  0.39
ഷ  0.38
verwend  0.37
biti  0.37
Positive Logits
donated  0.42
inspired  0.41
conquered  0.40
Lime  0.39
Leather  0.38
overlapped  0.38
াবাদের  0.37
সের  0.37
succeeded  0.37
overtaken  0.37
Activations Density: 0.003%