INDEX
Negative Logits
ħ
-0.08
અસર
-0.08
host
-0.07
Kuj
-0.07
מוש
-0.07
adh
-0.07
Zo
-0.07
ми
-0.07
అంత
-0.07
brass
-0.07
POSITIVE LOGITS
ponde
0.08
.game
0.08
rophe
0.08
-directed
0.08
resent
0.08
tributed
0.08
Game
0.08
/top
0.07
.language
0.07
oconut
0.07
Activations Density 0.072%