INDEX
Explanations
nouns and related forms in a non-English language context
Negative Logits
Heath         -0.15
upert         -0.15
feet          -0.15
focal         -0.15
neatly        -0.14
straight      -0.14
fig           -0.14
dramatically  -0.14
ample         -0.14
vertically    -0.14
Positive Logits
yonel         0.18
вико          0.18
oks           0.17
kart          0.17
LOPT          0.17
GINE          0.16
krv           0.16
šet           0.16
â̦↵↵↵          0.15
anness        0.15
Activations Density 0.024%