INDEX
Explanations
exclamatory statements expressing enthusiasm or appreciation
New Auto-Interp
Negative Logits
omens
-0.17
Shepard
-0.15
osu
-0.14
Jon
-0.14
Whale
-0.14
uru
-0.14
-Regular
-0.14
ix
-0.14
bourg
-0.14
anh
-0.14
POSITIVE LOGITS
_CD
0.17
idf
0.15
anela
0.15
fid
0.14
èĵ
0.14
eled
0.14
CD
0.14
measure
0.13
dried
0.13
blr
0.13
Activations Density 0.179%