INDEX
Explanations
phrases that include the term "so-called" used to describe something in a skeptical or dismissive manner
New Auto-Interp
Negative Logits
yr
-0.15
zet
-0.15
ek
-0.14
nodoc
-0.14
.Aggressive
-0.14
ksi
-0.14
ekl
-0.14
dist
-0.14
бо
-0.13
ship
-0.13
POSITIVE LOGITS
ly
0.17
oop
0.15
McMahon
0.14
ÙģØ§Øª
0.14
ollipop
0.14
ernen
0.14
大ä¼ļ
0.14
uku
0.14
urr
0.13
hood
0.13
Activations Density 0.007%