INDEX
Explanations
personal pronouns and possessive forms
Possessive pronouns followed by nouns
Negative Logits
)_/¯        -0.69
vibe        -0.69
badass      -0.69
️           -0.68
backstory   -0.67
_$          -0.64
curated     -0.62
Heist       -0.62
permalink   -0.60
+#+         -0.60
Positive Logits
daß               0.62
muß               0.59
own               0.57
InstrumentedTest  0.54
luß               0.53
müßte             0.52
own               0.51
Boas              0.48
hitherto          0.47
Own               0.47
Activations Density 0.303%