INDEX
Explanations
the word stem "ur" at the end of words
the word "ur" or its variations, indicating a focus on a specific phrasing or slang
New Auto-Interp
Negative Logits
otle
-0.69
pleas
-0.66
pleasing
-0.62
eas
-0.61
Benedict
-0.59
faithful
-0.58
Zucker
-0.57
e
-0.57
Feder
-0.55
least
-0.54
POSITIVE LOGITS
geon
1.40
ricane
1.18
geons
1.18
thur
1.08
ricanes
1.05
iosity
1.00
rences
1.00
assic
0.97
andom
0.96
idad
0.96
Activations Density 0.048%