INDEX
Explanations
the word "originally" and its variations, indicating a focus on the origins or initial intentions behind subjects
Negative Logits
avier        -0.18
al           -0.16
ackson       -0.16
ayne         -0.15
ifer         -0.15
amines       -0.15
ere          -0.15
Williamson   -0.14
William      -0.14
Miller       -0.14
Positive Logits
ãĥ¼ãĤ¹ãĥĪ     0.16
nings        0.16
ural         0.15
istrat       0.15
ochen        0.15
inals        0.15
_trampoline  0.14
yssey        0.14
bine         0.14
ĵåIJį         0.14
Activations Density 0.024%
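
Some of the positive-logit tokens listed above (for example ãĥ¼ãĤ¹ãĥĪ) look like raw byte-level BPE vocabulary strings rather than readable text. The sketch below shows one way to map such a string back to UTF-8. It assumes the tokenizer uses the GPT-2-style byte-to-unicode table, which the page itself does not confirm; the helper names `bytes_to_unicode` and `decode_token` are illustrative and not part of any dashboard API.

```python
def bytes_to_unicode():
    """Rebuild the GPT-2-style byte -> printable-character table (assumed here)."""
    # Bytes that are already printable map to themselves.
    bs = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    cs = bs[:]
    n = 0
    # Every remaining byte is shifted to a code point starting at 256.
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return dict(zip(bs, (chr(c) for c in cs)))


# Inverse table: vocabulary character -> original byte value.
UNICODE_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(token: str) -> str:
    """Turn a byte-level vocabulary string back into text.

    Tokens that start or end in the middle of a multi-byte character
    cannot be decoded fully; those bytes come back as U+FFFD.
    """
    raw = bytes(UNICODE_TO_BYTE[ch] for ch in token)
    return raw.decode("utf-8", errors="replace")


print(decode_token("ãĥ¼ãĤ¹ãĥĪ"))  # ースト
```

Plain ASCII tokens such as `ural` pass through unchanged. A token that covers only part of a multi-byte character decodes with replacement characters, so it is best read alongside its neighbouring tokens.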