INDEX
Explanations
words ending in "der" with an emphasis on those ending in "der" followed by a high-activation digit
words related to individuals or people
Negative Logits
Token        Logit
African      -0.61
INTER        -0.59
Echoes       -0.57
elapsed      -0.56
Korean       -0.56
zzo          -0.55
Asian        -0.55
Pros         -0.55
Impossible   -0.54
[|           -0.54
Positive Logits
Token        Logit
der          1.21
theless      1.12
cair         0.98
iving        0.97
rors         0.92
iver         0.91
igans        0.90
ricks        0.88
rine         0.87
mil          0.87
Activations Density: 0.004%