INDEX
Explanations
proper names or entities, such as people's names
proper nouns, specifically names of people
New Auto-Interp
Negative Logits
emale
-0.63
ãĥ¼ãĥĨ
-0.57
shapeshifter
-0.55
eteria
-0.54
arettes
-0.53
Interstitial
-0.52
083
-0.50
circulate
-0.50
../
-0.50
ãĥ¼ãĥ³
-0.49
POSITIVE LOGITS
GOODMAN
0.65
Rice
0.61
Reed
0.61
Roberts
0.58
Hos
0.57
Miller
0.57
Foster
0.56
Butler
0.56
Jenkins
0.56
Bradley
0.56
Activations Density 0.476%