INDEX
Explanations
names or terms related to famous people or characters
proper nouns, particularly names
New Auto-Interp
Negative Logits
ikuman
-0.61
akeru
-0.58
etheus
-0.58
Pwr
-0.55
arbon
-0.55
ishi
-0.55
illions
-0.55
oola
-0.54
ovie
-0.53
rative
-0.53
POSITIVE LOGITS
aternity
0.61
heit
0.56
pert
0.56
hers
0.55
ngth
0.54
beard
0.54
clave
0.53
ascript
0.53
cko
0.53
pants
0.52
Activations Density 1.519%