INDEX
Explanations
proper nouns appearing in sentences
Negative Logits
ebook      -0.95
arian      -0.94
arians     -0.92
ional      -0.90
expr       -0.86
onal       -0.81
ocre       -0.81
oral       -0.80
unity      -0.80
atories    -0.77
Positive Logits
Mond       1.16
Reed       0.97
Winc       0.90
Sob        0.87
Payton     0.85
Cron       0.84
Isaac      0.83
Walter     0.80
Ley        0.79
Io         0.77
Activations Density 0.013%