Explanations
proper nouns, especially names
Negative Logits
Token        Logit
TestFixture  -0.17
aras         -0.16
lette        -0.15
HD           -0.15
tails        -0.15
ifu          -0.14
ngh          -0.14
iyon         -0.14
associ       -0.14
Nit          -0.13
Positive Logits
Token        Logit
Ph           0.30
ph           0.30
Ph           0.23
/ph          0.22
(ph          0.19
ippines      0.18
ph           0.18
adelphia     0.16
-ph          0.16
.ph          0.15
Activations Density: 0.039%