INDEX
Explanations
proper nouns, specifically names of individuals
proper nouns, particularly names and places
Negative Logits
TTL         -0.76
DNA         -0.75
igers       -0.69
CAP         -0.67
Ocean       -0.66
Population  -0.65
ULT         -0.65
tainment    -0.64
CEPT        -0.63
1600        -0.63
Positive Logits
bery         1.07
bling        1.02
artisan      0.98
wich         0.98
etooth       0.95
ij           0.90
hari         0.90
ı            0.90
arest        0.88
bles         0.88
Activations Density 0.029%