INDEX
Explanations
names of politicians and universities
abbreviated names and acronyms related to organizations or entities
Negative Logits
Token           Logit
thood           -0.68
DragonMagazine  -0.67
Carnage         -0.65
Paras           -0.63
Canaver         -0.63
Carbuncle       -0.61
xual            -0.61
Aph             -0.60
repatri         -0.58
Kubrick         -0.58
Positive Logits
Token           Logit
nesota          1.12
erville         1.11
adelphia        1.11
neapolis        1.03
mington         1.02
esville         1.02
ansas           1.00
achusetts       0.99
anooga          0.99
aukee           0.97
Activations Density: 0.293%