INDEX
Explanations
references to the word "New" in various contexts
"New" followed by a location
new place names
New Auto-Interp
Negative Logits
/*
-0.62
ymce
-0.57
Bluffs
-0.55
aanwezig
-0.54
ftagPool
-0.54
ModelExpression
-0.53
estors
-0.53
PRD
-0.53
よいよ
-0.51
civiles
-0.51
POSITIVE LOGITS
tonsoft
1.01
york
1.00
fang
0.94
York
0.92
sworth
0.91
york
0.90
Zealand
0.88
York
0.79
zealand
0.79
sprint
0.77
Activations Density 0.163%