INDEX
Explanations
references to newness or novelty
references to the concept of being "new."
Negative Logits
acebook    -0.81
aminer     -0.81
iage       -0.74
istine     -0.72
AFTA       -0.70
itta       -0.69
ammers     -0.69
UMP        -0.67
OPA        -0.66
icone      -0.66
Positive Logits
bie         1.42
bies        1.41
Zealand     1.10
foundland   0.98
egg         0.85
born        0.85
arrivals    0.85
Orleans     0.80
Testament   0.80
found       0.80
Activations Density: 0.114%