INDEX
Explanations
proper nouns or names of places and entities
instances of the word "NEW" followed by various numerical values and contexts
New Auto-Interp
Negative Logits
ppe
-0.87
stood
-0.76
mop
-0.69
76561
-0.67
minecraft
-0.64
arching
-0.64
osate
-0.64
ãĤ¼
-0.63
agn
-0.63
Reloaded
-0.61
POSITIVE LOGITS
YORK
1.33
foundland
1.19
bie
1.03
PORT
0.96
York
0.93
Orleans
0.91
ARK
0.89
bies
0.89
Zealand
0.87
CAST
0.84
Activations Density 0.010%