INDEX
Explanations
locations or addresses
proper nouns, specifically names of places and possibly dates
New Auto-Interp
Negative Logits
environment
-0.80
Spoiler
-0.68
advertisement
-0.64
behavior
-0.62
develop
-0.61
environments
-0.60
behav
-0.60
anew
-0.60
causation
-0.59
supervision
-0.58
POSITIVE LOGITS
Ave
1.14
Blvd
1.07
Rd
1.07
Ct
0.86
illion
0.86
acres
0.85
Points
0.85
hrs
0.82
STATES
0.78
Tickets
0.78
Activations Density 0.160%