INDEX
Explanations
references to modes of transportation, specifically trains
references to trains
Negative Logits
osi      -0.70
Tablet   -0.69
Sphere   -0.68
erenn    -0.67
Reach    -0.62
pop      -0.62
Prairie  -0.60
cised    -0.60
alien    -0.58
metics   -0.58
Positive Logits
wreck      1.28
roads      1.02
loads      1.00
conductor  1.00
ees        0.97
cars       0.92
wreck      0.89
cars       0.88
locom      0.85
trains     0.85
Activations Density: 0.033%