INDEX
Explanations
references to transportation systems and infrastructure
New Auto-Interp
Negative Logits
OLON
-0.16
966
-0.16
rito
-0.16
Pipes
-0.16
wing
-0.15
Roads
-0.14
comed
-0.14
ëĤ
-0.14
Streets
-0.14
olon
-0.14
POSITIVE LOGITS
train
0.43
Train
0.43
trains
0.42
Train
0.39
train
0.33
rail
0.33
Rail
0.32
_train
0.31
railway
0.30
.train
0.30
Activations Density 0.209%