Explanations
references to train stations or train-related activities
mentions of trains and train-related topics
Negative Logits
Token          Logit
erenn          -0.78
FACE           -0.70
eff            -0.68
Alc            -0.65
berra          -0.64
presumptive    -0.64
Ĥİ             -0.62
Mortal         -0.60
null           -0.59
etheless       -0.59
Positive Logits
Token          Logit
Train           1.05
train           1.03
wreck           1.01
trains          0.97
girls           0.89
roads           0.89
tracks          0.85
Train           0.84
ement           0.83
train           0.83
Activations Density: 0.011%