INDEX
Explanations
words related to trains and training
references to the word "train" in various contexts
New Auto-Interp
Negative Logits
uid
-0.75
hed
-0.74
hedral
-0.69
hern
-0.67
hedon
-0.66
Fernandez
-0.66
Gawker
-0.65
cens
-0.63
Wid
-0.62
arag
-0.62
POSITIVE LOGITS
train
3.82
trains
2.87
Train
2.81
train
2.70
Train
2.55
Amtrak
1.62
railway
1.60
training
1.55
trained
1.54
railroad
1.51
Activations Density 0.013%