INDEX
Explanations
words related to physical journeys or trips
references to journeys or travels
New Auto-Interp
Negative Logits
cand
-0.72
somet
-0.69
latent
-0.68
slightest
-0.66
ocr
-0.65
¢
-0.65
contents
-0.63
candy
-0.63
crystals
-0.63
positives
-0.63
POSITIVE LOGITS
ashore
0.84
commute
0.84
east
0.82
abroad
0.81
between
0.81
convoy
0.80
cffffcc
0.79
west
0.78
voyage
0.78
Afric
0.78
Activations Density 0.114%