INDEX
Explanations
occurrences of the word "travel" and its variations
New Auto-Interp
Negative Logits
edly
-0.17
imple
-0.16
ario
-0.15
ymous
-0.15
theid
-0.15
enos
-0.15
ика
-0.14
chamber
-0.14
yne
-0.14
aren
-0.14
POSITIVE LOGITS
elling
0.29
esty
0.27
eller
0.27
olta
0.24
ellers
0.23
ails
0.21
AILS
0.20
ertino
0.19
anc
0.19
esti
0.18
Activations Density 0.005%