INDEX
Explanations
phrases related to traveling, specifically trips to different locations
instances of the word "to" indicating travel or destination
Negative Logits
tons          -0.80
rated         -0.73
going         -0.72
standing      -0.71
transmitted   -0.68
piv           -0.67
indicators    -0.67
achy          -0.65
flowed        -0.64
handled       -0.64
Positive Logits
meet          0.89
Uganda        0.87
Disneyland    0.87
Osaka         0.86
ascus         0.86
Liberia       0.84
Thailand      0.83
Mongolia      0.82
Ethiopia      0.82
Omaha         0.81
Activations Density: 0.138%