INDEX
Explanations
phrases related to travel and exploration
references to travel and movement
New Auto-Interp
Negative Logits
uracy
-0.68
FORE
-0.62
Emer
-0.60
occupant
-0.58
moil
-0.57
rets
-0.57
Leave
-0.57
ukes
-0.56
inherit
-0.56
buster
-0.56
POSITIVE LOGITS
corridors
1.12
stairs
1.02
treacherous
1.01
halls
0.98
distances
0.94
winding
0.92
lengths
0.90
streets
0.90
perilous
0.87
rooft
0.86
Activations Density 0.301%