Explanations
references to paths or routes in textual descriptions
Negative Logits
Token      Logit
ſtate      -0.88
purpoſe    -0.83
noft       -0.78
Majefty    -0.77
gants      -0.77
ſmall      -0.77
ſta        -0.75
Efq        -0.74
Conſ       -0.74
perſon     -0.73
Positive Logits
Token      Logit
camino     0.88
journey    0.76
journey    0.67
way        0.65
drodze     0.65
along      0.64
דרך        0.64
cesty      0.64
caminho    0.63
menuju     0.62
Activations Density: 0.111%
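
The density figure can be read as the share of token positions on which the feature fires. The sketch below illustrates that reading under stated assumptions: the function name `activation_density`, the `threshold` parameter, and the toy data are hypothetical, and the page itself does not say how its figure was computed.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of token positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of positions where the feature
    is active at all. The source page does not define the term.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy data: 1 active position out of 8.
sample = [0.0, 0.0, 0.0, 2.5, 0.0, 0.0, 0.0, 0.0]
print(f"{activation_density(sample):.3%}")  # prints 12.500%
```

Under this reading, a density of 0.111% would correspond to roughly 1 active position per 900 tokens.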