INDEX
Explanations
words and phrases related to travel and exploration experiences
Negative Logits
AMI      -0.16
kola     -0.16
-gnu     -0.15
ities    -0.14
iyat     -0.14
eated    -0.14
aight    -0.14
chw      -0.14
едж      -0.14
cheng    -0.14
Positive Logits
lasting      0.23
into         0.20
lasting      0.19
into         0.18
king         0.17
undertaken   0.16
ogue         0.16
gone         0.15
Gone         0.15
gone         0.15
Activations Density 0.119%