INDEX
Explanations
mentions of travel-related destinations and attractions
New Auto-Interp
Negative Logits
outs
-0.20
enda
-0.19
tures
-0.16
uya
-0.16
Ñģли
-0.15
lew
-0.15
adows
-0.15
aker
-0.15
ayan
-0.15
leys
-0.15
POSITIVE LOGITS
/source
0.20
inations
0.20
à¸Ĺาà¸ĩ
0.19
ì§Ģ를
0.18
/target
0.17
(destination
0.17
é»ŀ
0.16
Ãłng
0.16
ì§Ģ
0.15
ì§Ģê°Ģ
0.15
Activations Density 0.014%