INDEX
Explanations
references to the city of Tokyo
mentions of the city Tokyo
Negative Logits
lv      -0.79
apple   -0.77
lain    -0.72
onies   -0.71
ebook   -0.70
BOOK    -0.69
hemy    -0.69
о       -0.67
estern  -0.67
mble    -0.66
Positive Logits
Lumpur  0.88
Babel   0.83
Tokyo   0.80
ichi    0.78
amaru   0.73
Xan     0.72
eln     0.70
yo      0.70
ikawa   0.70
iji     0.70
Activations Density: 0.006%