Explanations
occurrences of the word "York."
Negative Logits
Bron        -0.15
Vern        -0.15
Terminal    -0.14
otts        -0.14
jem         -0.14
ernals      -0.14
diffusion   -0.14
apr         -0.14
tam         -0.14
tit         -0.14
Positive Logits
minster     0.17
hire        0.16
town        0.16
ton         0.16
isté        0.16
tone        0.16
ommen       0.16
lyn         0.15
Ñīе         0.15
923         0.15
Activations Density: 0.005%