INDEX
Explanations
words related to geographical locations, particularly countries
proper nouns, particularly names of locations and geographical entities
New Auto-Interp
Negative Logits
cube
-0.71
kid
-0.71
bread
-0.69
worn
-0.65
Ö¼
-0.64
iT
-0.63
better
-0.62
starter
-0.62
sheet
-0.61
天
-0.61
POSITIVE LOGITS
ð
1.09
ñ
0.91
veland
0.89
velength
0.84
cci
0.84
veyard
0.82
ignt
0.82
ÄŁ
0.79
zza
0.79
zzi
0.79
Activations Density 0.012%