INDEX
Explanations
famous places or events related to countries
New Auto-Interp
Negative Logits
orc
-0.86
hon
-0.84
phen
-0.83
cot
-0.82
are
-0.82
lon
-0.80
atl
-0.79
bre
-0.79
elf
-0.77
nai
-0.77
POSITIVE LOGITS
Images
1.15
Album
0.94
UTERS
0.92
Caption
0.92
Pool
0.87
Photos
0.85
Journals
0.84
Gallery
0.84
Citation
0.84
Photo
0.82
Activations Density 0.051%