INDEX
Explanations
words related to a specific place or person
proper nouns referring to people and places
New Auto-Interp
Negative Logits
gal
-0.68
caution
-0.66
Juno
-0.65
ibaba
-0.57
sow
-0.56
peg
-0.55
scrap
-0.55
reel
-0.54
¯
-0.54
impulse
-0.53
POSITIVE LOGITS
ulhu
1.19
pillar
1.03
berus
1.00
ortium
0.92
emporary
0.86
etary
0.85
igham
0.85
aminer
0.79
estial
0.78
cled
0.78
Activations Density 0.093%