INDEX
Explanations
names and locations
proper nouns, particularly names and locations
New Auto-Interp
Negative Logits
drawn
-0.79
birth
-0.70
necessities
-0.67
mable
-0.67
drivers
-0.65
spring
-0.65
circ
-0.64
master
-0.63
direction
-0.62
going
-0.62
POSITIVE LOGITS
oglu
0.95
pta
0.92
lde
0.89
zyk
0.88
ón
0.84
ère
0.79
cin
0.79
aja
0.78
aser
0.78
fer
0.78
Activations Density 0.252%