INDEX
Explanations
proper nouns
instances of the name "Val" or variations of it
New Auto-Interp
Negative Logits
pread
-0.91
oven
-0.71
ģĸ
-0.67
EntityItem
-0.64
wide
-0.63
ship
-0.62
ELF
-0.60
Employ
-0.60
steps
-0.58
omes
-0.58
POSITIVE LOGITS
idation
1.40
idated
1.24
ueless
1.14
ibr
1.13
idity
1.10
idate
1.09
uable
1.06
uations
1.04
uation
1.02
encia
0.97
Activations Density 0.024%