INDEX
Explanations
proper nouns and names in various contexts
New Auto-Interp
Head Attr Weights
0:0.06
1:0.04
2:0.02
3:0.02
4:0.02
5:0.54
6:0.01
7:0.01
8:0.06
9:0.08
10:0.06
11:0.02
Negative Logits
antha
-1.50
petition
-1.49
semen
-1.49
iable
-1.46
ITE
-1.46
estimating
-1.46
lease
-1.40
requesting
-1.39
UU
-1.38
ITS
-1.36
POSITIVE LOGITS
Unloaded
1.81
aida
1.76
/*
1.75
UCHIJ
1.74
Annotations
1.72
Flavoring
1.66
perture
1.65
emouth
1.61
ora
1.61
heid
1.61
Activations Density 0.914%