INDEX
Explanations
Morphological patterns in word formation, particularly prefixes and suffixes
Head Attr Weights
0:0.04
1:0.02
2:0.04
3:0.04
4:0.04
5:0.02
6:0.56
7:0.03
8:0.03
9:0.04
10:0.05
11:0.03
Negative Logits
independents  -1.33
accuser       -1.32
Ruk           -1.27
gat           -1.24
Hank          -1.20
edges         -1.18
puppies       -1.15
dogs          -1.14
anchester     -1.13
Topics        -1.12
Positive Logits
ioxide    1.56
ophen     1.49
arte      1.41
odor      1.33
etric     1.27
ão        1.27
nell      1.27
insula    1.25
Entered   1.25
lished    1.25
Activations Density 0.009%