INDEX
Explanations
phrases that indicate ownership or possession and the word "the"
New Auto-Interp
Head Attr Weights
0:0.02
1:0.02
2:0.12
3:0.04
4:0.17
5:0.02
6:0.25
7:0.13
8:0.03
9:0.04
10:0.05
11:0.04
Negative Logits
ennes
-1.77
hene
-1.71
ryu
-1.53
thood
-1.45
anu
-1.43
suburbs
-1.42
croft
-1.41
ularity
-1.40
artisan
-1.39
sburg
-1.35
POSITIVE LOGITS
)</
1.61
ジ
1.44
</
1.42
Sah
1.41
[&
1.34
Shop
1.31
courtesy
1.31
supplied
1.31
February
1.30
<<
1.26
Activations Density 0.001%