INDEX
Explanations
the concept of friendship or companionship
Head Attr Weights
0:0.08
1:0.07
2:0.06
3:0.08
4:0.08
5:0.07
6:0.09
7:0.08
8:0.08
9:0.09
10:0.09
11:0.08
Negative Logits
eus:-2.90
acan:-2.86
Niet:-2.84
Beaver:-2.83
Oregon:-2.80
Chero:-2.78
Ducks:-2.77
zu:-2.75
Hai:-2.75
zees:-2.71
Positive Logits
designer:2.83
costume:2.76
lig:2.66
costumes:2.63
Shape:2.63
figure:2.60
:2.57
iatures:2.54
HM:2.54
organis:2.53
Activations Density 0.000%