INDEX
Explanations
comparative phrases that describe relationships or similarities
New Auto-Interp
Head Attr Weights
0:0.01
1:0.01
2:0.08
3:0.05
4:0.16
5:0.02
6:0.03
7:0.36
8:0.02
9:0.02
10:0.05
11:0.12
Negative Logits
loop
-1.75
MRI
-1.50
lear
-1.46
escal
-1.43
eca
-1.39
raints
-1.34
Loop
-1.33
iat
-1.32
hips
-1.30
Sting
-1.28
POSITIVE LOGITS
Medals
1.87
senal
1.61
Winning
1.50
verages
1.47
unbeliev
1.43
Defeat
1.42
Median
1.41
TOTAL
1.40
DragonMagazine
1.40
rarity
1.36
Activations Density 0.002%