INDEX
Explanations
references to historical events or technological advancements
New Auto-Interp
Head Attr Weights
0:0.04
1:0.04
2:0.11
3:0.08
4:0.18
5:0.08
6:0.03
7:0.03
8:0.10
9:0.17
10:0.05
11:0.02
Negative Logits
redirected
-1.26
anamo
-1.21
etting
-1.20
Lumpur
-1.20
channelAvailability
-1.19
EStream
-1.19
ansk
-1.18
DPR
-1.16
jri
-1.16
abee
-1.14
POSITIVE LOGITS
staple
1.27
cuisine
1.27
imum
1.25
Upgrade
1.24
appet
1.23
Crusher
1.18
stay
1.18
Cruiser
1.17
sust
1.16
rium
1.16
Activations Density 0.003%