INDEX
Explanations
repeated prepositional phrases indicating relationships or belonging
New Auto-Interp
Head Attr Weights
0:0.03
1:0.02
2:0.07
3:0.08
4:0.17
5:0.03
6:0.02
7:0.34
8:0.03
9:0.04
10:0.06
11:0.06
Negative Logits
uez
-1.58
daq
-1.57
Tsukuyomi
-1.54
ebook
-1.50
ebted
-1.46
hypot
-1.42
achus
-1.37
quished
-1.36
freely
-1.34
wered
-1.33
POSITIVE LOGITS
burgh
1.33
toughness
1.30
sophistication
1.29
Hack
1.28
Colour
1.28
refinement
1.28
Studio
1.28
engagement
1.26
rity
1.25
professionalism
1.23
Activations Density 0.001%