INDEX
Explanations
prepositions and relational phrases indicating movement or connections
New Auto-Interp
Head Attr Weights
0:0.02
1:0.01
2:0.07
3:0.04
4:0.07
5:0.02
6:0.04
7:0.39
8:0.03
9:0.03
10:0.17
11:0.06
Negative Logits
natureconservancy
-1.64
deteriorated
-1.55
redund
-1.54
strain
-1.53
degraded
-1.49
bonded
-1.48
stains
-1.47
humidity
-1.44
corrosion
-1.44
oxide
-1.42
POSITIVE LOGITS
1.80
uthor
1.75
PsyNetMessage
1.73
answ
1.70
jab
1.66
ilege
1.56
hello
1.56
Kare
1.55
umsy
1.52
iphany
1.50
Activations Density 0.006%