INDEX
Explanations
phrases indicating adaptation or adjustment to new situations
New Auto-Interp
Head Attr Weights
0:0.04
1:0.02
2:0.07
3:0.22
4:0.02
5:0.08
6:0.02
7:0.06
8:0.04
9:0.02
10:0.34
11:0.02
Negative Logits
))
-2.28
)))
-2.20
contributed
-2.14
))
-2.13
avia
-2.10
)))
-2.08
))))
-2.07
})
-2.07
CrossRef
-2.06
Contribut
-2.02
POSITIVE LOGITS
colder
2.65
quicker
2.55
slower
2.50
harsher
2.49
quieter
2.48
warmer
2.39
smoother
2.32
surroundings
2.23
environments
2.22
glare
2.08
Activations Density 0.031%