INDEX
Explanations
references to research findings and statistical analysis
New Auto-Interp
Head Attr Weights
0:0.03
1:0.02
2:0.04
3:0.14
4:0.02
5:0.06
6:0.01
7:0.04
8:0.02
9:0.02
10:0.53
11:0.02
Negative Logits
oller
-1.79
attRot
-1.77
$.
-1.72
transpired
-1.68
Became
-1.67
iasm
-1.66
Suppose
-1.65
edition
-1.63
Writer
-1.58
IDER
-1.55
POSITIVE LOGITS
differently
3.21
faster
3.18
twice
3.13
sparing
3.03
separately
3.02
concurrently
3.01
continuously
2.92
quicker
2.90
inconsist
2.90
simultaneously
2.88
Activations Density 1.741%