INDEX
Explanations
instances of physical actions or movements
New Auto-Interp
Head Attr Weights
0:0.01
1:0.01
2:0.06
3:0.04
4:0.07
5:0.02
6:0.08
7:0.48
8:0.03
9:0.03
10:0.06
11:0.06
Negative Logits
warning
-1.72
paren
-1.61
calling
-1.55
Office
-1.55
spection
-1.48
omnia
-1.47
pection
-1.43
lance
-1.42
fax
-1.40
nesday
-1.39
POSITIVE LOGITS
heap
1.73
rabbits
1.70
rabbit
1.63
pile
1.62
ranks
1.60
bushes
1.58
basket
1.56
piles
1.55
spiral
1.54
cliffs
1.52
Activations Density 0.021%