INDEX
Explanations
instances of competition or rivalry
New Auto-Interp
Head Attr Weights
0:0.03
1:0.04
2:0.07
3:0.06
4:0.12
5:0.04
6:0.08
7:0.27
8:0.03
9:0.05
10:0.11
11:0.05
Negative Logits
mosquit
-1.55
soType
-1.52
precaution
-1.50
joy
-1.49
forgiven
-1.43
RIP
-1.42
inventoryQuantity
-1.42
regrets
-1.40
tragedies
-1.40
advis
-1.36
POSITIVE LOGITS
Flavoring
1.54
negotiator
1.53
constitu
1.45
Survive
1.42
reproduction
1.41
ois
1.40
Occupations
1.40
oux
1.39
Breed
1.39
English
1.37
Activations Density 0.000%