INDEX
Explanations
references to quantity and abundance in a context related to capabilities and attributes
New Auto-Interp
Head Attr Weights
0:0.15
1:0.07
2:0.17
3:0.04
4:0.05
5:0.06
6:0.02
7:0.03
8:0.18
9:0.05
10:0.04
11:0.08
Negative Logits
stuff
-2.06
levant
-1.91
*/(
-1.88
selves
-1.86
oples
-1.68
illac
-1.66
anyways
-1.63
respectively
-1.57
sters
-1.57
Stuff
-1.56
POSITIVE LOGITS
understatement
2.11
avoidance
1.96
blindness
1.90
indifference
1.80
absence
1.79
reluctance
1.78
amplification
1.75
resemblance
1.75
ハ
1.74
rejection
1.69
Activations Density 0.008%