INDEX
Explanations
References to the concept of "fabric" in various contexts, especially in relation to society and stability.
Head Attribution Weights (head:weight)
0:0.02
1:0.02
2:0.07
3:0.05
4:0.11
5:0.03
6:0.04
7:0.43
8:0.02
9:0.03
10:0.05
11:0.07
Negative Logits
Token        Logit
verbs        -1.85
zech         -1.69
baugh        -1.64
odan         -1.61
phies        -1.58
ritic        -1.58
wm           -1.55
issues       -1.53
pause        -1.52
olesterol    -1.52
Positive Logits
Token             Logit
fabric            1.95
Shirt             1.72
alliances         1.58
���               1.57
subcontract       1.50
universe          1.48
representation    1.47
proport           1.46
alliance          1.45
caricature        1.44
Activations Density: 0.002%