INDEX
Explanations
phrases that describe fundamental concepts or qualities
Head Attr Weights
0:0.02
1:0.01
2:0.08
3:0.06
4:0.08
5:0.02
6:0.04
7:0.38
8:0.03
9:0.04
10:0.13
11:0.07
Negative Logits
records       -1.65
logs          -1.58
usercontent   -1.52
depos         -1.51
ebook         -1.45
receipts      -1.43
batches       -1.39
lists         -1.39
recordings    -1.35
polls         -1.35
Positive Logits
Concepts      1.79
envis         1.55
concepts      1.46
Struct        1.44
Concept       1.43
framework     1.41
Tradable      1.39
concept       1.38
Turing        1.38
paren         1.38
Activations Density 0.006%